Analysis on Network Clustering Algorithm of Data Mining Methods Based on Rough Set Theory

Rough Set Theory (RST), since its introduction in Pawlak (1982), continues to develop as an effective tool in data mining. Within a set theoretical structure, its remit is closely concerned with the classification of objects to decision attribute values, based on their description by a number of condition attributes. With regards to RST, this classification is through the construction of ‘if .. then ..’ decision rules. The development of RST has been in many directions, amongst the earliest was with the allowance for miss-classification in the constructed decision rules, namely the Variable Precision Rough Sets model (VPRS) (Ziarko, 1993), the recent references for this include; Beynon (2001), Mi et al. (2004), and Slezak and Ziarko (2005). Further developments of RST have included; its operation within a fuzzy environment (Greco et al., 2006), and using a dominance relation based approach (Greco et al., 2004). The regular major international conferences of ‘International Conference on Rough Sets and Current Trends in Computing’ (RSCTC, 2004) and ‘International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing’ (RSFDGrC, 2005) continue to include RST research covering the varying directions of its development. This is true also for the associated book series entitled ‘Transactions on Rough Sets’ (Peters and Skowron, 2005), which further includes doctoral theses on this subject. What is true, is that RST is still evolving, with the eclectic attitude to its development meaning that the definitive concomitant RST data mining techniques are still to be realised. Grzymala-Busse and Ziarko (2000), in a defence of RST, discussed a number of points relevant to data mining, and also made comparisons between RST and other techniques. Within the area of data mining and the desire to identify relationships between condition attributes, the effectiveness of RST is particularly pertinent due to the inherent intent within RST type methodologies for data reduction and feature selection (Jensen and Shen, 2005). That is, subsets of condition attributes identified that perform the same role as all the condition attributes in a considered data set (termed ß-reducts in VPRS, see later). Chen (2001) addresses this, when discussing the original RST, they state it follows a reductionist approach and is lenient to inconsistent data (contradicting condition attributes - one aspect of underlying uncertainty). This encyclopaedia article describes and demonstrates the practical application of a RST type methodology in data mining, namely VPRS, using nascent software initially described in Griffiths and Beynon (2005). The use of VPRS, through its relative simplistic structure, outlines many of the rudiments of RST based methodologies. The software utilised is oriented towards ‘hands on’ data mining, with graphs presented that clearly elucidate ‘veins’ of possible information identified from ß-reducts, over different allowed levels of missclassification associated with the constructed decision rules (Beynon and Griffiths, 2004). Further findings are briefly reported when undertaking VPRS in a resampling environment, with leave-one-out and bootstrapping approaches adopted (Wisnowski et al., 2003). The importance of these results is in the identification of the more influential condition attributes, pertinent to accruing the most effective data mining results.

Download Full-text

Intrusion Detection Using Modern Techniques

Intelligent Information Technologies ◽

10.4018/978-1-59904-941-0.ch012 ◽

2011 ◽

pp. 259-273

Author(s):

Tarum Bhaskar ◽

Narasimha Kamath B.

Keyword(s):

Neural Network ◽

Data Mining ◽

Artificial Neural Network ◽

Intrusion Detection ◽

Set Theory ◽

Rough Set ◽

Intrusion Detection System ◽

Rough Set Theory ◽

Detection System ◽

Mining Tools

Intrusion detection system (IDS) is now becoming an integral part of the network security infrastructure. Data mining tools are widely used for developing an IDS. However, this requires an ability to find the mapping from the input space to the output space with the help of available data. Rough sets and neural networks are the best known data mining tools to analyze data and help solve this problem. This chapter proposes a novel hybrid method to integrate rough set theory, genetic algorithm (GA), and artificial neural network. Our method consists of two stages: First, rough set theory is applied to find the reduced dataset. Second, the results are used as inputs for the neural network, where a GA-based learning approach is used to train the intrusion detection system. The method is characterized not only by using attribute reduction as a pre-processing technique of an artificial neural network but also by an improved learning algorithm. The effectiveness of the proposed method is demonstrated on the KDD cup data.

Download Full-text

Distributed Data Mining for Modeling and Prediction of Skin Condition in Cosmetic Industry—A Rough Set Theory Approach

Application of Computational Intelligence to Biology - SpringerBriefs in Applied Sciences and Technology ◽

10.1007/978-981-10-0391-2_9 ◽

2016 ◽

pp. 87-102

Author(s):

P. M. Prasuna ◽

Y. Ramadevi ◽

A. Vinay Babu

Keyword(s):

Data Mining ◽

Set Theory ◽

Rough Set ◽

Rough Set Theory ◽

Skin Condition ◽

Distributed Data Mining ◽

Distributed Data ◽

Theory Approach ◽

Modeling And Prediction ◽

Cosmetic Industry

Download Full-text

Fuzzy Miner

Mathematical Methods for Knowledge Discovery and Data Mining ◽

10.4018/978-1-59904-528-3.ch018 ◽

2011 ◽

pp. 299-321 ◽

Cited By ~ 1

Author(s):

Nikos Pelekis ◽

Babis Theodoulidis ◽

Ioannis Kopanakis ◽

Yannis Theodoridis

Keyword(s):

Data Mining ◽

Set Theory ◽

Rough Set ◽

Rough Set Theory ◽

Attribute Selection ◽

Internet Routing ◽

Data Mining Approach ◽

New Information

QOSP Quality of Service Open Shortest Path First based on QoS routing has been recognized as a missing piece in the evolution of QoS-based services in the Internet. Data mining has emerged as a tool for data analysis, discovery of new information, and autonomous decision-making. This paper focuses on routing algorithms and their appli-cations for computing QoS routes in OSPF protocol. The proposed approach is based on a data mining approach using rough set theory, for which the attribute-value system about links of networks is created from network topology. Rough set theory offers a knowledge discovery approach to extracting routing-decisions from attribute set. The extracted rules can then be used to select significant routing-attributes and make routing-selections in routers. A case study is conducted to demonstrate that rough set theory is effective in finding the most significant attribute set. It is shown that the algorithm based on data mining and rough set offers a promising approach to the attribute-selection prob-lem in internet routing.

Download Full-text

Application of Rough Set Theory to Data Mining of Condenser Diagnosis in Power Plants

2003 International Joint Power Generation Conference ◽

10.1115/ijpgc2003-40135 ◽

2003 ◽

Author(s):

Zhongguang Fu ◽

Tao Jin ◽

Kun Yang

Keyword(s):

Data Mining ◽

Set Theory ◽

Rough Set ◽

Power Plants ◽

Rough Set Theory ◽

A Priori ◽

Reduction Algorithm ◽

Useful Knowledge ◽

Fault Features ◽

Basic Concepts

Rough set theory is a powerful tool in deal with vagueness and uncertainty. It is particularly suitable to discover hidden and potentially useful knowledge in data and can be used to reduce features and extract rules. This paper introduces the basic concepts and fundamental elements of the rough set theory. A reduction algorithm that integrates a priori with significance is proposed to illustrate how the rough set theory could be used to extract fault features of the condenser in a power plant. Two testing examples are then presented to demonstrate the effectiveness of the theory in fault diagnosis.

Download Full-text

The Application of Data Mining in the Civil Aviation Accident Analysis

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.241-244.3000 ◽

2012 ◽

Vol 241-244 ◽

pp. 3000-3004

Author(s):

Dai Wu Zhu ◽

Yin Ni

Keyword(s):

Data Mining ◽

Set Theory ◽

Rough Set ◽

Rough Set Theory ◽

Attribute Reduction ◽

Passive State ◽

Civil Aviation ◽

Accident Analysis ◽

Analysis Method ◽

Accident Prediction

At present, our analysis of the aviation accident mainly limited to the methods of mathematical statistics, the analysis method means of a single, and in a passive state, so the accident prediction is poor. This paper, basis on the rough set theory in data mining and preferential information ,we improve the rough set attribute reduction algorithm, and applied to civil aviation accident analysis to indentify the potential law of accident.

Download Full-text

Financial Distress Study Based on PSO K-Means Clustering Algorithm and Rough Set Theory

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.411-414.2377 ◽

2013 ◽

Vol 411-414 ◽

pp. 2377-2383 ◽

Cited By ~ 1

Author(s):

Peng Wu ◽

Cheng Liu

Keyword(s):

Decision Making ◽

Set Theory ◽

Rough Set ◽

Financial Distress ◽

Clustering Algorithm ◽

Rough Set Theory ◽

Financial Indicators ◽

Financial Status ◽

Level Data ◽

Research Objects

The traditional financial distress method normally divided samples into two categories by healthy and bankruptcy. And the financial indicators are typically chosen without using a systematic and reasonable theory. To be more realistic, this paper selected all the companies in a certain industry as the research objects. Twenty-one financial indicators were primarily chosen as the condition attributes, reduction set was obtained by matrix reduction identification based on rough set theory. Then PSO-based clustering algorithm K-means was used to divide subjects into 5 categories of different financial status. The decision-making table was formed with the reduction set using the classification as a decision attribute. Finally, we tested the reasonableness of the classification and generated early warning rules together with rough set theory to evaluate the financial status of listed companies. The results showed that PSO-based K-means algorithm was able to reasonably classify companies, at the same time to overcome the subjective impacts in the artificial measure of financial crisis level. Data generated using this method agreed with the rough set theory for up to 87.0%, thus proving this method to be effective and feasible.

Download Full-text

Research on the Novel Weighted Fuzzy Clustering Algorithm based on Fuzzy Sets and Rough Set Theory

Proceedings of the 2015 Conference on Informatization in Education, Management and Business ◽

10.2991/iemb-15.2015.2 ◽

2015 ◽

Author(s):

Chen Liwei

Keyword(s):

Fuzzy Sets ◽

Set Theory ◽

Rough Set ◽

Fuzzy Clustering ◽

Clustering Algorithm ◽

Rough Set Theory ◽

The Novel ◽

Fuzzy Clustering Algorithm

Download Full-text

Topological properties of a pair of relation-based approximation operators

Filomat ◽

10.2298/fil1719175z ◽

2017 ◽

Vol 31 (19) ◽

pp. 6175-6183

Author(s):

Yan-Lan Zhang ◽

Chang-Qing Li

Keyword(s):

Data Mining ◽

Set Theory ◽

Rough Set ◽

Rough Set Theory ◽

Equivalence Relations ◽

Topological Properties ◽

Upper Approximation ◽

Approximation Operators ◽

Rough Approximation ◽

Basic Concepts

Rough set theory is an important tool for data mining. Lower and upper approximation operators are two important basic concepts in the rough set theory. The classical Pawlak rough approximation operators are based on equivalence relations and have been extended to relation-based generalized rough approximation operators. This paper presents topological properties of a pair of relation-based generalized rough approximation operators. A topology is induced by the pair of generalized rough approximation operators from an inverse serial relation. Then, connectedness, countability, separation property and Lindel?f property of the topological space are discussed. The results are not only beneficial to obtain more properties of the pair of approximation operators, but also have theoretical and actual significance to general topology.

Download Full-text

Analysis on Network Clustering Algorithm of Data Mining Methods Based on Rough Set Theory

The Study on Data Mining Methods Based on Rough Set Theory and CART for Incomplete Data

Information Veins and Resampling with Rough Set Theory

Intrusion Detection Using Modern Techniques

Distributed Data Mining for Modeling and Prediction of Skin Condition in Cosmetic Industry—A Rough Set Theory Approach

Fuzzy Miner

Application of Rough Set Theory to Data Mining of Condenser Diagnosis in Power Plants

The Application of Data Mining in the Civil Aviation Accident Analysis

Financial Distress Study Based on PSO K-Means Clustering Algorithm and Rough Set Theory

Research on the Novel Weighted Fuzzy Clustering Algorithm based on Fuzzy Sets and Rough Set Theory

Topological properties of a pair of relation-based approximation operators

Export Citation Format