Cluster Analysis Using Rough Clustering and k-Means Clustering

Encyclopedia of Information Science and Technology, Second Edition

10.4018/978-1-60566-026-4.ch091

2011

pp. 561-565

Author(s):

Kevin E. Voges

Keyword(s):

Cluster Analysis

Computational Intelligence

Rough Sets

Information Science

Clustering Algorithms

Clustering Methods

Rough Sets Theory

Clustering Problem

Data Reduction Technique

Sets Theory

Cluster analysis is a fundamental data reduction technique used in the physical and social sciences. It is of potential interest to managers in information science, as it can be used to identify user needs through segmenting users such as Web site visitors. In addition, the theory of rough sets is the subject of intense interest in computational intelligence research. The extension of this theory into rough clustering provides an important and potentially useful addition to the range of cluster analysis techniques available to the manager. Cluster analysis is defined as the grouping of “individuals or objects into clusters so that objects in the same cluster are more similar to one another than they are to objects in other clusters” (Hair, Black, Babin, Anderson, & Tatham, 2006). There are a number of comprehensive introductions to cluster analysis (Abonyi & Feil, 2007; Arabie, Hubert, & De Soete, 1994; Cramer, 2003; Everitt, Landau, & Leese, 2001; Gan, Ma, & Wu, 2007; Härdle & Hlávka, 2007). Techniques are often classified as hierarchical or nonhierarchical (Hair et al., 2006), and the most commonly used nonhierarchical technique is the k-means approach developed by MacQueen (1967). Recently, techniques based on developments in computational intelligence have also been used as clustering algorithms. For example, the theory of fuzzy sets developed by Zadeh (1965), which introduced the concept of partial set membership, has been applied to clustering (Abonyi & Feil, 2007; Dumitrescu, Lazzerini, & Jain, 2000). Another technique receiving considerable attention is the theory of rough sets (Pawlak, 1982), which has led to clustering algorithms referred to as rough clustering (do Prado, Engel, & Filho, 2002; Kumar, Krishna, Bapi, & De, 2007; Parmar, Wu, & Blackhurst, 2007; Voges, Pope, & Brown, 2002). This article provides brief introductions to k-means cluster analysis, rough sets theory, and rough clustering, and compares k-means clustering and rough clustering. It shows that rough clustering provides a more flexible solution to the clustering problem, and can be conceptualized as extracting concepts from the data, rather than strictly delineated subgroupings (Pawlak, 1991). Traditional clustering methods generate extensional descriptions of groups (i.e., which objects are members of each cluster), whereas clustering techniques based on rough sets theory generate intensional descriptions (i.e., what are the main characteristics of each cluster) (do Prado et al., 2002). These different goals suggest that both k-means clustering and rough clustering have their place in the data analyst’s and the information manager’s toolbox.
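The rough sets idea mentioned in the abstract can be illustrated with a minimal sketch of Pawlak's lower and upper approximations. This is not the rough clustering algorithm of the article; it only shows the underlying set construction. The visitor records, the attribute names, and the function names `equivalence_classes` and `approximations` are all hypothetical and chosen for illustration.

```python
from collections import defaultdict


def equivalence_classes(objects, attributes):
    """Group objects that are indiscernible on the chosen attributes."""
    classes = defaultdict(set)
    for name, values in objects.items():
        key = tuple(values[a] for a in attributes)
        classes[key].add(name)
    return list(classes.values())


def approximations(classes, target):
    """Return the (lower, upper) approximations of the target set.

    lower: union of classes fully contained in the target (certain members)
    upper: union of classes that overlap the target (possible members)
    """
    lower, upper = set(), set()
    for cls in classes:
        if cls <= target:
            lower |= cls
        if cls & target:
            upper |= cls
    return lower, upper


# Hypothetical Web site visitors described by two coarse attributes.
visitors = {
    "v1": {"visits": "high", "buys": "yes"},
    "v2": {"visits": "high", "buys": "yes"},
    "v3": {"visits": "low", "buys": "no"},
    "v4": {"visits": "low", "buys": "no"},
    "v5": {"visits": "high", "buys": "no"},
}

if __name__ == "__main__":
    classes = equivalence_classes(visitors, ["visits", "buys"])
    # Hypothetical target concept: visitors labelled as a segment of interest.
    segment = {"v1", "v2", "v3"}
    lower, upper = approximations(classes, segment)
    print("certain members:", sorted(lower))
    print("possible members:", sorted(upper))
    print("boundary region:", sorted(upper - lower))
```

In this sketch v3 and v4 cannot be told apart on the chosen attributes, so both fall in the boundary region. That boundary is what separates a rough description from the strict partition produced by k-means, where every object belongs to exactly one cluster.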

Cluster Analysis Using Rough Clustering and k-Means Clustering

Encyclopedia of Information Science and Technology, First Edition

10.4018/978-1-59140-553-5.ch077

2005

pp. 435-438

Author(s):

Kevin E. Voges

Keyword(s):

Cluster Analysis

Data Reduction

Computational Intelligence

Rough Sets

Information Science

Reduction Technique

Analysis Techniques

The Subject

Data Reduction Technique

Potential Use

Cluster analysis is a fundamental data reduction technique used in the physical and social sciences. The technique is of interest to managers in information science because of its potential use in identifying user needs through segmenting users such as Web site visitors. In addition, the theory of rough sets is the subject of intense interest in computational intelligence research. The extension of this theory into rough clustering provides an important and potentially useful addition to the range of cluster analysis techniques available to the manager.

Presentation of precedents in decision support system knowledge base under diagnosis of portal cranes on the basis of the rough sets theory

Shipbuilding and marine infrastructure

10.15589/smi20140109

2014

Vol 1 (1)

Author(s):

Igor I. Kovalenko

Anton V. Melnik

Keyword(s):

Decision Support

Decision Support System

Knowledge Base

Support System

Rough Sets

Rough Sets Theory

Sets Theory

System Knowledge

Fuzzy neural classifier for fault diagnosis of transformer based on rough sets theory

2005 International Conference on Electrical Machines and Systems

10.1109/icems.2005.202962

2005

Author(s):

Hongsheng Su

Qunzhan Li

Keyword(s):

Fault Diagnosis

Rough Sets

Rough Sets Theory

Fuzzy Neural

Neural Classifier

Sets Theory

Compound-Fault Diagnosis of Bearing Based on Order Tracking Wavelet Packet and Rough Sets

Applied Mechanics and Materials

10.4028/www.scientific.net/amm.130-134.1681

2011

Vol 130-134

pp. 1681-1685

Cited By ~ 1

Author(s):

Guang Tian

Hao Tian

Guang Sheng Liu

Jin Hui Zhao

Li Ping Luo

Keyword(s):

Fault Diagnosis

Rough Sets

Wavelet Packet

Decision Rules

Vibration Signals

Rough Sets Theory

Equal Angle

Order Tracking

Start Up

Sets Theory

Diagnosing compound faults has long been difficult, and the equipment diagnosis field lacks an effective method, so a new method of compound-fault diagnosis is presented. The vibration signals at start-up in the gearbox are non-stationary, and traditional diagnostic methods have low precision. Order tracking, wavelet packets, and rough sets theory are introduced into the compound-fault diagnosis of bearings. First, the start-up vibration signals are resampled using a computed order tracking algorithm to obtain vibration signals distributed at equal angles, and wavelet packets are used to decompose and reconstruct these signals. Then, the energy distribution of every frequency band is calculated through a normalization process, yielding a new feature vector from which clear and concise decision rules are obtained by rough sets theory. Finally, the results of a compound-fault example show that the proposed method is highly valid and has broad application prospects.
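The band-energy feature step described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not name the wavelet, so an orthonormal Haar wavelet packet is assumed here, the input is taken to be already resampled at equal angles, and the function names `haar_packet_bands` and `normalized_band_energy` are hypothetical.

```python
import math


def haar_packet_bands(signal, levels):
    """Split a signal into 2**levels sub-bands with a Haar wavelet packet.

    Assumption: Haar filters stand in for whatever wavelet the paper used.
    Bands are returned in natural (tree) order, not sorted by frequency.
    """
    if len(signal) % (2 ** levels) != 0:
        raise ValueError("signal length must be divisible by 2**levels")
    bands = [list(signal)]
    for _ in range(levels):
        next_bands = []
        for band in bands:
            pairs = list(zip(band[0::2], band[1::2]))
            # Orthonormal Haar step: scaled sums (approximation)
            # and scaled differences (detail) of neighbouring samples.
            approx = [(a + b) / math.sqrt(2) for a, b in pairs]
            detail = [(a - b) / math.sqrt(2) for a, b in pairs]
            next_bands.extend([approx, detail])
        bands = next_bands
    return bands


def normalized_band_energy(bands):
    """Return each band's share of the total energy as a feature vector."""
    energies = [sum(x * x for x in band) for band in bands]
    total = sum(energies)
    if total == 0:
        return [0.0] * len(bands)
    return [e / total for e in energies]


if __name__ == "__main__":
    # Hypothetical equal-angle samples; a real signal would come from
    # order-tracking resampling of the start-up vibration data.
    samples = [0.0, 1.0, 0.0, -1.0, 0.5, 1.5, -0.5, -1.5]
    features = normalized_band_energy(haar_packet_bands(samples, 2))
    print(features)
```

A feature vector of this kind would then typically be discretized before rough sets rule extraction, since rough sets methods operate on categorical attribute values; that step is outside this sketch.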