Evolutionary Clustering Algorithm Using Supervised Classifiers

Author(s):  
Benjamin Mario Sainz-Tinajero ◽  
Andres Eduardo Gutierrez-Rodriguez ◽  
Hector G. Ceballos ◽  
Francisco J. Cantu-Ortiz
Author(s):  
Catherine Cheung ◽  
Julio J. Valdés ◽  
Richard Salas Chavez ◽  
Srishti Sehgal

In this work, the sensor data related to a diesel engine system and specifically its turbocharger subsystem were analyzed. An incident where the turbocharger seized was recorded by dozens of standard turbocharger-related sensors. By training models to distinguish between normal healthy operating conditions and deteriorated conditions, there is an opportunity to develop prognostic and predictive tools to ideally help prevent a similar occurrence in the future. Analysis of this event provides an opportunity to identify changes in equipment indicators with a known outcome. A number of data analysis tools were used to characterize the healthy and deteriorated states of the turbocharger system, including various supervised classification as well as semi-supervised and unsupervised anomaly detection techniques. The leader clustering algorithm was also implemented to reduce the amount of data to train and develop the models. This paper describes the results of this modeling process, validated by testing on healthy data from the same propulsion system and a second distinct one. Although this problem posed challenges due to the severely imbalanced class distribution, the supervised classifiers, in particular Support Vector Machine (SVM) and Random Forest (RF), performed very well in all metrics while the unsupervised anomaly detection models achieved near-perfect accuracy for identifying healthy turbocharger states.


Author(s):  
C. James Li ◽  
C. Jansuwan

Projection network, being a non-linear dynamic system itself, has been shown to be superior to static classifiers such as neural networks in some applications where noise is significant. However it is a supervised classifier by nature. To extend its utility for unsupervised classification, this study proposes an unsupervised pattern classifier integrating a clustering algorithm based on DBSCAN and a dynamic classifier based on the projection network. The former is used to form clusters out of un-labeled data and eliminate outliers. Then, significant clusters in terms of size are identified. Subsequently, a system of projection networks is established to recognize all the significant clusters. The unsupervised classifier is tested with three well-known benchmark data sets (by ignoring data labels during training) including the Fisher’s iris data, the heart disease data and the credit screening data and the results are compared to those of supervised classifiers based on the projection network. The difference in performance is small. However, the ability of unsupervised classification comes at a price of a more complex classifier system and the need of data pre-conditioning. The former is because more than one cluster could be formed for a class and therefore more computational units are needed for the classifier, and the latter is because increased similarity of data after clustering increases the chances of numerical instability in the least square algorithm used to initialize the classifier.


2014 ◽  
Vol 2014 ◽  
pp. 1-7 ◽  
Author(s):  
Huaixiao Wang ◽  
Wanhong Zhu ◽  
Jianyong Liu ◽  
Ling Li ◽  
Zhuchen Yin

To determine the multidistribution center location and the distribution scope of the distribution center with high efficiency, the real-parameter quantum-inspired evolutionary clustering algorithm (RQECA) is proposed. RQECA is applied to choose multidistribution center location on the basis of the conventional fuzzy C-means clustering algorithm (FCM). The combination of the real-parameter quantum-inspired evolutionary algorithm (RQIEA) and FCM can overcome the local search defect of FCM and make the optimization result independent of the choice of initial values. The comparison of FCM, clustering based on simulated annealing genetic algorithm (CSAGA), and RQECA indicates that RQECA has the same good convergence as CSAGA, but the search efficiency of RQECA is better than that of CSAGA. Therefore, RQECA is more efficient to solve the multidistribution center location problem.


Sign in / Sign up

Export Citation Format

Share Document