An unsupervised classification scheme for multi-class problems including feature selection based on MTS philosophy

Author(s):  
Prasun Das ◽  
Sandip Mukherjee
2020 ◽  
Vol 13 (6) ◽  
pp. 2949-2964
Author(s):  
Jussi Leinonen ◽  
Alexis Berne

Abstract. The increasing availability of sensors imaging cloud and precipitation particles, like the Multi-Angle Snowflake Camera (MASC), has resulted in datasets comprising millions of images of falling snowflakes. Automated classification is required for effective analysis of such large datasets. While supervised classification methods have been developed for this purpose in recent years, their ability to generalize is limited by the representativeness of their labeled training datasets, which are affected by the subjective judgment of the expert and require significant manual effort to derive. An alternative is unsupervised classification, which seeks to divide a dataset into distinct classes without expert-provided labels. In this paper, we introduce an unsupervised classification scheme based on a generative adversarial network (GAN) that learns to extract the key features from the snowflake images. Each image is then associated with a distribution of points in the feature space, and these distributions are used as the basis of K-medoids classification and hierarchical clustering. We found that the classification scheme is able to separate the dataset into distinct classes, each characterized by a particular size, shape and texture of the snowflake image, providing signatures of the microphysical properties of the snowflakes. This finding is supported by a comparison of the results to an existing supervised scheme. Although training the GAN is computationally intensive, the classification process proceeds directly from images to classes with minimal human intervention and therefore can be repeated for other MASC datasets with minor manual effort. As the algorithm is not specific to snowflakes, we also expect this approach to be relevant to other applications.


2018 ◽  
Vol 2018 ◽  
pp. 1-5 ◽  
Author(s):  
Asriyanti Indah Pratiwi ◽  
Adiwijaya

Sentiment analysis in a movie review is the needs of today lifestyle. Unfortunately, enormous features make the sentiment of analysis slow and less sensitive. Finding the optimum feature selection and classification is still a challenge. In order to handle an enormous number of features and provide better sentiment classification, an information-based feature selection and classification are proposed. The proposed method reduces more than 90% unnecessary features while the proposed classification scheme achieves 96% accuracy of sentiment classification. From the experimental results, it can be concluded that the combination of proposed feature selection and classification achieves the best performance so far.


2016 ◽  
Vol 10 (1) ◽  
pp. 35-41 ◽  
Author(s):  
Tatjana Liogienė ◽  
Gintautas Tamulevičius

Abstract The intensive research of speech emotion recognition introduced a huge collection of speech emotion features. Large feature sets complicate the speech emotion recognition task. Among various feature selection and transformation techniques for one-stage classification, multiple classifier systems were proposed. The main idea of multiple classifiers is to arrange the emotion classification process in stages. Besides parallel and serial cases, the hierarchical arrangement of multi-stage classification is most widely used for speech emotion recognition. In this paper, we present a sequential-forward-feature-selection-based multi-stage classification scheme. The Sequential Forward Selection (SFS) and Sequential Floating Forward Selection (SFFS) techniques were employed for every stage of the multi-stage classification scheme. Experimental testing of the proposed scheme was performed using the German and Lithuanian emotional speech datasets. Sequential-feature-selection-based multi-stage classification outperformed the single-stage scheme by 12–42 % for different emotion sets. The multi-stage scheme has shown higher robustness to the growth of emotion set. The decrease in recognition rate with the increase in emotion set for multi-stage scheme was lower by 10–20 % in comparison with the single-stage case. Differences in SFS and SFFS employment for feature selection were negligible.


2020 ◽  
Vol 2020 ◽  
pp. 1-8
Author(s):  
Han-Wei Huang ◽  
Pei-Ying Wu ◽  
Pei-Fang Su ◽  
Chung-I Li ◽  
Yu-Min Yeh ◽  
...  

Background and Objective. The main purpose of this study was to develop a simple automatic diagnostic classification scheme for chemotherapy-induced peripheral neuropathy. Methods. This was a prospective cohort study that enrolled patients with colorectal or gynecologic cancer post chemotherapy for more than 1 year. The patients underwent laboratory examinations (nerve conduction studies and quantitative sensory tests), and a questionnaire about the quality of life. An unsupervised classification algorithm was used to classify the patients into groups using a small number of variables derived from the laboratory tests. A panel of five neurologists also diagnosed the types of neuropathies according to the laboratory tests. The results by the unsupervised classification algorithm and the neurologists were compared. Results. The neurologists’ diagnoses showed much higher rates of entrapment syndromes (66.1%) and radiculopathies (55.1%) than polyneuropathy (motor/sensory: 33.1%/29.7%). A multivariate analysis showed that the questionnaire was not significantly correlated with the results of quantitative sensory tests (r=0.27) or the neurologists’ diagnoses (r=0.2). All of the patients were classified into four groups by the unsupervised classification algorithm. The classification corresponded to the severity of neuropathy and correlated well with the neurologists’ diagnoses and the scales of neurological examinations. The overall correct rate of classification by the unsupervised classification algorithm was 78.8% (95% confidence interval: 73.1%-88.3%). Conclusion. The results of our unsupervised classification algorithm based on three variables of laboratory tests correlated well with the neurologists’ diagnoses.


2019 ◽  
Author(s):  
Jussi Leinonen ◽  
Alexis Berne

Abstract. The increasing availability of sensors imaging cloud and precipitation particles, like the Multi-Angle Snowflake Camera (MASC), has resulted in datasets comprising millions of images of falling snowflakes. Automated classification is required for effective analysis of such large datasets. While supervised classification methods have been developed for this purpose in the recent years, their ability to generalize is limited by the representativeness of their labeled training datasets, which are affected by the subjective judgment of the expert and require significant manual effort to derive. An alternative is unsupervised classification, which seeks to divide a dataset into distinct classes without expert-provided labels. In this study, we introduce an unsupervised classification scheme based on a generative adversarial network (GAN) that learns to extract the key features from the images. Each image is then associated with a distribution of points in the feature space, and these distributions are used as the basis of K-medoids classification and hierarchical clustering. We find that the classification scheme is able to separate the dataset into distinct classes, each characterized by a particular size, shape and texture of the snowflake image, providing signatures of the microphysical properties of the snowflakes. This finding is supported by a comparison of the results to an existing supervised scheme. Although training the GAN is computationally intensive, the classification process proceeds directly from images to classes with minimal human intervention and therefore can be repeated for other MASC datasets with minor manual effort. As the algorithm is not specific to snowflakes, we also expect this approach to be relevant to other applications.


2013 ◽  
Vol 2013 ◽  
pp. 1-13 ◽  
Author(s):  
Zhi Zhao ◽  
Kefeng Ji ◽  
Xiangwei Xing ◽  
Wenting Chen ◽  
Huanxin Zou

Ship surveillance using space-borne synthetic aperture radar (SAR), taking advantages of high resolution over wide swaths and all-weather working capability, has attracted worldwide attention. Recent activity in this field has concentrated mainly on the study of ship detection, but the classification is largely still open. In this paper, we propose a novel ship classification scheme based on analytic hierarchy process (AHP) in order to achieve better performance. The main idea is to apply AHP on both feature selection and classification decision. On one hand, the AHP based feature selection constructs a selection decision problem based on several feature evaluation measures (e.g., discriminability, stability, and information measure) and provides objective criteria to make comprehensive decisions for their combinations quantitatively. On the other hand, we take the selected feature sets as the input of KNN classifiers and fuse the multiple classification results based on AHP, in which the feature sets’ confidence is taken into account when the AHP based classification decision is made. We analyze the proposed classification scheme and demonstrate its results on a ship dataset that comes from TerraSAR-X SAR images.


Sign in / Sign up

Export Citation Format

Share Document