Using machine learning method to classify polar stratospheric cloud types from Envisat MIPAS observations

<p>Polar stratospheric clouds (PSC) play a key role in polar ozone depletion in the stratosphere. Improved observations and continuous monitoring of PSCs can help to validate and enhance chemistry-climate models that are used to predict the evolution of the polar ozone hole. Here we present the results of our study in which we explored the potential of applying machine learning (ML) methods to classify PSC observations of infrared limb sounders. Two datasets have been considered. The first dataset is a collection of infrared spectra captured in Northern Hemisphere winter 2006/2007 and Southern Hemisphere winter 2009 by the Michelson Interferometer for Passive Atmospheric Sounding (MIPAS) instrument onboard ESA's Envisat satellite. The second dataset is the cloud scenario database (CSDB) of simulated MIPAS spectra. We first performed an initial analysis to assess the basic characteristics of these datasets and to decide which features to extract from them. More than 10,000 Brightness temperature differences (BTDs) features have been generated and fed as input to the ML methods instead of directly using the infrared spectra. Next, we assessed the use of ML methods for the reduction of the dimensionality of this large feature space using principal component analysis (PCA) and kernel principal component analysis (KPCA) as well as the classification with the random forest (RF) and support vector machine (SVM) techniques. All methods were found to be suitable to retrieve information on the composition of PSCs. Of these, RF seems to be the most promising method, being less prone to overfitting and producing results that agree well with established results based on conventional classification methods.</p>

Download Full-text

Exploration of machine learning methods for the classification of infrared limb spectra of polar stratospheric clouds

Atmospheric Measurement Techniques ◽

10.5194/amt-13-3661-2020 ◽

2020 ◽

Vol 13 (7) ◽

pp. 3661-3682

Author(s):

Rocco Sedona ◽

Lars Hoffmann ◽

Reinhold Spang ◽

Gabriele Cavallaro ◽

Sabine Griessbach ◽

...

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Infrared Spectra ◽

Principal Component ◽

Component Analysis ◽

Support Vector ◽

Polar Stratospheric Clouds ◽

Hemisphere Winter ◽

Stratospheric Clouds ◽

Polar Ozone

Abstract. Polar stratospheric clouds (PSCs) play a key role in polar ozone depletion in the stratosphere. Improved observations and continuous monitoring of PSCs can help to validate and improve chemistry–climate models that are used to predict the evolution of the polar ozone hole. In this paper, we explore the potential of applying machine learning (ML) methods to classify PSC observations of infrared limb sounders. Two datasets were considered in this study. The first dataset is a collection of infrared spectra captured in Northern Hemisphere winter 2006/2007 and Southern Hemisphere winter 2009 by the Michelson Interferometer for Passive Atmospheric Sounding (MIPAS) instrument on board the European Space Agency's (ESA) Envisat satellite. The second dataset is the cloud scenario database (CSDB) of simulated MIPAS spectra. We first performed an initial analysis to assess the basic characteristics of the CSDB and to decide which features to extract from it. Here, we focused on an approach using brightness temperature differences (BTDs). From both the measured and the simulated infrared spectra, more than 10 000 BTD features were generated. Next, we assessed the use of ML methods for the reduction of the dimensionality of this large feature space using principal component analysis (PCA) and kernel principal component analysis (KPCA) followed by a classification with the support vector machine (SVM). The random forest (RF) technique, which embeds the feature selection step, has also been used as a classifier. All methods were found to be suitable to retrieve information on the composition of PSCs. Of these, RF seems to be the most promising method, being less prone to overfitting and producing results that agree well with established results based on conventional classification methods.

Download Full-text

Exploration of machine learning methods for the classification of infrared limb spectra of polar stratospheric clouds

10.5194/amt-2019-481 ◽

2020 ◽

Author(s):

Rocco Sedona ◽

Lars Hoffmann ◽

Reinhold Spang ◽

Gabriele Cavallaro ◽

Sabine Griessbach ◽

...

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Infrared Spectra ◽

Principal Component ◽

Component Analysis ◽

Support Vector ◽

Polar Stratospheric Clouds ◽

Hemisphere Winter ◽

Stratospheric Clouds ◽

Polar Ozone

Abstract. Polar stratospheric clouds (PSC) play a key role in polar ozone depletion in the stratosphere. Improved observations and continuous monitoring of PSCs can help to validate and enhance chemistry-climate models that are used to predict the evolution of the polar ozone hole. In this paper, we explore the potential of applying machine learning (ML) methods to classify PSC observations of infrared limb sounders. Two datasets have been considered in this study. The first dataset is a collection of infrared spectra captured in Northern Hemisphere winter 2006/2007 and Southern Hemisphere winter 2009 by the Michelson Interferometer for Passive Atmospheric Sounding (MIPAS) instrument onboard ESA's Envisat satellite. The second dataset is the cloud scenario database (CSDB) of simulated MIPAS spectra. We first performed an initial analysis to assess the basic characteristics of these datasets and to decide which features to extract from them. Here, we focused on an approach using brightness temperature differences (BTDs). From the both, the measured and the simulated infrared spectra, more than 10,000 BTD features have been generated. Next, we assessed the use of ML methods for the reduction of the dimensionality of this large feature space using principal component analysis (PCA) and kernel principal component analysis (KPCA) as well as the classification with the random forest (RF) and support vector machine (SVM) techniques. All methods were found to be suitable to retrieve information on the composition of PSCs. Of these, RF seems to be the most promising method, being less prone to overfitting and producing results that agree well with established results based on conventional classification methods.

Download Full-text

Multi-View Face Detection Based on Kernel Principal Component Analysis and Kernel Support Vector Techniques

International Journal on Soft Computing ◽

10.5121/ijsc.2011.2201 ◽

2011 ◽

Vol 2 (2) ◽

pp. 1-13 ◽

Cited By ~ 5

Author(s):

Muzhir Shaban Al Ani ◽

Alaa Sulaiman Al Waisy

Keyword(s):

Principal Component Analysis ◽

Face Detection ◽

Principal Component ◽

Component Analysis ◽

Kernel Principal Component Analysis ◽

Support Vector

Download Full-text

Spam Detection Approach Based on C-Support Vector Machine and Kernel Principal-Component Analysis

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing ◽

10.1109/iih-msp.2014.64 ◽

2014 ◽

Author(s):

Shu Geng ◽

Liu Lv ◽

Rongjun Liu

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Principal Component ◽

Component Analysis ◽

Kernel Principal Component Analysis ◽

Support Vector ◽

Spam Detection ◽

Detection Approach

Download Full-text

Research on Recognition Method of Driving Fatigue State Based on Sample Entropy and Kernel Principal Component Analysis

Entropy ◽

10.3390/e20090701 ◽

2018 ◽

Vol 20 (9) ◽

pp. 701 ◽

Cited By ~ 6

Author(s):

Beige Ye ◽

Taorong Qiu ◽

Xiaoming Bai ◽

Ping Liu

Keyword(s):

Principal Component Analysis ◽

Recognition Accuracy ◽

Principal Component ◽

Component Analysis ◽

Sample Entropy ◽

Kernel Principal Component Analysis ◽

Support Vector ◽

Recognition Method ◽

State Recognition ◽

Driving Fatigue

In view of the nonlinear characteristics of electroencephalography (EEG) signals collected in the driving fatigue state recognition research and the issue that the recognition accuracy of the driving fatigue state recognition method based on EEG is still unsatisfactory, this paper proposes a driving fatigue recognition method based on sample entropy (SE) and kernel principal component analysis (KPCA), which combines the advantage of the high recognition accuracy of sample entropy and the advantages of KPCA in dimensionality reduction for nonlinear principal components and the strong non-linear processing capability. By using support vector machine (SVM) classifier, the proposed method (called SE_KPCA) is tested on the EEG data, and compared with those based on fuzzy entropy (FE), combination entropy (CE), three kinds of entropies including SE, FE and CE that merged with KPCA. Experiment results show that the method is effective.

Download Full-text

Face Recognition with Kernel Principal Component Analysis and Support Vector Machine

2019 International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS) ◽

10.1109/icimcis48181.2019.8985199 ◽

2019 ◽

Author(s):

Dewi Yanti Liliana ◽

I Made Agus Setiawan

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Face Recognition ◽

Principal Component ◽

Component Analysis ◽

Kernel Principal Component Analysis ◽

Support Vector

Download Full-text

Safety assessment in power supply enterprise based on kernel principal component analysis and fast multi-class support vector machine

2009 International Conference on Machine Learning and Cybernetics ◽

10.1109/icmlc.2009.5212424 ◽

2009 ◽

Author(s):

Wei Sun ◽

Guo-Zhen Ma

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Power Supply ◽

Safety Assessment ◽

Principal Component ◽

Component Analysis ◽

Kernel Principal Component Analysis ◽

Support Vector ◽

Power Supply Enterprise

Download Full-text

Multivariate Analysis and Machine Learning for Ripeness Classification of Cape Gooseberry Fruits

Processes ◽

10.3390/pr7120928 ◽

2019 ◽

Vol 7 (12) ◽

pp. 928 ◽

Cited By ~ 2

Author(s):

Miguel De-la-Torre ◽

Omar Zatarain ◽

Himer Avila-George ◽

Mirna Muñoz ◽

Jimy Oblitas ◽

...

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Feature Selection ◽

Principal Component ◽

Component Analysis ◽

Support Vector ◽

Color Spaces ◽

Combination Methods ◽

Fruit Samples ◽

Cape Gooseberry

This paper explores five multivariate techniques for information fusion on sorting the visual ripeness of Cape gooseberry fruits (principal component analysis, linear discriminant analysis, independent component analysis, eigenvector centrality feature selection, and multi-cluster feature selection.) These techniques are applied to the concatenated channels corresponding to red, green, and blue (RGB), hue, saturation, value (HSV), and lightness, red/green value, and blue/yellow value (L*a*b) color spaces (9 features in total). Machine learning techniques have been reported for sorting the Cape gooseberry fruits’ ripeness. Classifiers such as neural networks, support vector machines, and nearest neighbors discriminate on fruit samples using different color spaces. Despite the color spaces being equivalent up to a transformation, a few classifiers enable better performances due to differences in the pixel distribution of samples. Experimental results show that selection and combination of color channels allow classifiers to reach similar levels of accuracy; however, combination methods still require higher computational complexity. The highest level of accuracy was obtained using the seven-dimensional principal component analysis feature space.

Download Full-text