Multiple CNN Variants and Ensemble Learning for Sunspot Group Classification by Magnetic Type

2021 ◽  
Vol 257 (2) ◽  
pp. 38
Author(s):  
Rongxin Tang ◽  
Xunwen Zeng ◽  
Zhou Chen ◽  
Wenti Liao ◽  
Jingsong Wang ◽  
...  

Abstract A solar active region is a source of disturbance for the Sun–terrestrial space environment and usually causes extreme space weather, such as geomagnetic storms. The main indicator of an active region is sunspots. Certain types of sunspots are related to extreme space weather caused by eruptive events such as coronal mass ejections or solar flares. Thus, the automatic classification of sunspot groups is helpful to predict solar activity quickly and accurately. This paper completed the automatic classification of a sunspot group data set based on the Mount Wilson classification scheme, which contains continuum and magnetogram images provided by the Solar Dynamics Observatory’s Helioseismic and Magnetic Imager SHARP data from 2010 May 1 to 2017 December 12. After applying some data preprocessing steps such as image cropping and data standardization, the features of magnetic type in the data are more obvious, and the amount of data is increased. The processed data are spliced into two frames of single-channel data for the neural network to perform 3D convolution operations. This paper constructs a variety of convolutional neural networks with different structures and numbers of layers, selects 10 models as representatives, and chooses XGBoost, which is commonly used in ensemble-learning algorithms, to fuse the results of independent classification models. We found that XGBoost is an effective way to fuse models, which is proved by the relatively balanced high scores in the three magnetic types. The accuracy of the ensemble model is above 92%. The F1 scores of the magnetic types of Alpha, Beta, and Beta-x reached 0.95, 0.91, and 0.82 respectively.

1981 ◽  
Vol 46 (2) ◽  
pp. 381-396 ◽  
Author(s):  
Robert A. Benfer ◽  
Alice N. Benfer

The application of extremely complex multivariate models of classification to subjective inspectional methods of categorization is analyzed in detail, with the widely used Texas system of dart point typology as a case study. The history of the development of the Texas dart point typological system is sketched. An attempt by Gunn and Prewitt (1975) to objectify the classificatory system by multivariate methods is criticized. The techniques applied were too idiosyncratic to the particular data set used to be of predictive value. Discriminant function and multivariate classification analysis are discussed in detail, emphasizing simple geometrical examples by which the major principles may be grasped. Suggestions for improvement are offered for those who wish to follow Gunn and Prewitt in constructing automatic classification schemes.


2020 ◽  
Vol 48 (4) ◽  
pp. 2316-2327
Author(s):  
Caner KOC ◽  
Dilara GERDAN ◽  
Maksut B. EMİNOĞLU ◽  
Uğur YEGÜL ◽  
Bulent KOC ◽  
...  

Classification of hazelnuts is one of the values adding processes that increase the marketability and profitability of its production. While traditional classification methods are used commonly, machine learning and deep learning can be implemented to enhance the hazelnut classification processes. This paper presents the results of a comparative study of machine learning frameworks to classify hazelnut (Corylus avellana L.) cultivars (‘Sivri’, ‘Kara’, ‘Tombul’) using DL4J and ensemble learning algorithms. For each cultivar, 50 samples were used for evaluations. Maximum length, width, compression strength, and weight of hazelnuts were measured using a caliper and a force transducer. Gradient boosting machine (Boosting), random forest (Bagging), and DL4J feedforward (Deep Learning) algorithms were applied in traditional machine learning algorithms. The data set was partitioned into a 10-fold-cross validation method. The classifier performance criteria of accuracy (%), error percentage (%), F-Measure, Cohen’s Kappa, recall, precision, true positive (TP), false positive (FP), true negative (TN), false negative (FN) values are provided in the results section. The results showed classification accuracies of 94% for Gradient Boosting, 100% for Random Forest, and 94% for DL4J Feedforward algorithms.


2000 ◽  
Vol 179 ◽  
pp. 119-125
Author(s):  
Robert F. Howard

AbstractThe tilt angles of sunspot groups are defined, using the Mount Wilson data set. It is shown that groups with tilt angles greater than or less than the average value (≈5 deg) show different latitude dependences. This effect is also seen in synoptic magnetic field data defining plages. The fraction of the total sunspot group area that is found in the leading spots is discussed as a parameter that can be useful in studying the dynamics of sunspot groups. This parameter is larger for low tilt angles, and small for extreme tilt angles in either direction. The daily variations of sunspot group tilt angles are discussed. The result that sunspot tilt angles tend to rotate toward the average value is reviewed. It is suggested that at some depth, perhaps 50 Mm, there is a flow relative to the surface that results from a rotation rate faster than the surface rate by about 60 m/sec and a meridional drift that is slower than the surface rate by about 5 m/sec. This results in a slanted relative flow at that depth that is in the direction of the average tilt angle and may be responsible for the tendency for sunspot groups (and plages) to rotate their magnetic axes in the direction of the average tilt angle.


2021 ◽  
Vol 45 (4) ◽  
pp. 233-238
Author(s):  
Lazar Kats ◽  
Marilena Vered ◽  
Johnny Kharouba ◽  
Sigalit Blumer

Objective: To apply the technique of transfer deep learning on a small data set for automatic classification of X-ray modalities in dentistry. Study design: For solving the problem of classification, the convolution neural networks based on VGG16, NASNetLarge and Xception architectures were used, which received pre-training on ImageNet subset. In this research, we used an in-house dataset created within the School of Dental Medicine, Tel Aviv University. The training dataset contained anonymized 496 digital Panoramic and Cephalometric X-ray images for orthodontic examinations from CS 8100 Digital Panoramic System (Carestream Dental LLC, Atlanta, USA). The models were trained using NVIDIA GeForce GTX 1080 Ti GPU. The study was approved by the ethical committee of Tel Aviv University. Results: The test dataset contained 124 X-ray images from 2 different devices: CS 8100 Digital Panoramic System and Planmeca ProMax 2D (Planmeca, Helsinki, Finland). X-ray images in the test database were not pre-processed. The accuracy of all neural network architectures was 100%. Following a result of almost absolute accuracy, the other statistical metrics were not relevant. Conclusions: In this study, good results have been obtained for the automatic classification of different modalities of X-ray images used in dentistry. The most promising direction for the development of this kind of application is the transfer deep learning. Further studies on automatic classification of modalities, as well as sub-modalities, can maximally reduce occasional difficulties arising in this field in the daily practice of the dentist and, eventually, improve the quality of diagnosis and treatment.


2021 ◽  
Vol 0 (0) ◽  
Author(s):  
Pei Pan ◽  
Yijin Chen

Abstract Public messages on the Internet political inquiry platform rely on manual classification, which has the problems of heavy workload, low efficiency, and high error rate. A Bi-directional long short-term memory (Bi-LSTM) network model based on attention mechanism was proposed in this paper to realize the automatic classification of public messages. Considering the network political inquiry data set provided by the BdRace platform as samples, the Bi-LSTM algorithm is used to strengthen the correlation between the messages before and after the training process, and the semantic attention to important text features is strengthened in combination with the characteristics of attention mechanism. Feature weights are integrated through the full connection layer to carry out classification calculations. The experimental results show that the F1 value of the message classification model proposed here reaches 0.886 and 0.862, respectively, in the data set of long text and short text. Compared with three algorithms of long short-term memory (LSTM), logistic regression, and naive Bayesian, the Bi-LSTM model can achieve better results in the automatic classification of public message subjects.


2015 ◽  
Vol 33 (1) ◽  
pp. 119
Author(s):  
Alexandre Cruz Sanchetta ◽  
Emilson Pereira Leite ◽  
Bruno César Zanardo Honório ◽  
Alexandre Campane Vidal

ABSTRACT. The problem of automatic classification of facies was addressed using the Fast Independent Component Analysis (FastICA) of a data set of geophysical well logs of the Namorado Field, Campos Basin, Brazil, followed by a k-nearest neighbor (k-NN) classification. The goal of an automatic classification of facies is to produce spatial models of facies that assist the geological characterization of petroleum reservoirs. The FastICA technique provides a new data set that has the most stable and less Gaussian distribution possible. The k-NN classifies this new data set according to its characteristics. The previous application of FastICA improves the accuracy of the k-NN automatic classification and it also provides better results in comparison with the automatic classification by means of the Principal Component Analysis (PCA).Keywords: automatic classification, geophysical well logs, Independent Component Analysis.RESUMO. O problema da classificação automática de fácies foi abordado através da Análise de Componentes Independentes Rápida (FastICA – Fast Independent Component Analysis ) de um conjunto de dados de perfis geofísicos de poços do Campo de Namorado, Bacia de Campos, seguida de classificação por k vizinhos mais próximos (k-NN – k-nearest neighbor ). A classificação automática de fácies é utilizada para gerar modelos de distribuição espacial de fácies que auxiliam a caracterização geológica dos reservatórios de petróleo. A técnica FastICA encontra um novo conjunto de dados com distribuição mais estável e menos gaussiana possível e o k-NN classifica esse novo conjunto de acordo com suas características. A aplicação prévia da FastICA melhora a porcentagem de acerto da classificação automática pelo k-NN, fornecendo melhores resultados quando comparada com a classificação automática por Análise de Componentes Principais (PCA – Principal Component Analysis ).Palavras-chave: classificação automática, perfis geofísicos de poços, Análise de Componentes Independentes.


Author(s):  
Paul DeCosta ◽  
Kyugon Cho ◽  
Stephen Shemlon ◽  
Heesung Jun ◽  
Stanley M. Dunn

Introduction: The analysis and interpretation of electron micrographs of cells and tissues, often requires the accurate extraction of structural networks, which either provide immediate 2D or 3D information, or from which the desired information can be inferred. The images of these structures contain lines and/or curves whose orientation, lengths, and intersections characterize the overall network.Some examples exist of studies that have been done in the analysis of networks of natural structures. In, Sebok and Roemer determine the complexity of nerve structures in an EM formed slide. Here the number of nodes that exist in the image describes how dense nerve fibers are in a particular region of the skin. Hildith proposes a network structural analysis algorithm for the automatic classification of chromosome spreads (type, relative size and orientation).


Sign in / Sign up

Export Citation Format

Share Document