Deep learning based on stacked sparse autoencoder applied to viral genome classification of SARS-CoV-2 virus

10.1101/2021.10.14.464414 ◽

2021 ◽

Author(s):

Gracielly G. F. Coutinho ◽

Gabriel B. M. Câmara ◽

Raquel de M. Barbosa ◽

Marcelo A. C. Fernandes

Keyword(s):

Deep Learning ◽

Viral Genome ◽

Genomic Sequence ◽

Confusion Matrix ◽

Taxonomic Classification ◽

Classification Problems ◽

Virus Identification ◽

Sparse Autoencoder ◽

Stacked Sparse Autoencoder

Since December 2019, the world has been intensely affected by the COVID-19 pandemic, caused by the SARS-CoV-2 virus, first identified in Wuhan, China. When a novel virus is identified, early elucidation of the taxonomic classification and origin of its genomic sequence is essential for strategic planning, containment, and treatment. Deep learning techniques have been used successfully in many viral classification problems associated with viral infection diagnosis, metagenomics, and phylogenetic analysis. This work proposes an efficient viral genome classifier for the SARS-CoV-2 virus using a deep neural network (DNN) based on the stacked sparse autoencoder (SSAE) technique. We performed four experiments, each providing a different level of taxonomic classification of the SARS-CoV-2 virus. Results are reported as confusion matrices for the validation and test sets and as a ROC curve for the validation set. In all experiments, the SSAE technique performed well. We explored the use of image representations of the complete genome sequences as the SSAE input to classify SARS-CoV-2. For that, a dataset based on a k-mer image representation, with k=6, was used. The results indicate that this deep learning technique is applicable to genome classification problems.
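The k-mer image representation mentioned in the abstract above can be illustrated with a minimal sketch. It assumes a simple layout that the abstract does not specify: overlapping 6-mers are counted, indexed lexicographically (A, C, G, T), and written row by row into a 64 x 64 grid (4^6 = 4096 cells), scaled by the largest count. The function names `kmer_index` and `kmer_image` are hypothetical, and the authors' actual encoding may differ.

```python
K = 6              # k-mer length stated in the abstract
ALPHABET = "ACGT"  # assumed base order for indexing


def kmer_index(kmer):
    """Lexicographic index of a k-mer read as a base-4 number (A=0 ... T=3)."""
    idx = 0
    for base in kmer:
        idx = idx * 4 + ALPHABET.index(base)
    return idx


def kmer_image(sequence, k=K):
    """Count overlapping k-mers and lay the counts out row by row in a
    square grid of side 2**k, scaled to [0, 1] by the largest count."""
    side = 2 ** k  # 4**k cells form a (2**k) x (2**k) grid
    counts = [0] * (4 ** k)
    seq = sequence.upper()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        # Skip windows containing ambiguous bases such as N.
        if all(base in ALPHABET for base in kmer):
            counts[kmer_index(kmer)] += 1
    peak = max(counts) or 1  # avoid division by zero for empty input
    return [[counts[r * side + c] / peak for c in range(side)]
            for r in range(side)]


image = kmer_image("ACGTACGTACGTN")
print(len(image), len(image[0]))
```

Each genome becomes one fixed-size grid regardless of its length, which is what lets a fixed-input network such as an autoencoder consume whole-genome data.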

Classification of Epileptic EEG Signals with Stacked Sparse Autoencoder Based on Deep Learning

Intelligent Computing Methodologies - Lecture Notes in Computer Science ◽

10.1007/978-3-319-42297-8_74 ◽

2016 ◽

pp. 802-810 ◽

Cited By ~ 16

Author(s):

Qin Lin ◽

Shu-qun Ye ◽

Xiu-mei Huang ◽

Si-you Li ◽

Mei-zhen Zhang ◽

...

Keyword(s):

Deep Learning ◽

EEG Signals ◽

Sparse Autoencoder ◽

Stacked Sparse Autoencoder

Taxonomic classification of metagenomic sequences from Relative Abundance Index profiles using deep learning

Biomedical Signal Processing and Control ◽

10.1016/j.bspc.2021.102539 ◽

2021 ◽

Vol 67 ◽

pp. 102539

Author(s):

Meryem Altın Karagöz ◽

O. Ufuk Nalbantoglu

Keyword(s):

Deep Learning ◽

Relative Abundance ◽

Taxonomic Classification ◽

Abundance Index

Spectral-spatial classification of hyperspectral images based on joint bilateral filter and stacked sparse autoencoder

2017 First International Conference on Electronics Instrumentation & Information Systems (EIIS) ◽

10.1109/eiis.2017.8298563 ◽

2017 ◽

Author(s):

Chunhui Zhao ◽

Xiaoqing Wan ◽

Yiming Yan

Keyword(s):

Bilateral Filter ◽

Hyperspectral Images ◽

Spatial Classification ◽

Sparse Autoencoder ◽

Stacked Sparse Autoencoder ◽

Joint Bilateral Filter

Spectral–spatial classification of hyperspectral images using trilateral filter and stacked sparse autoencoder

Journal of Applied Remote Sensing ◽

10.1117/1.jrs.11.016033 ◽

2017 ◽

Vol 11 (1) ◽

pp. 016033 ◽

Cited By ~ 4

Author(s):

Chunhui Zhao ◽

Xiaoqing Wan ◽

Genping Zhao ◽

Yiming Yan

Keyword(s):

Hyperspectral Images ◽

Spatial Classification ◽

Trilateral Filter ◽

Sparse Autoencoder ◽

Stacked Sparse Autoencoder

Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy

Sensors ◽

10.3390/s21196661 ◽

2021 ◽

Vol 21 (19) ◽

pp. 6661

Author(s):

Lars Schmarje ◽

Johannes Brünger ◽

Monty Santarossa ◽

Simon-Martin Schröder ◽

Rainer Kiko ◽

...

Keyword(s):

Deep Learning ◽

Real World ◽

Supervised Classification ◽

Limited Information ◽

Classification Problems ◽

Previous State ◽

Supervised Methods ◽

Real World Datasets ◽

Fuzzy Labels

Deep learning has been successfully applied to many classification problems including underwater challenges. However, a long-standing issue with deep learning is the need for large and consistently labeled datasets. Although current approaches in semi-supervised learning can decrease the required amount of annotated data by a factor of 10 or even more, this line of research still uses distinct classes. For underwater classification, and uncurated real-world datasets in general, clean class boundaries often cannot be drawn because of the limited information content in the images and transitional stages of the depicted objects. This leads to different experts having different opinions and thus producing fuzzy labels, which could also be considered ambiguous or divergent. We propose a novel framework for handling semi-supervised classification of such fuzzy labels. It is based on the idea of overclustering to detect substructures in these fuzzy labels. We propose a novel loss to improve the overclustering capability of our framework and show the benefit of overclustering for fuzzy labels. We show that our framework is superior to previous state-of-the-art semi-supervised methods when applied to real-world plankton data with fuzzy labels. Moreover, we acquire 5 to 10% more consistent predictions of substructures.

Deep learning models for classification of gases detected by sensor arrays of artificial nose

10.5753/eniac.2019.9339 ◽

2019 ◽

Author(s):

Ismael Araujo ◽

Juan Gamboa ◽

Adenilton Silva

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Sensor Arrays ◽

Machine Learning Algorithms ◽

Human Beings ◽

Learning Models ◽

Classification Problems ◽

Artificial Nose ◽

Learning Techniques

Recognizing patterns that are usually imperceptible to human beings has been one of the main advantages of using machine learning algorithms. Deep Learning techniques have shown promise for classification problems, especially those related to image classification. The classification of gases detected by an artificial nose is another area where Deep Learning techniques can be used to seek classification improvements. Succeeding in a classification task can bring many advantages for quality control, as well as for preventing accidents. This work presents several Deep Learning models created specifically for the task of gas classification.

Deep Learning Based Stacked Sparse Autoencoder for PAPR Reduction in OFDM Systems

Intelligent Automation & Soft Computing ◽

10.32604/iasc.2022.019473 ◽

2022 ◽

Vol 31 (1) ◽

pp. 311-324

Author(s):

A. Jayamathi ◽

T. Jayasankar

Keyword(s):

Deep Learning ◽

PAPR Reduction ◽

OFDM Systems ◽

Sparse Autoencoder ◽

Stacked Sparse Autoencoder

Machine learning using intrinsic genomic signatures for rapid classification of novel pathogens: COVID-19 case study

10.1101/2020.02.03.932350 ◽

2020 ◽

Cited By ~ 10

Author(s):

Gurjit S. Randhawa ◽

Maximillian P.M. Soltysiak ◽

Hadi El Roz ◽

Camila P.E. de Souza ◽

Kathleen A. Hill ◽

...

Keyword(s):

Machine Learning ◽

Death Rate ◽

Genomic Sequence ◽

Sequence Data ◽

Rank Correlation ◽

Taxonomic Classification ◽

Supervised Machine Learning ◽

Biological Knowledge ◽

Alignment Free

As of February 20, 2020, the 2019 novel coronavirus (renamed to COVID-19) spread to 30 countries with 2130 deaths and more than 75500 confirmed cases. COVID-19 is being compared to the infamous SARS coronavirus, which resulted, between November 2002 and July 2003, in 8098 confirmed cases worldwide with a 9.6% death rate and 774 deaths. Though COVID-19 has a death rate of 2.8% as of 20 February, the 75752 confirmed cases in a few weeks (December 8, 2019 to February 20, 2020) are alarming, with cases likely being under-reported given the comparatively longer incubation period. Such outbreaks demand elucidation of taxonomic classification and origin of the virus genomic sequence, for strategic planning, containment, and treatment. This paper identifies an intrinsic COVID-19 genomic signature and uses it together with a machine learning-based alignment-free approach for an ultra-fast, scalable, and highly accurate classification of whole COVID-19 genomes. The proposed method combines supervised machine learning with digital signal processing for genome analyses, augmented by a decision tree approach to the machine learning component, and a Spearman’s rank correlation coefficient analysis for result validation. These tools are used to analyze a large dataset of over 5000 unique viral genomic sequences, totalling 61.8 million bp. Our results support a hypothesis of a bat origin and classify COVID-19 as Sarbecovirus, within Betacoronavirus. Our method achieves high levels of classification accuracy and discovers the most relevant relationships among over 5,000 viral genomes within a few minutes, ab initio, using raw DNA sequence data alone, and without any specialized biological knowledge, training, gene or genome annotations. This suggests that, for novel viral and pathogen genome sequences, this alignment-free whole-genome machine-learning approach can provide a reliable real-time option for taxonomic classification.
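The abstract above names Spearman's rank correlation coefficient as its validation tool. As a rough illustration of the statistic itself, and not of the authors' pipeline, the sketch below ranks two numeric vectors (ties receive their average rank) and computes the Pearson correlation of the ranks. The helper names `average_ranks` and `spearman` and the sample vectors are hypothetical.

```python
def average_ranks(values):
    """1-based ranks; tied values share the mean of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the block of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for t in range(i, j + 1):
            ranks[order[t]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho: Pearson correlation of the two rank vectors.
    Assumes equal lengths and that neither vector is constant."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    spread_x = sum((a - mean_x) ** 2 for a in rx) ** 0.5
    spread_y = sum((b - mean_y) ** 2 for b in ry) ** 0.5
    return cov / (spread_x * spread_y)


# Hypothetical example: two distance vectors that order items similarly.
print(spearman([0.1, 0.4, 0.2, 0.9], [1.0, 3.5, 2.0, 8.0]))
```

Because only the ordering of values matters, the coefficient is insensitive to monotonic rescaling of either vector, which is why it suits comparisons between differently scaled distance measures.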

Rolling Bearing Fault Diagnosis Based on STFT-Deep Learning and Sound Signals

Shock and Vibration ◽

10.1155/2016/6127479 ◽

2016 ◽

Vol 2016 ◽

pp. 1-12 ◽

Cited By ~ 48

Author(s):

Hongmei Liu ◽

Lianfeng Li ◽

Jian Ma

Keyword(s):

Fourier Transform ◽

Deep Learning ◽

Fault Diagnosis ◽

Rolling Bearing ◽

Short Time Fourier Transform ◽

Bearing Fault Diagnosis ◽

Fault Features ◽

Sparse Autoencoder ◽

Stacked Sparse Autoencoder ◽

Short Time

The main challenge of fault diagnosis lies in finding good fault features. A deep learning network has the ability to automatically learn good characteristics from input data in an unsupervised fashion, and its unique layer-wise pretraining and fine-tuning using the backpropagation strategy can solve the difficulties of training deep multilayer networks. Stacked sparse autoencoders or other deep architectures have shown excellent performance in speech recognition, face recognition, text classification, image recognition, and other application domains. Thus far, however, there have been very few research studies on deep learning in fault diagnosis. In this paper, a new rolling bearing fault diagnosis method that is based on short-time Fourier transform and stacked sparse autoencoder is first proposed; this method analyzes sound signals. After spectrograms are obtained by short-time Fourier transform, stacked sparse autoencoder is employed to automatically extract the fault features, and softmax regression is adopted as the method for classifying the fault modes. The proposed method, when applied to sound signals that are obtained from a rolling bearing test rig, is compared with empirical mode decomposition, Teager energy operator, and stacked sparse autoencoder when using vibration signals to verify the performance and effectiveness of the proposed method.
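The first stage described in the abstract above, turning a signal into a spectrogram with the short-time Fourier transform, can be sketched in plain Python. This is only an illustration under assumed settings: the frame length, hop size, and Hann window are choices made here, not parameters reported in the abstract, and `stft_magnitude` is a hypothetical name. The direct DFT is used for clarity rather than speed.

```python
import cmath
import math


def stft_magnitude(signal, frame_len=8, hop=4):
    """Magnitude spectrogram: one row per frame, one column per
    non-negative frequency bin (0 .. frame_len // 2)."""
    # Periodic Hann window (an assumed choice) to reduce spectral leakage.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        spectrum = []
        for k in range(frame_len // 2 + 1):
            # Direct DFT of the windowed frame at bin k.
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            spectrum.append(abs(acc))
        frames.append(spectrum)
    return frames


# A pure tone with two cycles per 8-sample frame should peak at bin 2.
tone = [math.cos(2 * math.pi * 2 * n / 8) for n in range(16)]
spectrogram = stft_magnitude(tone)
print([row.index(max(row)) for row in spectrogram])
```

In the pipeline the abstract describes, rows like these would then be fed to the stacked sparse autoencoder for feature extraction, with softmax regression classifying the fault modes.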

Deep learning models for bacteria taxonomic classification of metagenomic data

BMC Bioinformatics ◽

10.1186/s12859-018-2182-6 ◽

2018 ◽

Vol 19 (S7) ◽

Cited By ~ 29

Author(s):

Antonino Fiannaca ◽

Laura La Paglia ◽

Massimo La Rosa ◽

Giosue’ Lo Bosco ◽

Giovanni Renda ◽

...

Keyword(s):

Deep Learning ◽

Taxonomic Classification ◽

Metagenomic Data ◽

Learning Models
