A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Identifying viruses from metagenomic data using deep learning

Quantitative Biology ◽

10.1007/s40484-019-0187-4 ◽

2020 ◽

Vol 8 (1) ◽

pp. 64-77 ◽

Cited By ~ 20

Author(s):

Jie Ren ◽

Kai Song ◽

Chao Deng ◽

Nathan A. Ahlgren ◽

Jed A. Fuhrman ◽

...

Keyword(s):

Deep Learning ◽

Metagenomic Data

Download Full-text

A cost-benefit analysis of GPU-based EC2 instances for a deep learning algorithm

10.5753/eradsp.2019.13588 ◽

2019 ◽

Author(s):

Eva Malta ◽

Charles Rodamilans ◽

Sandra Avila ◽

Edson Borin

Keyword(s):

Deep Learning ◽

Virtual Machine ◽

Learning Algorithm ◽

Cost Benefit Analysis ◽

Cost Benefit ◽

Benefit Analysis ◽

Machine Type ◽

Deep Learning Algorithm ◽

Batch Sizes ◽

The Cost

This paper analyzes the cost-benefit of using EC2 instances, specif- ically the p2 and p3 virtual machine types, which have GPU accelerators, to execute a machine learning algorithm. This analysis includes the runtime of a convolutional neural network executions, and it takes into consideration the necessary time to stabilize the accuracy value with different batch sizes. Also, we measure the cost of using each machine type, and we define a relation be- tween this cost and the execution time for each virtual machine. The results show that, although the price per hour of the p3 instance is three times bigger, it is faster and costs almost the same as the p2 instance type to train the deep learning algorithm.

Download Full-text

Positional SHAP for Interpretation of Deep Learning Models Trained from Biological Sequences

10.1101/2021.03.04.433939 ◽

2021 ◽

Author(s):

Quinn Dickinson ◽

Jesse G. Meyer

Keyword(s):

Deep Learning ◽

Rhesus Macaque ◽

Short Term Memory ◽

Peptide Binding ◽

Disease Diagnosis ◽

Biological Sequences ◽

Mhc I ◽

Binding Motifs ◽

Model Interpretation ◽

Biological Phenomena

AbstractMachine learning with artificial neural networks, also known as “deep learning”, accurately predicts biological phenomena such as disease diagnosis and protein structure. Despite the ability of deep learning to make accurate biological predictions, a challenge is model interpretation, which is especially challenging for recurrent neural network architectures due to the sequential input data. Here we train multi-output long short-term memory (LSTM) regression models to predict peptide binding affinity to five rhesus macaque major histocompatibility complex (MHC) I alleles. We adapt SHapely Additive exPlanations (SHAP) to generate positional model interpretations of which amino acids are important for peptide binding. These positional SHAP values reproduced known rhesus macaque MHC class I (Mamu-A1*001) peptide binding motifs and provided insights into inter-positional dependencies of peptide-MHC interactions. Positional SHAP should find widespread utility for interpreting a variety of models trained from biological sequences.

Download Full-text

Toward Deep Learning Approaches for Learning Structure Motifs and Classifying Biological Sequences From RNA A-to-I Editing Events

IEEE Access ◽

10.1109/access.2019.2939281 ◽

2019 ◽

Vol 7 ◽

pp. 127464-127474

Author(s):

Thi Kieu Khanh Ho ◽

Jeonghwan Gwak

Keyword(s):

Deep Learning ◽

Learning Approaches ◽

Biological Sequences

Download Full-text

Faculty Opinions recommendation of DeepARG: a deep learning approach for predicting antibiotic resistance genes from metagenomic data.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.732589817.793574758 ◽

2020 ◽

Author(s):

Elhanan Borenstein ◽

Efrat Muller

Keyword(s):

Antibiotic Resistance ◽

Deep Learning ◽

Resistance Genes ◽

Antibiotic Resistance Genes ◽

Metagenomic Data ◽

Learning Approach

Download Full-text

Deep learning models for bacteria taxonomic classification of metagenomic data

BMC Bioinformatics ◽

10.1186/s12859-018-2182-6 ◽

2018 ◽

Vol 19 (S7) ◽

Cited By ~ 29

Author(s):

Antonino Fiannaca ◽

Laura La Paglia ◽

Massimo La Rosa ◽

Giosue’ Lo Bosco ◽

Giovanni Renda ◽

...

Keyword(s):

Deep Learning ◽

Taxonomic Classification ◽

Metagenomic Data ◽

Learning Models

Download Full-text

DeepARG: a deep learning approach for predicting antibiotic resistance genes from metagenomic data

Microbiome ◽

10.1186/s40168-018-0401-z ◽

2018 ◽

Vol 6 (1) ◽

Cited By ~ 115

Author(s):

Gustavo Arango-Argoty ◽

Emily Garner ◽

Amy Pruden ◽

Lenwood S. Heath ◽

Peter Vikesland ◽

...

Keyword(s):

Antibiotic Resistance ◽

Deep Learning ◽

Resistance Genes ◽

Antibiotic Resistance Genes ◽

Metagenomic Data ◽

Learning Approach

Download Full-text

Analyzing Large Microbiome Datasets Using Machine Learning and Big Data

BioMedInformatics ◽

10.3390/biomedinformatics1030010 ◽

2021 ◽

Vol 1 (3) ◽

pp. 138-165

Author(s):

Thomas Krause ◽

Jyotsna Talreja Wassan ◽

Paul Mc Kevitt ◽

Haiying Wang ◽

Huiru Zheng ◽

...

Keyword(s):

Machine Learning ◽

Big Data ◽

Deep Learning ◽

Machine Learning Algorithms ◽

Metagenomic Data ◽

Data Sets ◽

Raw Data ◽

Public And Private ◽

Rumen Microbiome

Metagenomics promises to provide new valuable insights into the role of microbiomes in eukaryotic hosts such as humans. Due to the decreasing costs for sequencing, public and private repositories for human metagenomic datasets are growing fast. Metagenomic datasets can contain terabytes of raw data, which is a challenge for data processing but also an opportunity for advanced machine learning methods like deep learning that require large datasets. However, in contrast to classical machine learning algorithms, the use of deep learning in metagenomics is still an exception. Regardless of the algorithms used, they are usually not applied to raw data but require several preprocessing steps. Performing this preprocessing and the actual analysis in an automated, reproducible, and scalable way is another challenge. This and other challenges can be addressed by adjusting known big data methods and architectures to the needs of microbiome analysis and DNA sequence processing. A conceptual architecture for the use of machine learning and big data on metagenomic data sets was recently presented and initially validated to analyze the rumen microbiome. The same architecture can be used for clinical purposes as is discussed in this paper.

Download Full-text

AMBIENT: Accelerated Convolutional Neural Network Architecture Search for Regulatory Genomics

10.1101/2021.02.25.432960 ◽

2021 ◽

Author(s):

Zijun Zhang ◽

Evan M. Cofer ◽

Olga G. Troyanskaya

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Network ◽

Network Architecture ◽

Environmental Issue ◽

Biological Sequences ◽

Neural Network Architecture ◽

Computing Power ◽

Neural Architecture

Convolutional neural networks (CNN) have become a standard approach for modeling genomic sequences. CNNs can be effectively built by Neural Architecture Search (NAS) by trading computing power for accurate neural architectures. Yet, the consumption of immense computing power is a major practical, financial, and environmental issue for deep learning. Here, we present a novel NAS framework, AMBIENT, that generates highly accurate CNN architectures for biological sequences of diverse functions, while substantially reducing the computing cost of conventional NAS.

Download Full-text

A Study on the Stimulating Strategy of Deep Learning in the Cultivation of Innovative Thinking of Computer Professionals

Advances in Education ◽

10.12677/ae.2017.76062 ◽

2017 ◽

Vol 07 (06) ◽

pp. 390-396

Author(s):

光伟徐

Keyword(s):

Deep Learning ◽

Computer Professionals ◽

Innovative Thinking

Download Full-text