Accurate classification of protein subcellular localization from high throughput microscopy images using deep learning

10.1101/050757 ◽

2016 ◽

Cited By ~ 4

Author(s):

Tanel Pärnamaa ◽

Leopold Parts

Keyword(s):

Deep Learning ◽

Subcellular Localization ◽

High Throughput ◽

Single Cells ◽

Cellular Compartment ◽

Cell Localization ◽

Image Characteristics ◽

Basic Image ◽

Training Examples

High throughput microscopy of many single cells generates high-dimensional data that are far from straightforward to analyze. One important problem is automatically detecting the cellular compartment where a fluorescently tagged protein resides, a task relatively simple for an experienced human, but difficult to automate on a computer. Here, we train an 11-layer neural network on data from mapping thousands of yeast proteins, achieving per cell localization classification accuracy of 91%, and per protein accuracy of 99% on held out images. We confirm that low-level network features correspond to basic image characteristics, while deeper layers separate localization classes. Using this network as a feature calculator, we train standard classifiers that assign proteins to previously unseen compartments after observing only a small number of training examples. Our results are the most accurate subcellular localization classifications to date, and demonstrate the usefulness of deep learning for high throughput microscopy.
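
The abstract reports both a per-cell accuracy (91%) and a per-protein accuracy (99%), but this listing does not say how cell-level predictions are pooled into a protein-level call. As a minimal sketch only, assuming a simple majority vote over the cells imaged for each protein (not necessarily the authors' method), the pooling step could look like this. The function name, the protein identifiers, and the toy data are hypothetical.

```python
from collections import Counter, defaultdict


def per_protein_calls(cell_predictions):
    """Pool per-cell labels into one label per protein by majority vote.

    cell_predictions: iterable of (protein_id, predicted_compartment) pairs,
    one pair per classified cell. Illustrative assumption, not the paper's API.
    """
    votes = defaultdict(Counter)
    for protein_id, compartment in cell_predictions:
        votes[protein_id][compartment] += 1
    # most_common(1) yields the compartment with the highest vote count
    return {pid: counts.most_common(1)[0][0] for pid, counts in votes.items()}


# Toy data: one of three cells for protA is misclassified,
# yet the pooled per-protein call is still correct.
cells = [
    ("protA", "nucleus"),
    ("protA", "nucleus"),
    ("protA", "cytoplasm"),
    ("protB", "mitochondria"),
    ("protB", "mitochondria"),
]
calls = per_protein_calls(cells)
print(calls)
```

Pooling of this kind is one way a per-protein figure can exceed the per-cell figure: isolated cell-level errors are outvoted when most cells of the same protein are classified correctly.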

Accurate Classification of Protein Subcellular Localization from High-Throughput Microscopy Images Using Deep Learning

G3 Genes|Genome|Genetics ◽

10.1534/g3.116.033654 ◽

2017 ◽

Vol 7 (5) ◽

pp. 1385-1392 ◽

Cited By ~ 58

Author(s):

Tanel Pärnamaa ◽

Leopold Parts

Keyword(s):

Deep Learning ◽

Subcellular Localization ◽

High Throughput ◽

Protein Subcellular Localization ◽

Microscopy Images

Deep Learning-Based Classification of Protein Subcellular Localization from Immunohistochemistry Images

2017 4th IAPR Asian Conference on Pattern Recognition (ACPR) ◽

10.1109/acpr.2017.125 ◽

2017 ◽

Author(s):

Jin-Xian Hu ◽

Ying-Ying Xu ◽

Yang-Yang ◽

Hong-Bin Shen

Keyword(s):

Deep Learning ◽

Subcellular Localization ◽

Protein Subcellular Localization

High-throughput precision measurement of subcellular localization in single cells

Cytometry Part A ◽

10.1002/cyto.a.23054 ◽

2017 ◽

Vol 91 (2) ◽

pp. 180-189 ◽

Cited By ~ 7

Author(s):

Tyler J. Burns ◽

Andreas P. Frei ◽

Pier F. Gherardini ◽

Felice A. Bava ◽

Jake E. Batchelder ◽

...

Keyword(s):

Subcellular Localization ◽

High Throughput ◽

Precision Measurement ◽

Single Cells

Deep learning enables high-throughput early detection and classification of bacterial colonies using time-lapse coherent imaging (Conference Presentation)

Optics and Biophotonics in Low-Resource Settings VI ◽

10.1117/12.2547399 ◽

2020 ◽

Author(s):

Hongda Wang ◽

Hatice C. Koydemir ◽

Yunzhe Qiu ◽

Bijie Bai ◽

Yibo Zhang ◽

...

Keyword(s):

Deep Learning ◽

Early Detection ◽

High Throughput ◽

Time Lapse ◽

Coherent Imaging

Image-based taxonomic classification of bulk biodiversity samples using deep learning and domain adaptation

10.1101/2021.12.22.473797 ◽

2021 ◽

Author(s):

Tomochika Fujisawa ◽

Victor Noguerales ◽

Emmanouil Meramveliotakis ◽

Anna Papadopoulou ◽

Alfried P Vogler

Keyword(s):

Deep Learning ◽

High Throughput ◽

Domain Adaptation ◽

Network Models ◽

Neural Network Models ◽

Data Set ◽

Model Training ◽

Trained Neural Network ◽

Domain Transfer

Complex bulk samples of invertebrates from biodiversity surveys present a great challenge for taxonomic identification, especially if obtained from unexplored ecosystems. High-throughput imaging combined with machine learning for rapid classification could overcome this bottleneck. Developing such procedures requires that taxonomic labels from an existing source data set are used for model training and prediction of an unknown target sample. Yet the feasibility of transfer learning for the classification of unknown samples remains to be tested. Here, we assess the efficiency of deep learning and domain transfer algorithms for family-level classification of below-ground bulk samples of Coleoptera from understudied forests of Cyprus. We trained neural network models with images from local surveys versus global databases of above-ground samples from tropical forests and evaluated how prediction accuracy was affected by: (a) the quality and resolution of images, (b) the size and complexity of the training set and (c) the transferability of identifications across very disparate source-target pairs that do not share any species or genera. Within-dataset classification accuracy reached 98% and depended on the number and quality of training images and on dataset complexity. The accuracy of between-datasets predictions was reduced to a maximum of 82% and depended greatly on the standardisation of the imaging procedure. When the source and target images were of similar quality and resolution, albeit from different faunas, the reduction of accuracy was minimal. Application of algorithms for domain adaptation significantly improved the prediction performance of models trained by non-standardised, low-quality images. Our findings demonstrate that existing databases can be used to train models and successfully classify images from unexplored biota, when the imaging conditions and classification algorithms are carefully considered. Also, our results provide guidelines for data acquisition and algorithmic development for high-throughput image-based biodiversity surveys.

HD Spot: Interpretable Deep Learning Classification of Single Cell Transcript Data

10.1101/822759 ◽

2019 ◽

Cited By ~ 1

Author(s):

Eric Prince ◽

Todd C. Hankinson

Keyword(s):

Deep Learning ◽

Single Cell ◽

High Throughput ◽

Ground Truth ◽

Sequencing Technologies ◽

Bioinformatic Tool ◽

Complex Relationships ◽

Insight Into ◽

Generation Sequencing

High throughput data is commonplace in biomedical research as seen with technologies such as single-cell RNA sequencing (scRNA-seq) and other Next Generation Sequencing technologies. As these techniques continue to be increasingly utilized it is critical to have analysis tools that can identify meaningful complex relationships between variables (i.e., in the case of scRNA-seq: genes) in a way such that human bias is absent. Moreover, it is equally paramount that both linear and non-linear (i.e., one-to-many) variable relationships be considered when contrasting datasets. HD Spot is a deep learning-based framework that generates an optimal interpretable classifier for a given high-throughput dataset using a simple genetic algorithm as well as an autoencoder-to-classifier transfer learning approach. Using four unique publicly available scRNA-seq datasets with published ground truth, we demonstrate the robustness of HD Spot and the ability to identify ontologically accurate gene lists for a given data subset. HD Spot serves as a bioinformatic tool to allow novice and advanced analysts to gain complex insight into their respective datasets, enabling novel hypothesis development.

Convolutional neural networks for high throughput screening of catalyst layer inks for polymer electrolyte fuel cells

RSC Advances ◽

10.1039/d1ra05324h ◽

2021 ◽

Vol 11 (51) ◽

pp. 32126-32134

Author(s):

Mohammad J. Eslamibidgoli ◽

Fabian P. Tipp ◽

Jenia Jitsev ◽

Jasna Jankovic ◽

Michael H. Eikerling ◽

...

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Fuel Cells ◽

Polymer Electrolyte ◽

Convolutional Neural Networks ◽

High Throughput ◽

High Throughput Screening ◽

Catalyst Layer ◽

Polymer Electrolyte Fuel Cells

Deep learning enables the robust and accurate classification of the TEM images of catalyst layer inks for the polymer electrolyte fuel cells.

A DEEP LEARNING FRAMEWORK FOR CLASSIFICATION OF SEVERITY IN CHRONIC OBSTRUCTIVE PULMONARY DISEASE (COPD)

10.26226/morressier.5ade45fed462b8029238e7b4 ◽

2018 ◽

Author(s):

Roger Tam

Keyword(s):

Chronic Obstructive Pulmonary Disease ◽

Deep Learning ◽

Pulmonary Disease ◽

Chronic Obstructive ◽

Obstructive Pulmonary Disease ◽

Learning Framework

Deep learning for cell-specific high-throughput quantification of oligodendrocyte ensheathment

10.26226/morressier.5b719e475aff74008ae4cd05 ◽

2018 ◽

Author(s):

Jack Antel

Keyword(s):

Deep Learning ◽

High Throughput

Convolutional Neural Network of Atomic Surface Structures to Predict Binding Energies for High-Throughput Screening of Catalysts

10.26434/chemrxiv.8150666.v1 ◽

2019 ◽

Author(s):

Seoin Back ◽

Junwoong Yoon ◽

Nianhan Tian ◽

Wen Zhong ◽

Kevin Tran ◽

...

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

High Throughput ◽

High Throughput Screening ◽

Binding Energies ◽

Surface Structures ◽

Voronoi Polyhedra ◽

Atomic Surface

We present an application of deep-learning convolutional neural network of atomic surface structures using atomic and Voronoi polyhedra-based neighbor information to predict adsorbate binding energies for the application in catalysis.
