Ontology-guided segmentation and object identification for developmental mouse lung immunofluorescent images

AbstractBackgroundImmunofluorescent confocal microscopy uses labeled antibodies as probes against specific macromolecules to discriminate between multiple cell types. For images of the developmental mouse lung, these cells are themselves organized into densely packed higher-level anatomical structures. These types of images can be challenging to segment automatically for several reasons, including the relevance of biomedical context, dependence on the specific set of probes used, prohibitive cost of generating labeled training data, as well as the complexity and dense packing of anatomical structures in the image. The use of an application ontology surmounts these challenges by combining image data with its metadata to provide a meaningful biological context, and hence constraining and simplifying the process of segmentation and object identification.ResultsWe propose an innovative approach for the automated analysis of complex and densely packed anatomical structures from immunofluorescent images that utilizes an application ontology to provide a simplified context for image segmentation and object identification. We describe how the logical organization of biological facts in the form of an ontology can provide useful constraints that enhance automatic processing of complex images. We demonstrate the results of ontology-guided segmentation and object identification in mouse developmental lung images from the Bioinformatics REsource ATlas for the Healthy lung (BREATH) database of the Molecular Atlas of Lung Development (LungMAP1) program.ConclusionThe microscopy analysis pipeline library (micap) is available at https://github.com/duke-lungmap-team/microscopy-analysis-pipeline. Code to reproduce our analysis of LungMAP images is also available at https://github.com/duke-lungmap-team/lungmap-pipeline. Finally, the application ontology is available at https://github.com/duke-lungmap-team/lung_ontology and includes example SPARQL queries.ContactAnna Maria Masci email: [email protected]

Download Full-text

Ontology-guided segmentation and object identification for developmental mouse lung immunofluorescent images

BMC Bioinformatics ◽

10.1186/s12859-021-04008-8 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Anna Maria Masci ◽

◽

Scott White ◽

Ben Neely ◽

Maryanne Ardini-Polaske ◽

...

Keyword(s):

Image Segmentation ◽

Contextual Information ◽

Image Data ◽

Cell Types ◽

Object Identification ◽

Training Data ◽

Mouse Lung ◽

Anatomical Structures ◽

Application Ontology ◽

Prohibitive Cost

Abstract Background Immunofluorescent confocal microscopy uses labeled antibodies as probes against specific macromolecules to discriminate between multiple cell types. For images of the developmental mouse lung, these cells are themselves organized into densely packed higher-level anatomical structures. These types of images can be challenging to segment automatically for several reasons, including the relevance of biomedical context, dependence on the specific set of probes used, prohibitive cost of generating labeled training data, as well as the complexity and dense packing of anatomical structures in the image. The use of an application ontology helps surmount these challenges by combining image data with its metadata to provide a meaningful biological context, modeled after how a human expert would make use of contextual information to identify histological structures, that constrains and simplifies the process of segmentation and object identification. Results We propose an innovative approach for the semi-supervised analysis of complex and densely packed anatomical structures from immunofluorescent images that utilizes an application ontology to provide a simplified context for image segmentation and object identification. We describe how the logical organization of biological facts in the form of an ontology can provide useful constraints that facilitate automatic processing of complex images. We demonstrate the results of ontology-guided segmentation and object identification in mouse developmental lung images from the Bioinformatics REsource ATlas for the Healthy lung database of the Molecular Atlas of Lung Development (LungMAP1) program Conclusion We describe a novel ontology-guided approach to segmentation and classification of complex immunofluorescence images of the developing mouse lung. The ontology is used to automatically generate constraints for each image based on its biomedical context, which facilitates image segmentation and classification.

Download Full-text

Automated analysis of 3D-echocardiography using spatially registered patient-specific CMR meshes

European Heart Journal - Cardiovascular Imaging ◽

10.1093/ehjci/jeaa356.425 ◽

2021 ◽

Vol 22 (Supplement_1) ◽

Author(s):

D Zhao ◽

E Ferdian ◽

GD Maso Talou ◽

GM Quill ◽

K Gilbert ◽

...

Keyword(s):

New Zealand ◽

Interobserver Variability ◽

Ground Truth ◽

Automated Analysis ◽

3D Echocardiography ◽

Training Data ◽

Patient Specific ◽

Manual Analysis ◽

Lv Mass ◽

3D Echo

Abstract Funding Acknowledgements Type of funding sources: Public grant(s) – National budget only. Main funding source(s): National Heart Foundation (NHF) of New Zealand Health Research Council (HRC) of New Zealand Artificial intelligence shows considerable promise for automated analysis and interpretation of medical images, particularly in the domain of cardiovascular imaging. While application to cardiac magnetic resonance (CMR) has demonstrated excellent results, automated analysis of 3D echocardiography (3D-echo) remains challenging, due to the lower signal-to-noise ratio (SNR), signal dropout, and greater interobserver variability in manual annotations. As 3D-echo is becoming increasingly widespread, robust analysis methods will substantially benefit patient evaluation. We sought to leverage the high SNR of CMR to provide training data for a convolutional neural network (CNN) capable of analysing 3D-echo. We imaged 73 participants (53 healthy volunteers, 20 patients with non-ischaemic cardiac disease) under both CMR and 3D-echo (<1 hour between scans). 3D models of the left ventricle (LV) were independently constructed from CMR and 3D-echo, and used to spatially align the image volumes using least squares fitting to a cardiac template. The resultant transformation was used to map the CMR mesh to the 3D-echo image. Alignment of mesh and image was verified through volume slicing and visual inspection (Fig. 1) for 120 paired datasets (including 47 rescans) each at end-diastole and end-systole. 100 datasets (80 for training, 20 for validation) were used to train a shallow CNN for mesh extraction from 3D-echo, optimised with a composite loss function consisting of normalised Euclidian distance (for 290 mesh points) and volume. Data augmentation was applied in the form of rotations and tilts (<15 degrees) about the long axis. The network was tested on the remaining 20 datasets (different participants) of varying image quality (Tab. I). For comparison, corresponding LV measurements from conventional manual analysis of 3D-echo and associated interobserver variability (for two observers) were also estimated. Initial results indicate that the use of embedded CMR meshes as training data for 3D-echo analysis is a promising alternative to manual analysis, with improved accuracy and precision compared with conventional methods. Further optimisations and a larger dataset are expected to improve network performance. (n = 20) LV EDV (ml) LV ESV (ml) LV EF (%) LV mass (g) Ground truth CMR 150.5 ± 29.5 57.9 ± 12.7 61.5 ± 3.4 128.1 ± 29.8 Algorithm error -13.3 ± 15.7 -1.4 ± 7.6 -2.8 ± 5.5 0.1 ± 20.9 Manual error -30.1 ± 21.0 -15.1 ± 12.4 3.0 ± 5.0 Not available Interobserver error 19.1 ± 14.3 14.4 ± 7.6 -6.4 ± 4.8 Not available Tab. 1. LV mass and volume differences (means ± standard deviations) for 20 test cases. Algorithm: CNN – CMR (as ground truth). Abstract Figure. Fig 1. CMR mesh registered to 3D-echo.

Download Full-text

The antiandrogen enzalutamide downregulates TMPRSS2 and reduces cellular entry of SARS-CoV-2 in human lung cells

Nature Communications ◽

10.1038/s41467-021-24342-y ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

D. A. Leach ◽

A. Mohr ◽

E. S. Giotis ◽

E. Cil ◽

A. M. Isac ◽

...

Keyword(s):

Androgen Receptor ◽

Human Lung ◽

Cell Types ◽

Receptor Activation ◽

Surface Proteins ◽

Mouse Lung ◽

Lung Cells ◽

Human Lung Cells ◽

Cellular Entry ◽

Experimental Data Analysis

AbstractSARS-CoV-2 attacks various organs, most destructively the lung, and cellular entry requires two host cell surface proteins: ACE2 and TMPRSS2. Downregulation of one or both of these is thus a potential therapeutic approach for COVID-19. TMPRSS2 is a known target of the androgen receptor, a ligand-activated transcription factor; androgen receptor activation increases TMPRSS2 levels in various tissues, most notably prostate. We show here that treatment with the antiandrogen enzalutamide—a well-tolerated drug widely used in advanced prostate cancer—reduces TMPRSS2 levels in human lung cells and in mouse lung. Importantly, antiandrogens significantly reduced SARS-CoV-2 entry and infection in lung cells. In support of this experimental data, analysis of existing datasets shows striking co-expression of AR and TMPRSS2, including in specific lung cell types targeted by SARS-CoV-2. Together, the data presented provides strong evidence to support clinical trials to assess the efficacy of antiandrogens as a treatment option for COVID-19.

Download Full-text

Automated analysis of filopodial length and spatially resolved protein concentration via adaptive shape tracking

Molecular Biology of the Cell ◽

10.1091/mbc.e16-06-0406 ◽

2016 ◽

Vol 27 (22) ◽

pp. 3616-3626 ◽

Cited By ~ 4

Author(s):

Tanumoy Saha ◽

Isabel Rathmann ◽

Abhiyan Viplav ◽

Sadhana Panzade ◽

Isabell Begemann ◽

...

Keyword(s):

Protein Concentration ◽

Growth Dynamics ◽

Automated Analysis ◽

Cell Types ◽

Control Mechanisms ◽

Spatially Resolved ◽

Novel Approach ◽

Associated Proteins

Filopodia are dynamic, actin-rich structures that transiently form on a variety of cell types. To understand the underlying control mechanisms requires precise monitoring of localization and concentration of individual regulatory and structural proteins as filopodia elongate and subsequently retract. Although several methods exist that analyze changes in filopodial shape, a software solution to reliably correlate growth dynamics with spatially resolved protein concentration along the filopodium independent of bending, lateral shift, or tilting is missing. Here we introduce a novel approach based on the convex-hull algorithm for parallel analysis of growth dynamics and relative spatiotemporal protein concentration along flexible filopodial protrusions. Detailed in silico tests using various geometries confirm that our technique accurately tracks growth dynamics and relative protein concentration along the filopodial length for a broad range of signal distributions. To validate our technique in living cells, we measure filopodial dynamics and quantify spatiotemporal localization of filopodia-associated proteins during the filopodial extension–retraction cycle in a variety of cell types in vitro and in vivo. Together these results show that the technique is suitable for simultaneous analysis of growth dynamics and spatiotemporal protein enrichment along filopodia. To allow readily application by other laboratories, we share source code and instructions for software handling.

Download Full-text

Proteomics of Cerebrospinal Fluid: Throughput and Robustness Using a Scalable Automated Analysis Pipeline for Biomarker Discovery

Analytical Chemistry ◽

10.1021/acs.analchem.5b02748 ◽

2015 ◽

Vol 87 (21) ◽

pp. 10755-10761 ◽

Cited By ~ 23

Author(s):

Antonio Núñez Galindo ◽

Martin Kussmann ◽

Loïc Dayon

Keyword(s):

Cerebrospinal Fluid ◽

Biomarker Discovery ◽

Automated Analysis ◽

Analysis Pipeline

Download Full-text

High-throughput super-resolution analysis of influenza virus pleomorphism reveals insights into viral spatial organization

10.1101/2021.09.23.461536 ◽

2021 ◽

Author(s):

Andrew McMahon ◽

Rebecca Andrews ◽

Sohail V Ghani ◽

Thorben Cordes ◽

Achillefs N Kapanidis ◽

...

Keyword(s):

Large Scale ◽

Spatial Organization ◽

Structural Information ◽

Virus Assembly ◽

Super Resolution ◽

Automated Analysis ◽

Size Analysis ◽

Analysis Pipeline ◽

Single Experiment ◽

Viral Immunology

Many viruses form highly pleomorphic particles; in influenza, these particles range from spheres of ~ 100 nm in diameter to filaments of several microns in length. Virion structure is of interest, not only in the context of virus assembly, but also because pleomorphic variations may correlate with infectivity and pathogenicity. Detailed images of virus morphology often rely on electron microscopy, which is generally low throughput and limited in molecular identification. We have used fluorescence super-resolution microscopy combined with a rapid automated analysis pipeline to image many thousands of individual influenza virions, gaining information on their size, morphology and the distribution of membrane-embedded and internal proteins. This large-scale analysis revealed that influenza particles can be reliably characterised by length, that no spatial frequency patterning of the surface glycoproteins occurs, and that RNPs are preferentially located towards filament ends within Archetti bodies. Our analysis pipeline is versatile and can be adapted for use on multiple other pathogens, as demonstrated by its application for the size analysis of SARS-CoV-2. The ability to gain nanoscale structural information from many thousands of viruses in just a single experiment is valuable for the study of virus assembly mechanisms, host cell interactions and viral immunology, and should be able to contribute to the development of viral vaccines, anti-viral strategies and diagnostics.

Download Full-text

DANNP: an efficient artificial neural network pruning tool

PeerJ Computer Science ◽

10.7717/peerj-cs.137 ◽

2017 ◽

Vol 3 ◽

pp. e137 ◽

Cited By ~ 7

Author(s):

Mona Alshahrani ◽

Othman Soufan ◽

Arturo Magana-Mora ◽

Vladimir B. Bajic

Keyword(s):

Neural Network ◽

State Of The Art ◽

Model Performance ◽

Training Data ◽

Classification Problems ◽

Link Type ◽

On Line ◽

Pruning Algorithms ◽

Artificial Neural ◽

The Impact

Background Artificial neural networks (ANNs) are a robust class of machine learning models and are a frequent choice for solving classification problems. However, determining the structure of the ANNs is not trivial as a large number of weights (connection links) may lead to overfitting the training data. Although several ANN pruning algorithms have been proposed for the simplification of ANNs, these algorithms are not able to efficiently cope with intricate ANN structures required for complex classification problems. Methods We developed DANNP, a web-based tool, that implements parallelized versions of several ANN pruning algorithms. The DANNP tool uses a modified version of the Fast Compressed Neural Network software implemented in C++ to considerably enhance the running time of the ANN pruning algorithms we implemented. In addition to the performance evaluation of the pruned ANNs, we systematically compared the set of features that remained in the pruned ANN with those obtained by different state-of-the-art feature selection (FS) methods. Results Although the ANN pruning algorithms are not entirely parallelizable, DANNP was able to speed up the ANN pruning up to eight times on a 32-core machine, compared to the serial implementations. To assess the impact of the ANN pruning by DANNP tool, we used 16 datasets from different domains. In eight out of the 16 datasets, DANNP significantly reduced the number of weights by 70%–99%, while maintaining a competitive or better model performance compared to the unpruned ANN. Finally, we used a naïve Bayes classifier derived with the features selected as a byproduct of the ANN pruning and demonstrated that its accuracy is comparable to those obtained by the classifiers trained with the features selected by several state-of-the-art FS methods. The FS ranking methodology proposed in this study allows the users to identify the most discriminant features of the problem at hand. To the best of our knowledge, DANNP (publicly available at www.cbrc.kaust.edu.sa/dannp) is the only available and on-line accessible tool that provides multiple parallelized ANN pruning options. Datasets and DANNP code can be obtained at www.cbrc.kaust.edu.sa/dannp/data.php and https://doi.org/10.5281/zenodo.1001086.

Download Full-text

Virtual ChIP-seq: predicting transcription factor binding by learning from the transcriptome

10.1101/168419 ◽

2018 ◽

Cited By ~ 10

Author(s):

Mehran Karimzadeh ◽

Michael M. Hoffman

Keyword(s):

Transcription Factor ◽

Transcription Factors ◽

Binding Sites ◽

Cell Types ◽

Transcription Factor Binding ◽

Regulatory Function ◽

Factor Binding ◽

Link Type ◽

Genomic Regions ◽

Factor Sequence

AbstractMotivationIdentifying transcription factor binding sites is the first step in pinpointing non-coding mutations that disrupt the regulatory function of transcription factors and promote disease. ChIP-seq is the most common method for identifying binding sites, but performing it on patient samples is hampered by the amount of available biological material and the cost of the experiment. Existing methods for computational prediction of regulatory elements primarily predict binding in genomic regions with sequence similarity to known transcription factor sequence preferences. This has limited efficacy since most binding sites do not resemble known transcription factor sequence motifs, and many transcription factors are not even sequence-specific.ResultsWe developed Virtual ChIP-seq, which predicts binding of individual transcription factors in new cell types using an artificial neural network that integrates ChIP-seq results from other cell types and chromatin accessibility data in the new cell type. Virtual ChIP-seq also uses learned associations between gene expression and transcription factor binding at specific genomic regions. This approach outperforms methods that predict TF binding solely based on sequence preference, pre-dicting binding for 36 transcription factors (Matthews correlation coefficient > 0.3).AvailabilityThe datasets we used for training and validation are available at https://virchip.hoffmanlab.org. We have deposited in Zenodo the current version of our software (http://doi.org/10.5281/zenodo.1066928), datasets (http://doi.org/10.5281/zenodo.823297), predictions for 36 transcription factors on Roadmap Epigenomics cell types (http://doi.org/10.5281/zenodo.1455759), and predictions in Cistrome as well as ENCODE-DREAM in vivo TF Binding Site Prediction Challenge (http://doi.org/10.5281/zenodo.1209308).

Download Full-text

ICTD: A semi-supervised cell type identification and deconvolution method for multi-omics data

10.1101/426593 ◽

2018 ◽

Cited By ~ 2

Author(s):

Wennan Chang ◽

Changlin Wan ◽

Xiaoyu Lu ◽

Szu-wei Tu ◽

Yifan Sun ◽

...

Keyword(s):

Single Cell ◽

Cell Types ◽

Training Data ◽

Marker Genes ◽

Cell Detection ◽

Omics Data ◽

Deconvolution Method ◽

Cell Type ◽

Data Set ◽

Cell Type Specific

AbstractWe developed a novel deconvolution method, namely Inference of Cell Types and Deconvolution (ICTD) that addresses the fundamental issue of identifiability and robustness in current tissue data deconvolution problem. ICTD provides substantially new capabilities for omics data based characterization of a tissue microenvironment, including (1) maximizing the resolution in identifying resident cell and sub types that truly exists in a tissue, (2) identifying the most reliable marker genes for each cell type, which are tissue and data set specific, (3) handling the stability problem with co-linear cell types, (4) co-deconvoluting with available matched multi-omics data, and (5) inferring functional variations specific to one or several cell types. ICTD is empowered by (i) rigorously derived mathematical conditions of identifiable cell type and cell type specific functions in tissue transcriptomics data and (ii) a semi supervised approach to maximize the knowledge transfer of cell type and functional marker genes identified in single cell or bulk cell data in the analysis of tissue data, and (iii) a novel unsupervised approach to minimize the bias brought by training data. Application of ICTD on real and single cell simulated tissue data validated that the method has consistently good performance for tissue data coming from different species, tissue microenvironments, and experimental platforms. Other than the new capabilities, ICTD outperformed other state-of-the-art devolution methods on prediction accuracy, the resolution of identifiable cell, detection of unknown sub cell types, and assessment of cell type specific functions. The premise of ICTD also lies in characterizing cell-cell interactions and discovering cell types and prognostic markers that are predictive of clinical outcomes.

Download Full-text

gprofiler2 -- an R package for gene list functional enrichment analysis and namespace conversion toolset g:Profiler

F1000Research ◽

10.12688/f1000research.24956.2 ◽

2020 ◽

Vol 9 ◽

pp. 709 ◽

Cited By ~ 1

Author(s):

Liis Kolberg ◽

Uku Raudvere ◽

Ivan Kuzmin ◽

Jaak Vilo ◽

Hedi Peterson

Keyword(s):

Gene List ◽

Enrichment Analysis ◽

Functional Enrichment Analysis ◽

Automated Analysis ◽

R Package ◽

Biological Data ◽

Functional Enrichment ◽

Link Type ◽

Functional Profiling ◽

Rest Api

g:Profiler (https://biit.cs.ut.ee/gprofiler) is a widely used gene list functional profiling and namespace conversion toolset that has been contributing to reproducible biological data analysis already since 2007. Here we introduce the accompanying R package, gprofiler2, developed to facilitate programmatic access to g:Profiler computations and databases via REST API. The gprofiler2 package provides an easy-to-use functionality that enables researchers to incorporate functional enrichment analysis into automated analysis pipelines written in R. The package also implements interactive visualisation methods to help to interpret the enrichment results and to illustrate them for publications. In addition, gprofiler2 gives access to the versatile gene/protein identifier conversion functionality in g:Profiler enabling to map between hundreds of different identifier types or orthologous species. The gprofiler2 package is freely available at the CRAN repository.

Download Full-text