scMARK an 'MNIST' like benchmark to evaluate and optimize models for unifying scRNA data

Mapping Intimacies ◽

10.1101/2021.12.08.471773 ◽

2021 ◽

Author(s):

Swechha Singh ◽

Dylan Mendonca ◽

Octavian Focsa ◽

Juan Javier Diaz-Mejia ◽

Sam Cooper

Keyword(s):

Single Cell ◽

Benchmark Dataset ◽

Rna Analysis ◽

Analysis Tools ◽

Variational Autoencoder ◽

Well Models ◽

The Way

Today's single-cell RNA analysis tools provide enormous value in enabling researchers to make sense of large single-cell RNA (scRNA) studies, yet their ability to integrate different studies at scale remains untested. Here we present a novel benchmark dataset (scMARK), that consists of 100,000 cells over 10 studies and can test how well models unify data from different scRNA studies. We also introduce a two-step framework that uses supervised models, to evaluate how well unsupervised models integrate scRNA data from the 10 studies. Using this framework, we show that the Variational Autoencoder, scVI, represents the only tool tested that can integrate scRNA studies at scale. Overall, this work paves the way to creating large scRNA atlases and 'off-the-shelf' analysis tools.

Download Full-text

Recent Advances in Single-Cell Profiling and Multispecific Therapeutics: Paving the Way for a New Era of Precision Medicine Targeting Cardiac Fibroblasts

Current Cardiology Reports ◽

10.1007/s11886-021-01517-z ◽

2021 ◽

Vol 23 (7) ◽

Author(s):

Sally Yu Shi ◽

Xin Luo ◽

Tracy M. Yamawaki ◽

Chi-Ming Li ◽

Brandon Ason ◽

...

Keyword(s):

Heart Failure ◽

Precision Medicine ◽

Single Cell ◽

Cardiac Fibroblasts ◽

Rapid Development ◽

Cardiac Fibroblast ◽

New Era ◽

Fibroblast Activation ◽

Cell Gene Expression ◽

The Way

Abstract Purpose of Review Cardiac fibroblast activation contributes to fibrosis, maladaptive remodeling and heart failure progression. This review summarizes the latest findings on cardiac fibroblast activation dynamics derived from single-cell transcriptomic analyses and discusses how this information may aid the development of new multispecific medicines. Recent Findings Advances in single-cell gene expression technologies have led to the discovery of distinct fibroblast subsets, some of which are more prevalent in diseased tissue and exhibit temporal changes in response to injury. In parallel to the rapid development of single-cell platforms, the advent of multispecific therapeutics is beginning to transform the biopharmaceutical landscape, paving the way for the selective targeting of diseased fibroblast subpopulations. Summary Insights gained from single-cell technologies reveal critical cardiac fibroblast subsets that play a pathogenic role in the progression of heart failure. Combined with the development of multispecific therapeutic agents that have enabled access to previously “undruggable” targets, we are entering a new era of precision medicine.

Download Full-text

ME-VAE: Multi-Encoder Variational AutoEncoder for Controlling Multiple Transformational Features in Single Cell Image Analysis

10.1101/2021.04.22.441005 ◽

2021 ◽

Author(s):

Luke Ternes ◽

Mark Dane ◽

Marilyne Labrie ◽

Gordon Mills ◽

Joe Gray ◽

...

Keyword(s):

Image Analysis ◽

Single Cell ◽

Imaging Features ◽

Phenotypic Differences ◽

Cell Image ◽

Intensity Measurements ◽

Quantitative Measurements ◽

Variational Autoencoder ◽

Cell Image Analysis ◽

Organizational Features

AbstractImage-based cell phenotyping relies on quantitative measurements as encoded representations of cells; however, defining suitable representations that capture complex imaging features is challenging since there are many obstacles, including segmentation and identifying subcellular compartments for feature extraction. Variational autoencoder (VAE) approaches produce encouraging results by mapping from an image to a representative descriptor, and outperform classical hand-crafted features for morphology, intensity, and texture at differentiating data. Although VAEs show promising results for capturing morphological and organizational features in tissue, single cell image analyses based on VAEs often fail to identify biologically informative features due to the intrinsic amount of uninformative variability. Herein, we propose a multi-encoder VAE (ME-VAE) in single cell image analysis using transformed images as a self-supervised signal to extract transform-invariant biologically meaningful features. We show that the proposed architecture improves analysis by making distinct populations more separable compared to traditional VAEs and intensity measurements by enhancing phenotypic differences between cells and by improving correlations to other modalities.

Download Full-text

Single-Cell in Situ RNA Analysis With Switchable Fluorescent Oligonucleotides

Frontiers in Cell and Developmental Biology ◽

10.3389/fcell.2018.00042 ◽

2018 ◽

Vol 6 ◽

Cited By ~ 5

Author(s):

Lu Xiao ◽

Jia Guo

Keyword(s):

Single Cell ◽

Rna Analysis

Download Full-text

Dhaka: variational autoencoder for unmasking tumor heterogeneity from single cell genomic data

Bioinformatics ◽

10.1093/bioinformatics/btz095 ◽

2019 ◽

Cited By ~ 9

Author(s):

Sabrina Rashid ◽

Sohrab Shah ◽

Ziv Bar-Joseph ◽

Ravi Pandya

Keyword(s):

Single Cell ◽

Tumor Heterogeneity ◽

Genomic Data ◽

Variational Autoencoder

Download Full-text

Abstract 4347: Integrated single-cell DNA and RNA analysis of intratumoral heterogeneity and immune lineages in colorectal and gastric tumor biopsies

10.1158/1538-7445.am2018-4347 ◽

2018 ◽

Author(s):

Billy Lau ◽

Noemi Andor ◽

Anuja Sathe ◽

Christina Wood-Bouwens ◽

George Poultsides ◽

...

Keyword(s):

Single Cell ◽

Intratumoral Heterogeneity ◽

Gastric Tumor ◽

Rna Analysis ◽

Dna And Rna ◽

Tumor Biopsies

Download Full-text

Single cell RNA analysis identifies cellular heterogeneity and adaptive responses of the lung at birth

Nature Communications ◽

10.1038/s41467-018-07770-1 ◽

2019 ◽

Vol 10 (1) ◽

Cited By ~ 52

Author(s):

Minzhe Guo ◽

Yina Du ◽

Jason J. Gokey ◽

Samriddha Ray ◽

Sheila M. Bell ◽

...

Keyword(s):

Single Cell ◽

Cellular Heterogeneity ◽

Adaptive Responses ◽

Rna Analysis

Download Full-text

Single cell ecology

Philosophical Transactions of the Royal Society B Biological Sciences ◽

10.1098/rstb.2019.0076 ◽

2019 ◽

Vol 374 (1786) ◽

pp. 20190076 ◽

Cited By ~ 2

Author(s):

Thomas A. Richards ◽

Ramon Massana ◽

Stefano Pagliara ◽

Neil Hall

Keyword(s):

Single Cell ◽

Building Blocks ◽

Biological Properties ◽

Natural Environments ◽

Biological Processes ◽

Special Issue ◽

New Approaches ◽

New Methodologies ◽

Biological Entities ◽

The Way

Cells are the building blocks of life, from single-celled microbes through to multi-cellular organisms. To understand a multitude of biological processes we need to understand how cells behave, how they interact with each other and how they respond to their environment. The use of new methodologies is changing the way we study cells allowing us to study them on minute scales and in unprecedented detail. These same methods are allowing researchers to begin to sample the vast diversity of microbes that dominate natural environments. The aim of this special issue is to bring together research and perspectives on the application of new approaches to understand the biological properties of cells, including how they interact with other biological entities. This article is part of a discussion meeting issue ‘Single cell ecology’.

Download Full-text

Interpretable factor models of single-cell RNA-seq via variational autoencoders

10.1101/737601 ◽

2019 ◽

Cited By ~ 2

Author(s):

Valentine Svensson ◽

Lior Pachter

Keyword(s):

Gene Expression ◽

Single Cell ◽

Statistical Inference ◽

Factor Models ◽

Rna Seq ◽

Cell Type ◽

Massive Datasets ◽

Domain Specific ◽

Variational Autoencoder ◽

Inference Methods

Single cell RNA-seq makes possible the investigation of variability in gene expression among cells, and dependence of variation on cell type. Statistical inference methods for such analyses must be scalable, and ideally interpretable. We present an approach based on a modification of a recently published highly scalable variational autoencoder framework that provides interpretability without sacrificing much accuracy. We demonstrate that our approach enables identification of gene programs in massive datasets. Our strategy, namely the learning of factor models with the auto-encoding variational Bayes framework, is not domain specific and may be of interest for other applications.

Download Full-text

VASC: dimension reduction and visualization of single cell RNA sequencing data by deep variational autoencoder

10.1101/199315 ◽

2017 ◽

Cited By ~ 6

Author(s):

Dongfang Wang ◽

Jin Gu

Keyword(s):

Dimension Reduction ◽

Single Cell ◽

Rna Sequencing ◽

Original Data ◽

Marker Genes ◽

Single Cell Level ◽

Sequencing Data ◽

Cell Level ◽

Variational Autoencoder ◽

Single Cell Rna Sequencing

AbstractSingle cell RNA sequencing (scRNA-seq) is a powerful technique to analyze the transcriptomic heterogeneities in single cell level. It is an important step for studying cell sub-populations and lineages based on scRNA-seq data by finding an effective low-dimensional representation and visualization of the original data. The scRNA-seq data are much noiser than traditional bulk RNA-Seq: in the single cell level, the transcriptional fluctuations are much larger than the average of a cell population and the low amount of RNA transcripts will increase the rate of technical dropout events. In this study, we proposed VASC (deep Variational Autoencoder for scRNA-seq data), a deep multi-layer generative model, for the unsupervised dimension reduction and visualization of scRNA-seq data. It can explicitly model the dropout events and find the nonlinear hierarchical feature representations of the original data. Tested on twenty datasets, VASC shows superior performances in most cases and broader dataset compatibility compared with four state-of-the-art dimension reduction methods. Then, for a case study of pre-implantation embryos, VASC successfully re-establishes the cell dynamics and identifies several candidate marker genes associated with the early embryo development.

Download Full-text

SISUA: Semi-Supervised Generative Autoencoder for Single Cell Data

10.1101/631382 ◽

2019 ◽

Cited By ~ 1

Author(s):

Trung Ngo Trong ◽

Roger Kramer ◽

Juha Mehtonen ◽

Gerardo González ◽

Ville Hautamäki ◽

...

Keyword(s):

Single Cell ◽

Network Architecture ◽

Surface Protein ◽

Protein Quantification ◽

Additional Information ◽

Protein Levels ◽

Variational Autoencoder ◽

Cell Gene Expression ◽

Cell Phenotypes ◽

Cell Data

ABSTRACTSingle-cell transcriptomics offers a tool to study the diversity of cell phenotypes through snapshots of the abundance of mRNA in individual cells. Often there is additional information available besides the single cell gene expression counts, such as bulk transcriptome data from the same tissue, or quantification of surface protein levels from the same cells. In this study, we propose models based on the Bayesian generative approach, where protein quantification available as CITE-seq counts from the same cells are used to constrain the learning process, thus forming a semi-supervised model. The generative model is based on the deep variational autoencoder (VAE) neural network architecture.

Download Full-text