High-throughput single-cell RNA-seq data imputation and characterization with surrogate-assisted automated deep learning

Author(s):  
Xiangtao Li ◽  
Shaochuan Li ◽  
Lei Huang ◽  
Shixiong Zhang ◽  
Ka-chun Wong

Abstract Single-cell RNA sequencing (scRNA-seq) technologies have been heavily developed to probe gene expression profiles at single-cell resolution. Deep imputation methods have been proposed to address the related computational challenges (e.g. the gene sparsity in single-cell data). In particular, the neural architectures of those deep imputation models have been proven to be critical for performance. However, deep imputation architectures are difficult to design and tune for those without rich knowledge of deep neural networks and scRNA-seq. Therefore, Surrogate-assisted Evolutionary Deep Imputation Model (SEDIM) is proposed to automatically design the architectures of deep neural networks for imputing gene expression levels in scRNA-seq data without any manual tuning. Moreover, the proposed SEDIM constructs an offline surrogate model, which can accelerate the computational efficiency of the architectural search. Comprehensive studies show that SEDIM significantly improves the imputation and clustering performance compared with other benchmark methods. In addition, we also extensively explore the performance of SEDIM in other contexts and platforms including mass cytometry and metabolic profiling in a comprehensive manner. Marker gene detection, gene ontology enrichment and pathological analysis are conducted to provide novel insights into cell-type identification and the underlying mechanisms. The source code is available at https://github.com/li-shaochuan/SEDIM.

2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Farzad Abdolhosseini ◽  
Behrooz Azarkhalili ◽  
Abbas Maazallahi ◽  
Aryan Kamal ◽  
Seyed Abolfazl Motahari ◽  
...  

2018 ◽  
Author(s):  
Lingxue Zhu ◽  
Jing Lei ◽  
Bernie Devlin ◽  
Kathryn Roeder

AbstractMotivated by the dynamics of development, in which cells of recognizable types, or pure cell types, transition into other types over time, we propose a method of semi-soft clustering that can classify both pure and intermediate cell types from data on gene expression or protein abundance from individual cells. Called SOUP, for Semi-sOft clUstering with Pure cells, this novel algorithm reveals the clustering structure for both pure cells, which belong to one single cluster, as well as transitional cells with soft memberships. SOUP involves a two-step process: identify the set of pure cells and then estimate a membership matrix. To find pure cells, SOUP uses the special block structure the K cell types form in a similarity matrix, devised by pairwise comparison of the gene expression profiles of individual cells. Once pure cells are identified, they provide the key information from which the membership matrix can be computed. SOUP is applicable to general clustering problems as well, as long as the unrestrictive modeling assumptions hold. The performance of SOUP is documented via extensive simulation studies. Using SOUP to analyze two single cell data sets from brain shows it produce sensible and interpretable results.


2020 ◽  
Author(s):  
Dan D. Erdmann-Pham ◽  
Jonathan Fischer ◽  
Justin Hong ◽  
Yun S. Song

AbstractDirect comparison of bulk gene expression profiles is complicated by distinct cell type mixtures in each sample which obscure whether observed differences are actually due to changes in expression levels themselves or simply cell type compositions. Single-cell technology has made it possible to measure gene expression in individual cells, achieving higher resolution at the expense of increased noise. If carefully incorporated, such single-cell data can be used to deconvolve bulk samples to yield accurate estimates of the true cell type proportions, thus enabling one to disentangle the effects of differential expression and cell type mixtures. Here, we propose a generative model and a likelihood-based inference method that uses asymptotic statistical theory and a novel optimization procedure to perform deconvolution of bulk RNA-seq data to produce accurate cell type proportion estimates. We demonstrate the effectiveness of our method, called RNA-Sieve, across a diverse array of scenarios involving real data and discuss several extensions made uniquely possible by our probabilistic framework, including general hypotheses tests and confidence intervals.


2020 ◽  
Vol 8 (Suppl 3) ◽  
pp. A4-A4
Author(s):  
Anushka Dikshit ◽  
Dan Zollinger ◽  
Karen Nguyen ◽  
Jill McKay-Fleisch ◽  
Kit Fuhrman ◽  
...  

BackgroundThe canonical WNT-β-catenin signaling pathway is vital for development and tissue homeostasis but becomes strongly tumorigenic when dysregulated. and alter the transcriptional signature of a cell to promote malignant transformation. However, thorough characterization of these transcriptomic signatures has been challenging because traditional methods lack either spatial information, multiplexing, or sensitivity/specificity. To overcome these challenges, we developed a novel workflow combining the single molecule and single cell visualization capabilities of the RNAscope in situ hybridization (ISH) assay with the highly multiplexed spatial profiling capabilities of the GeoMx™ Digital Spatial Profiler (DSP) RNA assays. Using these methods, we sought to spatially profile and compare gene expression signatures of tumor niches with high and low CTNNB1 expression.MethodsAfter screening 120 tumor cores from multiple tumors for CTNNB1 expression by the RNAscope assay, we identified melanoma as the tumor type with the highest CTNNB1 expression while prostate tumors had the lowest expression. Using the RNAscope Multiplex Fluorescence assay we selected regions of high CTNNB1 expression within 3 melanoma tumors as well as regions with low CTNNB1 expression within 3 prostate tumors. These selected regions of interest (ROIs) were then transcriptionally profiled using the GeoMx DSP RNA assay for a set of 78 genes relevant in immuno-oncology. Target genes that were differentially expressed were further visualized and spatially assessed using the RNAscope Multiplex Fluorescence assay to confirm GeoMx DSP data with single cell resolution.ResultsThe GeoMx DSP analysis comparing the melanoma and prostate tumors revealed that they had significantly different gene expression profiles and many of these genes showed concordance with CTNNB1 expression. Furthermore, immunoregulatory targets such as ICOSLG, CTLA4, PDCD1 and ARG1, also demonstrated significant correlation with CTNNB1 expression. On validating selected targets using the RNAscope assay, we could distinctly visualize that they were not only highly expressed in melanoma compared to the prostate tumor, but their expression levels changed proportionally to that of CTNNB1 within the same tumors suggesting that these differentially expressed genes may be regulated by the WNT-β-catenin pathway.ConclusionsIn summary, by combining the RNAscope ISH assay and the GeoMx DSP RNA assay into one joint workflow we transcriptionally profiled regions of high and low CTNNB1 expression within melanoma and prostate tumors and identified genes potentially regulated by the WNT- β-catenin pathway. This novel workflow can be fully automated and is well suited for interrogating the tumor and stroma and their interactions.GeoMx Assays are for RESEARCH ONLY, not for diagnostics.


2021 ◽  
Vol 9 (Suppl 1) ◽  
pp. A12.1-A12
Author(s):  
Y Arjmand Abbassi ◽  
N Fang ◽  
W Zhu ◽  
Y Zhou ◽  
Y Chen ◽  
...  

Recent advances of high-throughput single cell sequencing technologies have greatly improved our understanding of the complex biological systems. Heterogeneous samples such as tumor tissues commonly harbor cancer cell-specific genetic variants and gene expression profiles, both of which have been shown to be related to the mechanisms of disease development, progression, and responses to treatment. Furthermore, stromal and immune cells within tumor microenvironment interact with cancer cells to play important roles in tumor responses to systematic therapy such as immunotherapy or cell therapy. However, most current high-throughput single cell sequencing methods detect only gene expression levels or epigenetics events such as chromatin conformation. The information on important genetic variants including mutation or fusion is not captured. To better understand the mechanisms of tumor responses to systematic therapy, it is essential to decipher the connection between genotype and gene expression patterns of both tumor cells and cells in the tumor microenvironment. We developed FocuSCOPE, a high-throughput multi-omics sequencing solution that can detect both genetic variants and transcriptome from same single cells. FocuSCOPE has been used to successfully perform single cell analysis of both gene expression profiles and point mutations, fusion genes, or intracellular viral sequences from thousands of cells simultaneously, delivering comprehensive insights of tumor and immune cells in tumor microenvironment at single cell resolution.Disclosure InformationY. Arjmand Abbassi: None. N. Fang: None. W. Zhu: None. Y. Zhou: None. Y. Chen: None. U. Deutsch: None.


2021 ◽  
Vol 288 (1945) ◽  
pp. 20202793
Author(s):  
Alexander Yermanos ◽  
Daniel Neumeier ◽  
Ioana Sandu ◽  
Mariana Borsa ◽  
Ann Cathrin Waindok ◽  
...  

Neuroinflammation plays a crucial role during ageing and various neurological conditions, including Alzheimer's disease, multiple sclerosis and infection. Technical limitations, however, have prevented an integrative analysis of how lymphocyte immune receptor repertoires and their accompanying transcriptional states change with age in the central nervous system. Here, we leveraged single-cell sequencing to simultaneously profile B cell receptor and T cell receptor repertoires and accompanying gene expression profiles in young and old mouse brains. We observed the presence of clonally expanded B and T cells in the central nervous system of aged male mice. Furthermore, many of these B cells were of the IgM and IgD isotypes, and had low levels of somatic hypermutation. Integrating gene expression information additionally revealed distinct transcriptional profiles of these clonally expanded lymphocytes. Our findings implicate that clonally related T and B cells in the CNS of elderly mice may contribute to neuroinflammation accompanying homeostatic ageing.


2021 ◽  
Author(s):  
Philip Bischoff ◽  
Alexandra Trinks ◽  
Jennifer Wiederspahn ◽  
Benedikt Obermayer ◽  
Jan Patrick Pett ◽  
...  

AbstractLung carcinoid tumors, also referred to as pulmonary neuroendocrine tumors or lung carcinoids, are rare neoplasms of the lung with a more favorable prognosis than other subtypes of lung cancer. Still, some patients suffer from relapsed disease and metastatic spread while no consensus treatment exists for metastasized carcinoids. Several recent single-cell studies have provided detailed insights into the cellular heterogeneity of more common lung cancers, such as adeno- and squamous cell carcinoma. However, the characteristics of lung carcinoids on the single-cell level are yet completely unknown.To study the cellular composition and single-cell gene expression profiles in lung carcinoids, we applied single-cell RNA sequencing to three lung carcinoid tumor samples and normal lung tissue. The single-cell transcriptomes of carcinoid tumor cells reflected intertumoral heterogeneity associated with clinicopathological features, such as tumor necrosis and proliferation index. The immune microenvironment was specifically enriched in noninflammatory monocyte-derived myeloid cells. Tumor-associated endothelial cells were characterized by distinct gene expression profiles. A spectrum of vascular smooth muscle cells and pericytes predominated the stromal microenvironment. We found a small proportion of myofibroblasts exhibiting features reminiscent of cancer-associated fibroblasts. Stromal and immune cells exhibited potential paracrine interactions which may shape the microenvironment via NOTCH, VEGF, TGFβ and JAK/STAT signaling. Moreover, single-cell gene signatures of pericytes and myofibroblasts demonstrated prognostic value in bulk gene expression data.Here, we provide first comprehensive insights into the cellular composition and single-cell gene expression profiles in lung carcinoids, demonstrating the non-inflammatory and vessel-rich nature of their tumor microenvironment, and outlining relevant intercellular interactions which could serve as future therapeutic targets.


Science ◽  
2020 ◽  
Vol 371 (6531) ◽  
pp. eaba5257 ◽  
Author(s):  
Anna Kuchina ◽  
Leandra M. Brettner ◽  
Luana Paleologu ◽  
Charles M. Roco ◽  
Alexander B. Rosenberg ◽  
...  

Single-cell RNA sequencing (scRNA-seq) has become an essential tool for characterizing gene expression in eukaryotes, but current methods are incompatible with bacteria. Here, we introduce microSPLiT (microbial split-pool ligation transcriptomics), a high-throughput scRNA-seq method for Gram-negative and Gram-positive bacteria that can resolve heterogeneous transcriptional states. We applied microSPLiT to >25,000 Bacillus subtilis cells sampled at different growth stages, creating an atlas of changes in metabolism and lifestyle. We retrieved detailed gene expression profiles associated with known, but rare, states such as competence and prophage induction and also identified unexpected gene expression states, including the heterogeneous activation of a niche metabolic pathway in a subpopulation of cells. MicroSPLiT paves the way to high-throughput analysis of gene expression in bacterial communities that are otherwise not amenable to single-cell analysis, such as natural microbiota.


Sign in / Sign up

Export Citation Format

Share Document