scholarly journals Optimized design of single-cell RNA sequencing experiments for cell-type-specific eQTL analysis

2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Igor Mandric ◽  
Tommer Schwarz ◽  
Arunabha Majumdar ◽  
Kangcheng Hou ◽  
Leah Briscoe ◽  
...  

Abstract Single-cell RNA-sequencing (scRNA-Seq) is a compelling approach to directly and simultaneously measure cellular composition and state, which can otherwise only be estimated by applying deconvolution methods to bulk RNA-Seq estimates. However, it has not yet become a widely used tool in population-scale analyses, due to its prohibitively high cost. Here we show that given the same budget, the statistical power of cell-type-specific expression quantitative trait loci (eQTL) mapping can be increased through low-coverage per-cell sequencing of more samples rather than high-coverage sequencing of fewer samples. We use simulations starting from one of the largest available real single-cell RNA-Seq data from 120 individuals to also show that multiple experimental designs with different numbers of samples, cells per sample and reads per cell could have similar statistical power, and choosing an appropriate design can yield large cost savings especially when multiplexed workflows are considered. Finally, we provide a practical approach on selecting cost-effective designs for maximizing cell-type-specific eQTL power which is available in the form of a web tool.

2019 ◽  
Author(s):  
Igor Mandric ◽  
Tommer Schwarz ◽  
Arunabha Majumdar ◽  
Richard Perez ◽  
Meena Subramaniam ◽  
...  

AbstractSingle-cell RNA-sequencing (scRNA-Seq) is a compelling approach to simultaneously measure cellular composition and state which is impossible with bulk profiling approaches. However, it has not yet become a widely used tool in population-scale analyses, due to its prohibitively high cost. Here we show that given the same budget, the statistical power of cell-type-specific expression quantitative trait loci (eQTL) mapping can be increased through low-coverage per-cell sequencing of more samples rather than high-coverage sequencing of fewer samples. We also show that multiple experimental designs with different numbers of samples, cells per sample and reads per cell could have similar statistical power, and choosing an appropriate design can yield large cost savings especially when multiplexed workflows are considered. Finally, we provide a practical approach on selecting cost-effective designs for maximizing cell-type-specific eQTL power.


2019 ◽  
Vol 15 ◽  
pp. P1258-P1258 ◽  
Author(s):  
Tulsi Patel ◽  
Troy Carnwath ◽  
Laura Lewis-Tuffin ◽  
Mariet Allen ◽  
Sarah J. Lincoln ◽  
...  

2020 ◽  
Author(s):  
Jiaxin Fan ◽  
Xuran Wang ◽  
Rui Xiao ◽  
Mingyao Li

AbstractAllelic expression imbalance (AEI), quantified by the relative expression of two alleles of a gene in a diploid organism, can help explain phenotypic variations among individuals. Traditional methods detect AEI using bulk RNA sequencing (RNA-seq) data, a data type that averages out cell-to-cell heterogeneity in gene expression across cell types. Since the patterns of AEI may vary across different cell types, it is desirable to study AEI in a cell-type-specific manner. Although this can be achieved by single-cell RNA sequencing (scRNA-seq), it requires full-length transcript to be sequenced in single cells of a large number of individuals, which are still cost prohibitive to generate. To overcome this limitation and utilize the vast amount of existing disease relevant bulk tissue RNA-seq data, we developed BSCET, which enables the characterization of cell-type-specific AEI in bulk RNA-seq data by integrating cell type composition information inferred from a small set of scRNA-seq samples, possibly obtained from an external dataset. By modeling covariate effect, BSCET can also detect genes whose cell-type-specific AEI are associated with clinical factors. Through extensive benchmark evaluations, we show that BSCET correctly detected genes with cell-type-specific AEI and differential AEI between healthy and diseased samples using bulk RNA-seq data. BSCET also uncovered cell-type-specific AEIs that were missed in bulk data analysis when the directions of AEI are opposite in different cell types. We further applied BSCET to two pancreatic islet bulk RNA-seq datasets, and detected genes showing cell-type-specific AEI that are related to the progression of type 2 diabetes. Since bulk RNA-seq data are easily accessible, BSCET provided a convenient tool to integrate information from scRNA-seq data to gain insight on AEI with cell type resolution. Results from such analysis will advance our understanding of cell type contributions in human diseases.Author SummaryDetection of allelic expression imbalance (AEI), a phenomenon where the two alleles of a gene differ in their expression magnitude, is a key step towards the understanding of phenotypic variations among individuals. Existing methods detect AEI use bulk RNA sequencing (RNA-seq) data and ignore AEI variations among different cell types. Although single-cell RNA sequencing (scRNA-seq) has enabled the characterization of cell-to-cell heterogeneity in gene expression, the high costs have limited its application in AEI analysis. To overcome this limitation, we developed BSCET to characterize cell-type-specific AEI using the widely available bulk RNA-seq data by integrating cell-type composition information inferred from scRNA-seq samples. Since the degree of AEI may vary with disease phenotypes, we further extended BSCET to detect genes whose cell-type-specific AEIs are associated with clinical factors. Through extensive benchmark evaluations and analyses of two pancreatic islet bulk RNA-seq datasets, we demonstrated BSCET’s ability to refine bulk-level AEI to cell-type resolution, and to identify genes whose cell-type-specific AEIs are associated with the progression of type 2 diabetes. With the vast amount of easily accessible bulk RNA-seq data, we believe BSCET will be a valuable tool for elucidating cell type contributions in human diseases.


PLoS Genetics ◽  
2021 ◽  
Vol 17 (3) ◽  
pp. e1009080
Author(s):  
Jiaxin Fan ◽  
Xuran Wang ◽  
Rui Xiao ◽  
Mingyao Li

Allelic expression imbalance (AEI), quantified by the relative expression of two alleles of a gene in a diploid organism, can help explain phenotypic variations among individuals. Traditional methods detect AEI using bulk RNA sequencing (RNA-seq) data, a data type that averages out cell-to-cell heterogeneity in gene expression across cell types. Since the patterns of AEI may vary across different cell types, it is desirable to study AEI in a cell-type-specific manner. Although this can be achieved by single-cell RNA sequencing (scRNA-seq), it requires full-length transcript to be sequenced in single cells of a large number of individuals, which are still cost prohibitive to generate. To overcome this limitation and utilize the vast amount of existing disease relevant bulk tissue RNA-seq data, we developed BSCET, which enables the characterization of cell-type-specific AEI in bulk RNA-seq data by integrating cell type composition information inferred from a small set of scRNA-seq samples, possibly obtained from an external dataset. By modeling covariate effect, BSCET can also detect genes whose cell-type-specific AEI are associated with clinical factors. Through extensive benchmark evaluations, we show that BSCET correctly detected genes with cell-type-specific AEI and differential AEI between healthy and diseased samples using bulk RNA-seq data. BSCET also uncovered cell-type-specific AEIs that were missed in bulk data analysis when the directions of AEI are opposite in different cell types. We further applied BSCET to two pancreatic islet bulk RNA-seq datasets, and detected genes showing cell-type-specific AEI that are related to the progression of type 2 diabetes. Since bulk RNA-seq data are easily accessible, BSCET provided a convenient tool to integrate information from scRNA-seq data to gain insight on AEI with cell type resolution. Results from such analysis will advance our understanding of cell type contributions in human diseases.


2020 ◽  
Author(s):  
Nithin Joshy ◽  
Kyuson Yun

AbstractMotivationSingle-cell RNA sequencing (scRNA-seq) is a recent technology that has provided many valuable biological insights. Notable uses include identifying novel cell-types, measuring the cellular response to treatment, and tracking trajectories of distinct cell lineages in time. The raw data generated in this process typically amounts to hundreds of millions of sequencing reads and requires substantial computational infrastructure for downstream analysis, a major hurdle for a biological research lab. Fortunately, the preprocessing step that converts this huge sequence data into manageable cell-specific expression profiles is standardized and can be performed in the cloud. We demonstrate how a cloud-based computational framework can be used to transform the raw data into biologically interpretable cell-type-specific information, using either 3’ or 5’ transcriptome libraries from 10x Genomics. The processed data which is an order of magnitude smaller in size can be easily downloaded to a laptop for customized analysis to gain deeper biological insights.ResultsWe produced an automated and easily extensible pipeline in the cloud for the analysis of single-cell RNA-seq data which provides a convenient method to handle post-processing of scRNA sequencing using next generation sequencing platforms. The basic step provides the transformation of the scRNA-seq data to cell-type-specific expression profiles and computes the quality control metrics for the dataset. The extensibility of the platform is demonstrated by adding a doublet-removal algorithm and recomputing the clustering of the cells. Any additional computational steps that take a cell-type expression counts matrix as input can be easily added to this framework with minimal effort.AvailabilityThe framework and its documentation for installation is available at the Github repository http://github.com/nj3252/CB-Source/[email protected] informationSupplementary data available at Bioinformatics online.


PLoS ONE ◽  
2018 ◽  
Vol 13 (10) ◽  
pp. e0205883 ◽  
Author(s):  
Joseph C. Mays ◽  
Michael C. Kelly ◽  
Steven L. Coon ◽  
Lynne Holtzclaw ◽  
Martin F. Rath ◽  
...  

Author(s):  
Jun Cheng ◽  
Wenduo Gu ◽  
Ting Lan ◽  
Jiacheng Deng ◽  
Zhichao Ni ◽  
...  

Abstract Aims Hypertension is a major risk factor for cardiovascular diseases. However, vascular remodelling, a hallmark of hypertension, has not been systematically characterized yet. We described systematic vascular remodelling, especially the artery type- and cell type-specific changes, in hypertension using spontaneously hypertensive rats (SHRs). Methods and results Single-cell RNA sequencing was used to depict the cell atlas of mesenteric artery (MA) and aortic artery (AA) from SHRs. More than 20 000 cells were included in the analysis. The number of immune cells more than doubled in aortic aorta in SHRs compared to Wistar Kyoto controls, whereas an expansion of MA mesenchymal stromal cells (MSCs) was observed in SHRs. Comparison of corresponding artery types and cell types identified in integrated datasets unravels dysregulated genes specific for artery types and cell types. Intersection of dysregulated genes with curated gene sets including cytokines, growth factors, extracellular matrix (ECM), receptors, etc. revealed vascular remodelling events involving cell–cell interaction and ECM re-organization. Particularly, AA remodelling encompasses upregulated cytokine genes in smooth muscle cells, endothelial cells, and especially MSCs, whereas in MA, change of genes involving the contractile machinery and downregulation of ECM-related genes were more prominent. Macrophages and T cells within the aorta demonstrated significant dysregulation of cellular interaction with vascular cells. Conclusion Our findings provide the first cell landscape of resistant and conductive arteries in hypertensive animal models. Moreover, it also offers a systematic characterization of the dysregulated gene profiles with unbiased, artery type-specific and cell type-specific manners during hypertensive vascular remodelling.


2019 ◽  
Vol 33 (S1) ◽  
Author(s):  
Steven Schaffert ◽  
Aram Krauson ◽  
Elisabeth Walczak ◽  
Jonathan Nizar ◽  
Gwendolyn Holdgate ◽  
...  

2019 ◽  
Vol 21 (Supplement_6) ◽  
pp. vi64-vi64
Author(s):  
Robert Suter ◽  
Vasileios Stathias ◽  
Anna Jermakowicz ◽  
Alexa Semonche ◽  
Michael Ivan ◽  
...  

Abstract Glioblastoma (GBM) remains the most common adult brain tumor, with poor survival expectations, and no new therapeutic modalities approved in the last decade. Our laboratories have recently demonstrated that the integration of a transcriptional disease signature obtained from The Cancer Genome Atlas’ GBM dataset with transcriptional cell drug-response signatures in the LINCS L1000 dataset yields possible combinatorial therapeutics. Considering the extreme intra-tumor heterogeneity associated with the disease, we hypothesize that the utilization of single-cell RNA-sequencing (scRNA-seq) of patient tumors will further strengthen our predictive model by providing insight on the unique transcriptomes of the cellular niches present within these tumors, and into the transcriptional dynamics of these same cellular niches. By sequencing single-cell transcriptomes from recurrent GBM tumors resected from patients at the University of Miami, and integrating our datasets with previously published scRNA-seq data from primary GBM tumors, we are able to gain additional insight into the differences between these clinical distinctions. We have analyzed the differential expression of kinases both across and within distinct cell populations of primary and recurrent GBM tumors. This transcriptional map of kinase expression represents the heterogeneity of potential targets within individual tumors and between recurrent and primary GBM. Additionally, by generating disease signatures unique to each cellular population, and integrating these with transcriptional drug-response signatures from LINCS, we are able to predict compounds to target specific cell populations within GMB tumors. Additional computational techniques such as RNA velocity analysis and cell cycle scoring elucidate temporal insights to further prioritize these cell-type specific therapeutics, and reveal the intra-cellular dynamics present within these tumors. Collectively, our studies suggest that we have developed a novel omics pipeline based on the single cell RNA-sequencing of individual GBM cells that addresses intra-tumor heterogeneity, and may lead to novel therapeutic combinations for the treatment of this incurable disease.


Sign in / Sign up

Export Citation Format

Share Document