scholarly journals LAK: Lasso and K-Means Based Single-Cell RNA-Seq Data Clustering Analysis

IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 129679-129688 ◽  
Author(s):  
Jiao Hua ◽  
Hongkun Liu ◽  
Boyang Zhang ◽  
Shuilin Jin
2021 ◽  
Vol 90 ◽  
pp. 107415
Author(s):  
Junyi Li ◽  
Wei Jiang ◽  
Henry Han ◽  
Jing Liu ◽  
Bo Liu ◽  
...  

2017 ◽  
Vol 18 (1) ◽  
Author(s):  
Zhuo Wang ◽  
Shuilin Jin ◽  
Guiyou Liu ◽  
Xiurui Zhang ◽  
Nan Wang ◽  
...  

Blood ◽  
2015 ◽  
Vol 126 (23) ◽  
pp. SCI-20-SCI-20
Author(s):  
H. Leighton Grimes ◽  
Singh Harinder ◽  
Andre Olsson ◽  
Nathan Salomonis ◽  
Bruce J. Aronow ◽  
...  

Abstract Single-cell RNA-Seq has the potential to become a dominant approach in probing diverse and complex developmental compartments. Its unbiased and comprehensive nature could enable developmental ordering of cellular and regulatory gene hierarchies without prior knowledge. To test general utility we performed single-cell RNA-seq of murine hematopoietic progenitors focusing on the myeloid developmental hierarchy. Using novel unsupervised clustering analysis, ICDS, we correctly ordered known hierarchical states as well as revealed rare intermediates. Regulatory state analysis suggested that the transcription factors Gfi1 and Irf8 function antagonistically to control homeostatic neutrophil and macrophage production, respectively. This prediction was validated by complementary genetic and genomic experiments in granulocyte-macrophage progenitors. Using knock-in reporters for Gfi1 and Irf8 and clonogenic analyses coupled with single-cell RNA-seq we distinguished regulatory states of bi-potential progenitors from their lineage specifying or committed progeny. Thus single-cell RNA-Seq is a powerful developmental tool to characterize hierarchical and rare cellular states along with the regulators that control their dynamics. Disclosures No relevant conflicts of interest to declare.


IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 166730-166741
Author(s):  
Jihong Guan ◽  
Rui-Yi Li ◽  
Jiasheng Wang

BMC Genomics ◽  
2018 ◽  
Vol 19 (S6) ◽  
Author(s):  
Marmar Moussa ◽  
Ion I. Măndoiu
Keyword(s):  

2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Yael Baran ◽  
Akhiad Bercovich ◽  
Arnau Sebe-Pedros ◽  
Yaniv Lubling ◽  
Amir Giladi ◽  
...  

Abstract scRNA-seq profiles each represent a highly partial sample of mRNA molecules from a unique cell that can never be resampled, and robust analysis must separate the sampling effect from biological variance. We describe a methodology for partitioning scRNA-seq datasets into metacells: disjoint and homogenous groups of profiles that could have been resampled from the same cell. Unlike clustering analysis, our algorithm specializes at obtaining granular as opposed to maximal groups. We show how to use metacells as building blocks for complex quantitative transcriptional maps while avoiding data smoothing. Our algorithms are implemented in the MetaCell R/C++ software package.


2020 ◽  
Author(s):  
Hui Li ◽  
Cory R. Brouwer ◽  
Weijun Luo

AbstractSingle cell RNA sequencing (scRNA-Seq) has been widely used in biomedical research and generated enormous volume and diversity of data. The raw data contain multiple types of noise and technical artifacts and need thorough cleaning. The existing denoising and imputation methods largely focus on a single type of noise (i.e. dropouts) and have strong distribution assumptions which greatly limit their performance and application. We designed and developed the AutoClass model, integrating two deep neural network components, an autoencoder and a classifier, as to maximize both noise removal and signal retention. AutoClass is free of distribution assumptions, hence can effectively clean a wide range of noises and artifacts. AutoClass outperforms the state-of-art methods in multiple types of scRNA-Seq data analyses, including data recovery, differential expression analysis, clustering analysis and batch effect removal. Importantly, AutoClass is robust on key hyperparameter settings including bottleneck layer size, pre-clustering number and classifier weight. We have made AutoClass open source at: https://github.com/datapplab/AutoClass.


Genes ◽  
2020 ◽  
Vol 11 (5) ◽  
pp. 532 ◽  
Author(s):  
Weilai Chi ◽  
Minghua Deng

Single-cell RNA-seq (scRNA-seq) is quite prevalent in studying transcriptomes, but it suffers from excessive zeros, some of which are true, but others are false. False zeros, which can be seen as missing data, obstruct the downstream analysis of single-cell RNA-seq data. How to distinguish true zeros from false ones is the key point of this problem. Here, we propose sparsity-penalized stacked denoising autoencoders (scSDAEs) to impute scRNA-seq data. scSDAEs adopt stacked denoising autoencoders with a sparsity penalty, as well as a layer-wise pretraining procedure to improve model fitting. scSDAEs can capture nonlinear relationships among the data and incorporate information about the observed zeros. We tested the imputation efficiency of scSDAEs on recovering the true values of gene expression and helping downstream analysis. First, we show that scSDAE can recover the true values and the sample–sample correlations of bulk sequencing data with simulated noise. Next, we demonstrate that scSDAEs accurately impute RNA mixture dataset with different dilutions, spike-in RNA concentrations affected by technical zeros, and improves the consistency of RNA and protein levels in CITE-seq data. Finally, we show that scSDAEs can help downstream clustering analysis. In this study, we develop a deep learning-based method, scSDAE, to impute single-cell RNA-seq affected by technical zeros. Furthermore, we show that scSDAEs can recover the true values, to some extent, and help downstream analysis.


Author(s):  
Jiahua Rao ◽  
Xiang Zhou ◽  
Yutong Lu ◽  
Huiying Zhao ◽  
Yuedong Yang

AbstractSingle-cell RNA sequencing technology promotes the profiling of single-cell transcriptomes at an unprecedented throughput and resolution. However, in scRNA-seq studies, only a low amount of sequenced mRNA in each cell leads to missing detection for a portion of mRNA molecules, i.e. the dropout problem. The dropout event hinders various downstream analysis, such as clustering analysis, differential expression analysis, and inference of gene-to-gene relationships. Therefore, it is necessary to develop robust and effective imputation methods for the increasing scRNA-seq data. In this study, we have developed an imputation method (GraphSCI) to impute the dropout events in scRNA-seq data based on the graph convolution networks. The method takes advantage of low-dimensional representations of similar cells and gene-gene interactions to impute the dropouts. Extensive experiments demonstrated that GraphSCI outperforms other state-of-the-art methods for imputation on both simulated and real scRNA-seq data. Meanwhile, GraphSCI is able to accurately infer gene-to-gene relationships by utilizing the imputed matrix that are concealed by dropout events in raw data.


Sign in / Sign up

Export Citation Format

Share Document