A hierarchical Bayesian model for single-cell clustering using RNA-sequencing data

AbstractThe rapid advancement of single cell technologies has shed new light on the complex mechanisms of cellular heterogeneity. However, compared with bulk RNA sequencing (RNA-seq), single-cell RNA-seq (scRNA-seq) suffers from higher noise and lower coverage, which brings new computational difficulties. Based on statistical independence, cell-specific network (CSN) is able to quantify the overall associations between genes for each cell, yet suffering from a problem of overestimation related to indirect effects. To overcome this problem, we propose the “conditional cell-specific network” (CCSN) method, which can measure the direct associations between genes by eliminating the indirect associations. CCSN can be used for cell clustering and dimension reduction on a network basis of single cells. Intuitively, each CCSN can be viewed as the transformation from less “reliable” gene expression to more “reliable” gene-gene associations in a cell. Based on CCSN, we further design network flow entropy (NFE) to estimate the differentiation potency of a single cell. A number of scRNA-seq datasets were used to demonstrate the advantages of our approach: (1) one direct association network for one cell; (2) most existing scRNA-seq methods designed for gene expression matrices are also applicable to CCSN-transformed degree matrices; (3) CCSN-based NFE helps resolving the direction of differentiation trajectories by quantifying the potency of each cell. CCSN is publicly available at http://sysbio.sibcb.ac.cn/cb/chenlab/soft/CCSN.zip.

Download Full-text

A monotonicity-based gene clustering algorithm for enhancing clarity in single-cell RNA sequencing data

10.1101/2020.12.20.423308 ◽

2020 ◽

Author(s):

Victor Wang ◽

Pietro Antonio Cicalese ◽

Chandra Mohan

Keyword(s):

Single Cell ◽

Rna Sequencing ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Expression Patterns ◽

Gene Clustering ◽

Sequencing Data ◽

Technical Noise ◽

Cell Clustering ◽

Single Cell Rna Sequencing

AbstractSingle-cell RNA sequencing (scRNA-seq) technologies and analysis tools have allowed for meaningful insight into the roles and relationships of cells. However, high dimensionality, frequent dropout values, and technical noise remain prevalent challenges for scRNA-seq data, obscuring the already complex expression patterns. To address several shortcomings in commonly used distance metrics, we present a monotonicity-based distance metric designed to enhance the clarity of scRNA-seq data. We apply our metric in a gene clustering algorithm, which we run on several biological datasets. We compare our results to those generated by popular clustering algorithms to demonstrate that our algorithm has substantial ability to improve the accuracy of subsequent cell clustering.

Download Full-text

Faculty Opinions recommendation of A hierarchical Bayesian model for learning nonlinear statistical regularities in nonstationary natural signals.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1024224.288708 ◽

2005 ◽

Author(s):

Peter König

Keyword(s):

Bayesian Model ◽

Hierarchical Bayesian ◽

Hierarchical Bayesian Model ◽

Statistical Regularities

Download Full-text

A development of hierarchical bayesian model for changing point analysis at watershed scale

Journal of Korea Water Resources Association ◽

10.3741/jkwra.2017.50.2.75 ◽

2017 ◽

Vol 50 (02) ◽

Keyword(s):

Bayesian Model ◽

Hierarchical Bayesian ◽

Hierarchical Bayesian Model ◽

Watershed Scale ◽

Point Analysis

Download Full-text

Mixed Distribution Models Based on Single-Cell RNA Sequencing Data

Interdisciplinary Sciences Computational Life Sciences ◽

10.1007/s12539-021-00427-6 ◽

2021 ◽

Author(s):

Min Wu ◽

Junhua Xu ◽

Tao Ding ◽

Jie Gao

Keyword(s):

Single Cell ◽

Rna Sequencing ◽

Sequencing Data ◽

Distribution Models ◽

Mixed Distribution ◽

Single Cell Rna Sequencing

Download Full-text

A Bernoulli-Gamma Hierarchical Bayesian Model for Daily Rainfall Forecasts

Journal of Hydrology ◽

10.1016/j.jhydrol.2021.126317 ◽

2021 ◽

pp. 126317

Author(s):

Carlos H. R. Lima ◽

Hyun-Han Kwon ◽

Yong-Tak Kim

Keyword(s):

Bayesian Model ◽

Daily Rainfall ◽

Hierarchical Bayesian ◽

Hierarchical Bayesian Model

Download Full-text

A hierarchical Bayesian model for the analysis of space-time air pollutant concentrations and an application to air pollution analysis in Northern China

Stochastic Environmental Research and Risk Assessment ◽

10.1007/s00477-021-02027-8 ◽

2021 ◽

Author(s):

Weifu Ding ◽

Yee Leung ◽

Jiangshe Zhang ◽

Tung Fung

Keyword(s):

Air Pollution ◽

Bayesian Model ◽

Northern China ◽

Space Time ◽

Air Pollutant ◽

Hierarchical Bayesian ◽

Hierarchical Bayesian Model ◽

Pollutant Concentrations ◽

Air Pollutant Concentrations

Download Full-text

IMMU-27. SINGLE CELL RNA-SEQUENCING IDENTIFIES NOVEL BONE MARROW DERIVED MYELOID CELLS IN GLIOBLASTOMA ASSOCIATED WITH TUMOR AGGRESSION

Neuro-Oncology ◽

10.1093/neuonc/noaa215.457 ◽

2020 ◽

Vol 22 (Supplement_2) ◽

pp. ii110-ii110

Author(s):

Christina Jackson ◽

Christopher Cherry ◽

Sadhana Bom ◽

Hao Zhang ◽

John Choi ◽

...

Keyword(s):

Bone Marrow ◽

Single Cell ◽

Tumor Cells ◽

Rna Sequencing ◽

Metabolic Pathways ◽

Myeloid Cells ◽

Tumor Grade ◽

Sequencing Data ◽

Single Cell Rna Sequencing ◽

Two Populations

Abstract BACKGROUND Glioma associated myeloid cells (GAMs) can be induced to adopt an immunosuppressive phenotype that can lead to inhibition of anti-tumor responses in glioblastoma (GBM). Understanding the composition and phenotypes of GAMs is essential to modulating the myeloid compartment as a therapeutic adjunct to improve anti-tumor immune response. METHODS We performed single-cell RNA-sequencing (sc-RNAseq) of 435,400 myeloid and tumor cells to identify transcriptomic and phenotypic differences in GAMs across glioma grades. We further correlated the heterogeneity of the GAM landscape with tumor cell transcriptomics to investigate interactions between GAMs and tumor cells. RESULTS sc-RNAseq revealed a diverse landscape of myeloid-lineage cells in gliomas with an increase in preponderance of bone marrow derived myeloid cells (BMDMs) with increasing tumor grade. We identified two populations of BMDMs unique to GBMs; Mac-1and Mac-2. Mac-1 demonstrates upregulation of immature myeloid gene signature and altered metabolic pathways. Mac-2 is characterized by expression of scavenger receptor MARCO. Pseudotime and RNA velocity analysis revealed the ability of Mac-1 to transition and differentiate to Mac-2 and other GAM subtypes. We further found that the presence of these two populations of BMDMs are associated with the presence of tumor cells with stem cell and mesenchymal features. Bulk RNA-sequencing data demonstrates that gene signatures of these populations are associated with worse survival in GBM. CONCLUSION We used sc-RNAseq to identify a novel population of immature BMDMs that is associated with higher glioma grades. This population exhibited altered metabolic pathways and stem-like potentials to differentiate into other GAM populations including GAMs with upregulation of immunosuppressive pathways. Our results elucidate unique interactions between BMDMs and GBM tumor cells that potentially drives GBM progression and the more aggressive mesenchymal subtype. Our discovery of these novel BMDMs have implications in new therapeutic targets in improving the efficacy of immune-based therapies in GBM.

Download Full-text

Robust resource demand estimation using hierarchical Bayesian model in a distributed service system

8th ACM IKDD CODS and 26th COMAD ◽

10.1145/3430984.3431003 ◽

2020 ◽

Author(s):

Sumanta Mukherjee ◽

Krishnasuri Narayanam ◽

Nupur Aggarwal ◽

Digbalay Bose ◽

Amith Singhee

Keyword(s):

Bayesian Model ◽

Service System ◽

Demand Estimation ◽

Hierarchical Bayesian ◽

Hierarchical Bayesian Model ◽

Resource Demand ◽

Distributed Service

Download Full-text

Software Benchmark—Classification Tree Algorithms for Cell Atlases Annotation Using Single-Cell RNA-Sequencing Data

Microbiology Research ◽

10.3390/microbiolres12020022 ◽

2021 ◽

Vol 12 (2) ◽

pp. 317-334

Author(s):

Omar Alaqeeli ◽

Li Xing ◽

Xuekui Zhang

Keyword(s):

Single Cell ◽

Rna Sequencing ◽

Classification Tree ◽

Area Under The Curve ◽

Data Sets ◽

Sequencing Data ◽

Single Cell Rna Sequencing ◽

Tree Algorithms ◽

R Packages

Classification tree is a widely used machine learning method. It has multiple implementations as R packages; rpart, ctree, evtree, tree and C5.0. The details of these implementations are not the same, and hence their performances differ from one application to another. We are interested in their performance in the classification of cells using the single-cell RNA-Sequencing data. In this paper, we conducted a benchmark study using 22 Single-Cell RNA-sequencing data sets. Using cross-validation, we compare packages’ prediction performances based on their Precision, Recall, F1-score, Area Under the Curve (AUC). We also compared the Complexity and Run-time of these R packages. Our study shows that rpart and evtree have the best Precision; evtree is the best in Recall, F1-score and AUC; C5.0 prefers more complex trees; tree is consistently much faster than others, although its complexity is often higher than others.

Download Full-text