Inferring Better Gene Regulation Networks from Single Cell Data

Author(s):  
Michael P.H. Stumpf
2018 ◽  
Vol 15 (5) ◽  
pp. 055001 ◽  
Author(s):  
Lisa Weber ◽  
William Raymond ◽  
Brian Munsky

2020 ◽  
Author(s):  
Jacob Hepkema ◽  
Nicholas Keone Lee ◽  
Benjamin J. Stewart ◽  
Siwat Ruangroengkulrith ◽  
Varodom Charoensawan ◽  
...  

AbstractBinding of transcription factors (TFs) at proximal promoters and distal enhancers is central to gene regulation. Yet, identification of TF binding sites, also known as regulatory motifs, and quantification of their impact remains challenging. Here we present scover, a convolutional neural network model that can discover putative regulatory motifs along with their cell type-specific importance from single-cell data. Analysis of scRNA-seq data from human kidney shows that ETS, YY1 and NRF1 are the most important motif families for proximal promoters. Using multiple mouse tissues we obtain for the first time a model with cell type resolution which explains 34% of the variance in gene expression. Finally, by applying scover to distal enhancers identified using scATAC-seq from the mouse cerebral cortex we highlight the emergence of layer specific regulatory patterns during development.


2017 ◽  
Author(s):  
Lisa Weber ◽  
William Raymond ◽  
Brian Munsky

AbstractIn quantitative analyses of biological processes, one may use many different scales of models (e.g., spatial or non-spatial, deterministic or stochastic, time-varying or at steady-state) or many different approaches to match models to experimental data (e.g., model fitting or parameter uncertainty/sloppiness quantification with different experiment designs). These different analyses can lead to surprisingly different results, even when applied to the same data and the same model. We use a simplified gene regulation model to illustrate many of these concerns, especially for ODE analyses of deterministic processes, chemical master equation and finite state projection analyses of heterogeneous processes, and stochastic simulations. For each analysis, we employ Matlab and Python software to consider a time-dependent input signal (e.g., a kinase nuclear translocation) and several model hypotheses, along with simulated single-cell data. We illustrate different approaches (e.g., deterministic and stochastic) to identify the mechanisms and parameters of the same model from the same simulated data. For each approach, we explore how uncertainty in parameter space varies with respect to the chosen analysis approach or specific experiment design. We conclude with a discussion of how our simulated results relate to the integration of experimental and computational investigations to explore signal-activated gene expression models in yeast [1] and human cells [2]‡.PACS numbers: 87.10.+e, 87.15.Aa, 05.10.Gg, 05.40.Ca,02.50.-rSubmitted to: Phys. Biol.


2021 ◽  
Author(s):  
Jordan W. Squair ◽  
Michael A. Skinnider ◽  
Matthieu Gautier ◽  
Leonard J. Foster ◽  
Grégoire Courtine
Keyword(s):  

2021 ◽  
Vol 22 (S3) ◽  
Author(s):  
Yuanyuan Li ◽  
Ping Luo ◽  
Yi Lu ◽  
Fang-Xiang Wu

Abstract Background With the development of the technology of single-cell sequence, revealing homogeneity and heterogeneity between cells has become a new area of computational systems biology research. However, the clustering of cell types becomes more complex with the mutual penetration between different types of cells and the instability of gene expression. One way of overcoming this problem is to group similar, related single cells together by the means of various clustering analysis methods. Although some methods such as spectral clustering can do well in the identification of cell types, they only consider the similarities between cells and ignore the influence of dissimilarities on clustering results. This methodology may limit the performance of most of the conventional clustering algorithms for the identification of clusters, it needs to develop special methods for high-dimensional sparse categorical data. Results Inspired by the phenomenon that same type cells have similar gene expression patterns, but different types of cells evoke dissimilar gene expression patterns, we improve the existing spectral clustering method for clustering single-cell data that is based on both similarities and dissimilarities between cells. The method first measures the similarity/dissimilarity among cells, then constructs the incidence matrix by fusing similarity matrix with dissimilarity matrix, and, finally, uses the eigenvalues of the incidence matrix to perform dimensionality reduction and employs the K-means algorithm in the low dimensional space to achieve clustering. The proposed improved spectral clustering method is compared with the conventional spectral clustering method in recognizing cell types on several real single-cell RNA-seq datasets. Conclusions In summary, we show that adding intercellular dissimilarity can effectively improve accuracy and achieve robustness and that improved spectral clustering method outperforms the traditional spectral clustering method in grouping cells.


Cell ◽  
2021 ◽  
Author(s):  
Yuhan Hao ◽  
Stephanie Hao ◽  
Erica Andersen-Nissen ◽  
William M. Mauck ◽  
Shiwei Zheng ◽  
...  

Author(s):  
Zhen Miao ◽  
Benjamin D. Humphreys ◽  
Andrew P. McMahon ◽  
Junhyong Kim

Sign in / Sign up

Export Citation Format

Share Document