scholarly journals Inferring biosynthetic and gene regulatory networks from Artemisia annua RNA sequencing data on a credit card-sized ARM computer

Author(s):  
Qiao Wen Tan ◽  
Marek Mutwil
2019 ◽  
Author(s):  
Qiao Wen Tan ◽  
Marek Mutwil

0.ABSTRACTPrediction of gene function and gene regulatory networks is one of the most active topics in bioinformatics. The accumulation of publicly available gene expression data for hundreds of plant species, together with advances in bioinformatical methods and affordable computing, sets ingenuity as the major bottleneck in understanding gene function and regulation. Here, we show how a credit card-sized computer retailing for less than 50 USD can be used to rapidly predict gene function and infer regulatory networks from RNA sequencing data. To achieve this, we constructed a bioinformatical pipeline that downloads and allows quality-control of RNA sequencing data; and generates a gene co-expression network that can reveal enzymes and transcription factors participating and controlling a given biosynthetic pathway. We exemplify this by first identifying genes and transcription factors involved in the biosynthesis of secondary cell wall in the plant Artemisia annua, the main natural source of the anti-malarial drug artemisinin. Networks were then used to dissect the artemisinin biosynthesis pathway, which suggest potential transcription factors regulating artemisinin biosynthesis. We provide the source code of our pipeline and envision that the ubiquity of affordable computing, availability of biological data and increased bioinformatical training of biologists will transform the field of bioinformatics.HighlightsProcessing of large scale transcriptomic data with affordable single-board computersTranscription factors can be found in the same network as their targetsCo-expression of transcription factors and genes in secondary cell wall biosynthesisCo-expression of transcription factors and genes involved in artemisinin biosynthesis


Author(s):  
Rui-Qi Wang ◽  
Wei Zhao ◽  
Hai-Kui Yang ◽  
Jia-Mei Dong ◽  
Wei-Jie Lin ◽  
...  

Colorectal cancer (CRC) manifests as gastrointestinal tumors with high intratumoral heterogeneity. Recent studies have demonstrated that CRC may consist of tumor cells with different consensus molecular subtypes (CMS). The advancements in single-cell RNA sequencing have facilitated the development of gene regulatory networks to decode key regulators for specific cell types. Herein, we comprehensively analyzed the CMS of CRC patients by using single-cell RNA-sequencing data. CMS for all malignant cells were assigned using CMScaller. Gene set variation analysis showed pathway activity differences consistent with those reported in previous studies. Cell–cell communication analysis confirmed that CMS1 was more closely related to immune cells, and that monocytes and macrophages play dominant roles in the CRC tumor microenvironment. On the basis of the constructed gene regulation networks (GRNs) for each subtype, we identified that the critical transcription factor ERG is universally activated and upregulated in all CMS in comparison with normal cells, and that it performed diverse roles by regulating the expression of different downstream genes. In summary, molecular subtyping of single-cell RNA-sequencing data for colorectal cancer could elucidate the heterogeneity in gene regulatory networks and identify critical regulators of CRC.


2019 ◽  
Vol 101 (3) ◽  
pp. 716-730 ◽  
Author(s):  
Ryan J. Spurney ◽  
Lisa Van den Broeck ◽  
Natalie M. Clark ◽  
Adam P. Fisher ◽  
Maria A. de Luis Balaguer ◽  
...  

2021 ◽  
Author(s):  
Boris M. Brenerman ◽  
Benjamin D. Shapiro ◽  
Michael C. Schatz ◽  
Alexis Battle

AbstractSingle-cell RNA sequencing data contain patterns of correlation that are poorly captured by techniques that rely on linear estimation or assumptions of Gaussian behavior. We apply random forest regression to scRNAseq data from mouse brains, which identifies the co-regulation of genes within specific cellular contexts. By analyzing the estimators of the random forest, we identify several novel candidate gene regulatory networks and compare these networks in aged and young mice. We demonstrate that cell populations have cell-type specific phenotypes of aging that are not detected by other methods, including the collapse of differentiating oligodendrocytes but not precursors or mature oligodendrocytes.


2018 ◽  
Author(s):  
Maria Angels de Luis Balaguer ◽  
Ryan J. Spurney ◽  
Natalie M. Clark ◽  
Adam P. Fisher ◽  
Rosangela Sozzani

ABSTRACTPredicting gene regulatory networks (GRNs) from gene expression profiles has become a common approach for identifying important biological regulators. Despite the increase in the use of inference methods, existing computational approaches do not integrate RNA-sequencing data analysis, are often not automated, and are restricted to users with bioinformatics and programming backgrounds. To address these limitations, we have developed TuxNet, an integrated user-friendly platform, which, with just a few selections, allows to process raw RNA-sequencing data (using the Tuxedo pipeline) and infer GRNs from these processed data. TuxNet is implemented as a graphical user interface and, using expression data from any organism with an existing reference genome, can mine the regulations among genes either by applying a dynamic Bayesian network inference algorithm, GENIST, or a regression tree-based pipeline that uses spatiotemporal data, RTP-STAR. To illustrate the use of TuxNet while getting insight into the regulatory cascade downstream of the Arabidopsis root stem cell regulator PERIANTHIA (PAN), we obtained time course gene expression data of a PAN inducible line and inferred a GRN using GENIST. Using RTP-STAR, we then inferred the network of a PAN secondary downstream gene, ATHB13, for which we obtained wildtype and mutant expression profiles. Our case studies feature the versatility of TuxNet to infer networks using different types of gene expression data (i.e time course and steady-state data) as well as how inference networks are used to identify important regulators.SUMMARYTuxNet offers a simple interface for non-computational biologists to infer GRNs from raw RNA-seq data.


eLife ◽  
2020 ◽  
Vol 9 ◽  
Author(s):  
Christopher A Jackson ◽  
Dayanne M Castro ◽  
Giuseppe-Antonio Saldi ◽  
Richard Bonneau ◽  
David Gresham

Understanding how gene expression programs are controlled requires identifying regulatory relationships between transcription factors and target genes. Gene regulatory networks are typically constructed from gene expression data acquired following genetic perturbation or environmental stimulus. Single-cell RNA sequencing (scRNAseq) captures the gene expression state of thousands of individual cells in a single experiment, offering advantages in combinatorial experimental design, large numbers of independent measurements, and accessing the interaction between the cell cycle and environmental responses that is hidden by population-level analysis of gene expression. To leverage these advantages, we developed a method for scRNAseq in budding yeast (Saccharomyces cerevisiae). We pooled diverse transcriptionally barcoded gene deletion mutants in 11 different environmental conditions and determined their expression state by sequencing 38,285 individual cells. We benchmarked a framework for learning gene regulatory networks from scRNAseq data that incorporates multitask learning and constructed a global gene regulatory network comprising 12,228 interactions.


Biometrics ◽  
2017 ◽  
Vol 73 (4) ◽  
pp. 1221-1230 ◽  
Author(s):  
Bochao Jia ◽  
Suwa Xu ◽  
Guanghua Xiao ◽  
Vishal Lamba ◽  
Faming Liang

2018 ◽  
Vol 36 (5) ◽  
pp. 1356-1369 ◽  
Author(s):  
Scott M. Riester ◽  
Yang Lin ◽  
Wei Wang ◽  
Lin Cong ◽  
Abdel-Moneim Mohamed Ali ◽  
...  

2021 ◽  
Vol 11 ◽  
Author(s):  
Abbas Karimi Rizi ◽  
Mina Zamani ◽  
Amirhossein Shirazi ◽  
G. Reza Jafari ◽  
János Kertész

Genes communicate with each other through different regulatory effects, which lead to the emergence of complex network structures in cells, and such structures are expected to be different for normal and cancerous cells. To study these differences, we have investigated the Gene Regulatory Network (GRN) of cells as inferred from RNA-sequencing data. The GRN is a signed weighted network corresponding to the inductive or inhibitory interactions. Here we focus on a particular of motifs in the GRN, the triangles, which are imbalanced if the number of negative interactions is odd. By studying the stability of imbalanced triangles in the GRN, we show that the network of cancerous cells has fewer imbalanced triangles compared to normal cells. Moreover, in the normal cells, imbalanced triangles are isolated from the main part of the network, while such motifs are part of the network's giant component in cancerous cells. Our result demonstrates that due to genes' collective behavior the structure of the complex networks is different in cancerous cells from those in normal ones.


Sign in / Sign up

Export Citation Format

Share Document