Reconstruction of regulatory networks to predict gene function in pancreatic development and disease

2020 ◽  
Author(s):  
S Heller ◽  
J Schwab ◽  
M Breunig ◽  
Kestler HA ◽  
A Kleger
2019 ◽  
Author(s):  
Matej Mihelčić ◽  
Tomislav Šmuc ◽  
Fran Supek

AbstractGenes with similar roles in the cell are known to cluster on chromosomes, thus benefiting from coordinated regulation. This allows gene function to be inferred by transferring annotations from genomic neighbors, following the guilt-by-association principle. We performed a systematic search for co-occurrence of >1000 gene functions in genomic neighborhoods across 1669 prokaryotic, 49 fungal and 80 metazoan genomes, revealing prevalent patterns that cannot be explained by clustering of functionally similar genes. It is a very common occurrence that pairs of dissimilar gene functions – corresponding to semantically distant Gene Ontology terms – are significantly co-located on chromosomes. These neighborhood associations are often as conserved across genomes as the known associations between similar functions, suggesting selective benefits from clustering of certain diverse functions, which may conceivably play complementary roles in the cell. We propose a simple encoding of chromosomal gene order, the neighborhood function profiles (NFP), which draws on diverse gene clustering patterns to predict gene function and phenotype. NFPs yield a 26-46% increase in predictive power over state-of-the-art approaches that propagate function across neighborhoods, thus providing hundreds of novel, high-confidence gene function inferences per genome. Furthermore, we demonstrate that the effect of structural variation on gene function distribution across chromosomes may be used to predict phenotype of individuals from their genome sequence.


2020 ◽  
Vol 30 (20) ◽  
pp. 3961-3971.e6
Author(s):  
Aditya C. Bandekar ◽  
Sishir Subedi ◽  
Thomas R. Ioerger ◽  
Christopher M. Sassetti

2021 ◽  
Author(s):  
Alexander Lachmann ◽  
Kaeli Rizzo ◽  
Alon Bartal ◽  
Minji Jeon ◽  
Daniel J. B. Clarke ◽  
...  

Gene co-expression correlations from mRNA-sequencing (RNA-seq) can be used to predict gene function based on the covariance structure that exists within such data. In the past, we showed that RNA-seq co-expression data is highly predictive of gene function and protein-protein interactions. We demonstrated that the performance of such predictions is dependent on the source of the gene expression data. Furthermore, since genes function in different cellular contexts, predictions derived from tissue-specific gene co-expression data outperform predictions derived from cross-tissue gene co-expression data. However, the identification of the optimal tissue type to maximize gene function predictions for all mammalian genes is not trivial. Here we introduce and validate an approach we term Partitioning RNA-seq data Into Segments for Massive co-EXpression-based gene function Predictions (PrismExp), for improved gene function prediction based on RNA-seq co-expression data. With coexpression data from ARCHS4, we apply PrismExp to predict a wide variety of gene functions, including pathway membership, phenotypic associations, and protein-protein interactions. PrismExp outperforms the cross-tissue co-expression correlation matrix approach on all tested domains. Hence, PrismExp can enhance machine learning methods that utilize RNA-seq coexpression correlations to impute knowledge about understudied genes and proteins.


Author(s):  
Qiao Wen Tan ◽  
William Goh ◽  
Marek Mutwil

AbstractAs genomes become more and more available, gene function prediction presents itself as one of the major hurdles in our quest to extract meaningful information on the biological processes genes participate in. In order to facilitate gene function prediction, we show how our user-friendly pipeline, Large-Scale Transcriptomic Analysis Pipeline in Cloud (LSTrAP-Cloud), can be useful in helping biologists make a shortlist of genes that they might be interested in. LSTrAP-Cloud is based on Google Colaboratory and provides user-friendly tools that process and quality-control RNA sequencing data streamed from the European Sequencing Archive. LSTRAP-Cloud outputs a gene co-expression network that can be used to identify functionally related genes for any organism with a sequenced genome and publicly available RNA sequencing data. Here, we used the biosynthesis pathway of Nicotiana tabacum as a case study to demonstrate how enzymes, transporters and transcription factors involved in the synthesis, transport and regulation of nicotine can be identified using our pipeline.


2007 ◽  
Vol 73 (15) ◽  
pp. 4717-4724 ◽  
Author(s):  
Susan R. Steyert ◽  
Silvia A. Pineiro

ABSTRACT Bdellovibrio bacteriovorus is a species of unique obligate predatory bacteria that utilize gram-negative bacteria as prey. Their life cycle alternates between a motile extracellular phase and a growth phase within the prey cell periplasm. The mechanism of prey cell invasion and the genetic networks and regulation during the life cycle have not been elucidated. The obligate predatory nature of the B. bacteriovorus life cycle suggests the use of this bacterium in potential applications involving pathogen control but adds complexity to the development of practical genetic systems that can be used to determine gene function. This work reports the development of a genetic technique for allelic exchange or gene inactivation by construction of in-frame markerless deletion mutants including the use of a counterselectable marker in B. bacteriovorus. A suicide plasmid carrying the sacB gene for counterselection was used to inactivate the strB gene in B. bacteriovorus HD100 by an in-frame deletion. Despite the inactivation of the strB gene, B. bacteriovorus was found to retain resistance to high concentrations of streptomycin. The stability of a plasmid for use in complementation experiments was also investigated, and it was determined that pMMB206 replicates autonomously in B. bacteriovorus. Development of this practical genetic system now facilitates the study of B. bacteriovorus at the molecular level and will aid in understanding the regulatory networks and gene function in this fascinating predatory bacterium.


2009 ◽  
Vol 10 (9) ◽  
pp. R97 ◽  
Author(s):  
James C Costello ◽  
Mehmet M Dalkilic ◽  
Scott M Beason ◽  
Jeff R Gehlhausen ◽  
Rupali Patwardhan ◽  
...  

2019 ◽  
Author(s):  
Qiao Wen Tan ◽  
Marek Mutwil

0.ABSTRACTPrediction of gene function and gene regulatory networks is one of the most active topics in bioinformatics. The accumulation of publicly available gene expression data for hundreds of plant species, together with advances in bioinformatical methods and affordable computing, sets ingenuity as the major bottleneck in understanding gene function and regulation. Here, we show how a credit card-sized computer retailing for less than 50 USD can be used to rapidly predict gene function and infer regulatory networks from RNA sequencing data. To achieve this, we constructed a bioinformatical pipeline that downloads and allows quality-control of RNA sequencing data; and generates a gene co-expression network that can reveal enzymes and transcription factors participating and controlling a given biosynthetic pathway. We exemplify this by first identifying genes and transcription factors involved in the biosynthesis of secondary cell wall in the plant Artemisia annua, the main natural source of the anti-malarial drug artemisinin. Networks were then used to dissect the artemisinin biosynthesis pathway, which suggest potential transcription factors regulating artemisinin biosynthesis. We provide the source code of our pipeline and envision that the ubiquity of affordable computing, availability of biological data and increased bioinformatical training of biologists will transform the field of bioinformatics.HighlightsProcessing of large scale transcriptomic data with affordable single-board computersTranscription factors can be found in the same network as their targetsCo-expression of transcription factors and genes in secondary cell wall biosynthesisCo-expression of transcription factors and genes involved in artemisinin biosynthesis


Sign in / Sign up

Export Citation Format

Share Document