gene function prediction Latest Research Papers

Abstract: The function of most genes is unknown. The best results in automated function prediction are obtained with machine learning-based methods that combine multiple data sources, typically sequence-derived features, protein structure and interaction data. Even though there is ample evidence showing that a gene’s function is not independent of its location, the few available examples of gene function prediction based on gene location rely on sequence identity between genes of different organisms and are thus subject to the limitations of the relationship between sequence and function. Here we predict thousands of gene functions in five model eukaryotes (Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster, Mus musculus and Homo sapiens) using machine learning models exclusively trained with features derived from the location of genes in the genomes to which they belong. Our aim was not to obtain the best performing method for automated function prediction but to explore the extent to which a gene's location can predict its function in eukaryotes. We found that our models outperform BLAST when predicting terms from Biological Process and Cellular Component Ontologies, showing that, at least in some cases, gene location alone can be more useful than sequence to infer gene function.

Supervised Gene Function Prediction Using Spectral Clustering on Gene Co-expression Networks

Complex Networks & Their Applications X - Studies in Computational Intelligence ◽

10.1007/978-3-030-93413-2_54 ◽

2022 ◽

pp. 652-663

Author(s):

Miguel Romero ◽

Óscar Ramírez ◽

Jorge Finke ◽

Camilo Rocha

Keyword(s):

Gene Function ◽

Spectral Clustering ◽

Function Prediction ◽

Gene Function Prediction

Gene function prediction in five model eukaryotes based on gene relative location through machine learning

10.1101/2021.08.27.457944 ◽

2021 ◽

Author(s):

Flavio Pazos Obregón ◽

Diego Silvera ◽

Pablo Soto ◽

Patricio Yankilevich ◽

Gustavo Guerberoff ◽

...

Keyword(s):

Machine Learning ◽

Gene Function ◽

Homo Sapiens ◽

Function Prediction ◽

Gene Location ◽

Supplementary Information ◽

Relative Location ◽

Gene Function Prediction ◽

Ample Evidence ◽

And Function

Motivation: The function of most genes is unknown. The best results in gene function prediction are obtained with machine learning-based methods that combine multiple data sources, typically sequence-derived features, protein structure and interaction data. Even though there is ample evidence showing that a gene's function is not independent of its location, the few available examples of gene function prediction based on gene location rely on sequence identity between genes of different organisms and are thus subject to the limitations of the relationship between sequence and function. Results: Here we predict thousands of gene functions in five eukaryotes (Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster, Mus musculus and Homo sapiens) using machine learning models trained with features derived from the location of genes in the genomes to which they belong. To the best of our knowledge, this is the first work in which gene function prediction is successfully achieved in eukaryotic genomes using predictive features derived exclusively from the relative location of the genes. Supplementary information: http://gfpml.bnd.edu.uy

Accurate and efficient gene function prediction using a multi-bacterial network

Bioinformatics ◽

10.1093/bioinformatics/btaa885 ◽

2020 ◽

Author(s):

Jeffrey N Law ◽

Shiv D Kale ◽

T M Murali

Keyword(s):

Gene Function ◽

Bacterial Species ◽

Heterogeneous Data ◽

Function Prediction ◽

Label Propagation ◽

Supplementary Information ◽

Gene Function Prediction ◽

Functional Annotations ◽

A Genome ◽

Multiple Species

Motivation: Nearly 40% of the genes in sequenced genomes have no experimentally or computationally derived functional annotations. To fill this gap, we seek to develop methods for network-based gene function prediction that can integrate heterogeneous data for multiple species with experimentally based functional annotations and systematically transfer them to newly sequenced organisms on a genome-wide scale. However, the large sizes of such networks pose a challenge for the scalability of current methods. Results: We develop a label propagation algorithm called FastSinkSource. By formally bounding its rate of progress, we decrease the running time by a factor of 100 without sacrificing accuracy. We systematically evaluate many approaches to construct multi-species bacterial networks and apply FastSinkSource and other state-of-the-art methods to these networks. We find that the most accurate and efficient approach is to pre-compute annotation scores for species with experimental annotations, and then to transfer them to other organisms. In this manner, FastSinkSource runs in under 3 min for 200 bacterial species. Availability and implementation: An implementation of our framework and all data used in this research are available at https://github.com/Murali-group/multi-species-GOA-prediction. Supplementary information: Supplementary data are available at Bioinformatics online.
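
The abstract above refers to label propagation on a gene network. The sketch below illustrates the general idea only and is not the authors' FastSinkSource, whose update rule and convergence bounds are not given in this listing. It assumes a simple scheme in which annotated genes keep fixed scores (1 for positives, 0 for negatives) and each unannotated gene repeatedly takes the weighted average of its neighbors' scores. The function name, gene names and edge weights are all hypothetical.

```python
def propagate_labels(edges, positives, negatives, iterations=50):
    """Generic label propagation (assumed scheme, not FastSinkSource):
    labeled nodes stay fixed, unlabeled nodes repeatedly take the
    weighted average of their neighbors' scores."""
    # Build an undirected weighted adjacency map.
    neighbors = {}
    for u, v, w in edges:
        neighbors.setdefault(u, {})[v] = w
        neighbors.setdefault(v, {})[u] = w

    # Positives start (and stay) at 1.0; everything else starts at 0.0.
    scores = {node: 0.0 for node in neighbors}
    for node in positives:
        scores[node] = 1.0
    fixed = set(positives) | set(negatives)

    for _ in range(iterations):
        updated = dict(scores)
        for node, nbrs in neighbors.items():
            if node in fixed:
                continue  # annotated genes keep their score
            total = sum(nbrs.values())
            updated[node] = sum(w * scores[n] for n, w in nbrs.items()) / total
        scores = updated
    return scores


# Hypothetical toy network: a chain geneA - geneB - geneC - geneD.
edges = [("geneA", "geneB", 1.0), ("geneB", "geneC", 1.0), ("geneC", "geneD", 1.0)]
scores = propagate_labels(edges, positives={"geneA"}, negatives={"geneD"})
```

In this toy chain the unannotated gene closer to the positive example ends up with the higher score, which is the behavior such methods rely on when ranking candidate genes for a function.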

Bacterial community structure and gene function prediction in response to long-term running of dual graphene modified bioelectrode bioelectrochemical systems

Bioresource Technology ◽

10.1016/j.biortech.2020.123398 ◽

2020 ◽

Vol 309 ◽

pp. 123398 ◽

Cited By ~ 4

Author(s):

Junfeng Chen ◽

Yanyan Liu ◽

Yuewei Yang ◽

Meizhen Tang ◽

Renjun Wang ◽

...

Keyword(s):

Community Structure ◽

Bacterial Community ◽

Gene Function ◽

Bacterial Community Structure ◽

Function Prediction ◽

Bioelectrochemical Systems ◽

Gene Function Prediction

Machine learning: A powerful tool for gene function prediction in plants

Applications in Plant Sciences ◽

10.1002/aps3.11376 ◽

2020 ◽

Vol 8 (7) ◽

Cited By ~ 5

Author(s):

Elizabeth H. Mahood ◽

Lars H. Kruse ◽

Gaurav D. Moghe

Keyword(s):

Machine Learning ◽

Gene Function ◽

Function Prediction ◽

Gene Function Prediction

A Literature Review of Gene Function Prediction by Modeling Gene Ontology

Frontiers in Genetics ◽

10.3389/fgene.2020.00400 ◽

2020 ◽

Vol 11 ◽

Cited By ~ 1

Author(s):

Yingwen Zhao ◽

Jun Wang ◽

Jian Chen ◽

Xiangliang Zhang ◽

Maozu Guo ◽

...

Keyword(s):

Gene Ontology ◽

Literature Review ◽

Gene Function ◽

Function Prediction ◽

Gene Function Prediction

LSTrAP-Cloud: A User-Friendly Cloud Computing Pipeline to Infer Coexpression Networks

Genes ◽

10.3390/genes11040428 ◽

2020 ◽

Vol 11 (4) ◽

pp. 428 ◽

Cited By ~ 3

Author(s):

Qiao Wen Tan ◽

William Goh ◽

Marek Mutwil

Keyword(s):

Rna Sequencing ◽

Gene Function ◽

Large Scale ◽

Single Gene ◽

Function Prediction ◽

Sequencing Data ◽

Gene Function Prediction ◽

European Nucleotide Archive ◽

User Friendly ◽

Coexpression Networks

As genomes become more and more available, gene function prediction presents itself as one of the major hurdles in our quest to extract meaningful information on the biological processes genes participate in. In order to facilitate gene function prediction, we show how our user-friendly pipeline, the Large-Scale Transcriptomic Analysis Pipeline in Cloud (LSTrAP-Cloud), can be useful in helping biologists make a shortlist of genes involved in a biological process that they might be interested in, by using a single gene of interest as bait. The LSTrAP-Cloud is based on Google Colaboratory, and provides user-friendly tools that process and quality-control RNA sequencing data streamed from the European Nucleotide Archive. The LSTrAP-Cloud outputs a gene coexpression network that can be used to identify functionally related genes for any organism with a sequenced genome and publicly available RNA sequencing data. Here, we used the biosynthesis pathway of Nicotiana tabacum as a case study to demonstrate how enzymes, transporters, and transcription factors involved in the synthesis, transport, and regulation of nicotine can be identified using our pipeline.
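
The bait-gene idea described above can be illustrated with a small ranking sketch. This is not the LSTrAP-Cloud code: the abstract does not say which coexpression measure the pipeline uses, so Pearson correlation is an assumption here, and the function names, gene names and expression values are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def rank_by_coexpression(expression, bait):
    """Rank every other gene by its correlation with the bait gene,
    most strongly coexpressed first."""
    bait_profile = expression[bait]
    ranked = [
        (gene, pearson(bait_profile, profile))
        for gene, profile in expression.items()
        if gene != bait
    ]
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


# Hypothetical expression values across four samples (invented numbers).
expression = {
    "bait": [1.0, 2.0, 3.0, 4.0],
    "gene1": [2.0, 4.0, 6.0, 8.0],  # rises with the bait
    "gene2": [4.0, 3.0, 2.0, 1.0],  # falls as the bait rises
    "gene3": [1.0, 3.0, 2.0, 4.0],  # partially follows the bait
}
shortlist = rank_by_coexpression(expression, "bait")
```

The top of `shortlist` plays the role of the candidate gene list the abstract mentions; a real analysis would work from many samples and apply a correlation cutoff before interpreting any candidate.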

Network aggregation improves gene function prediction of grapevine gene co-expression networks

Plant Molecular Biology ◽

10.1007/s11103-020-01001-2 ◽

2020 ◽

Vol 103 (4-5) ◽

pp. 425-441 ◽

Cited By ~ 2

Author(s):

Darren C. J. Wong

Keyword(s):

Gene Function ◽

Function Prediction ◽

Gene Function Prediction ◽

Network Aggregation

Integrating multi-network topology for gene function prediction using deep neural networks

Briefings in Bioinformatics ◽

10.1093/bib/bbaa036 ◽

2020 ◽

Cited By ~ 3

Author(s):

Jiajie Peng ◽

Hansheng Xue ◽

Zhongyu Wei ◽

Idil Tuncali ◽

Jianye Hao ◽

...

Keyword(s):

Gene Function ◽

Biological Networks ◽

Feature Learning ◽

Learning Task ◽

Function Prediction ◽

Feature Representation ◽

Superior Performance ◽

Gene Function Prediction ◽

Multiple Networks ◽

Low Dimensional

Motivation: The emergence of abundant biological networks, which benefit from the development of advanced high-throughput techniques, contributes to describing and modeling complex internal interactions among biological entities such as genes and proteins. Multiple networks provide rich information for inferring the function of genes or proteins. To extract functional patterns of genes based on multiple heterogeneous networks, network embedding-based methods, aiming to capture non-linear and low-dimensional feature representation based on network biology, have recently achieved remarkable performance in gene function prediction. However, existing methods do not consider the shared information among different networks during the feature learning process. Results: Taking the correlation among the networks into account, we design a novel semi-supervised autoencoder method to integrate multiple networks and generate a low-dimensional feature representation. Then we utilize a convolutional neural network based on the integrated feature embedding to annotate unlabeled gene functions. We test our method on both yeast and human datasets and compare with three state-of-the-art methods. The results demonstrate the superior performance of our method. We not only provide a comprehensive analysis of the performance of the newly proposed algorithm but also provide a tool for extracting features of genes based on multiple networks, which can be used in the downstream machine learning task. Availability: DeepMNE-CNN is freely available at https://github.com/xuehansheng/DeepMNE-CNN

gene function prediction
Recently Published Documents

Gene Function Prediction in Five Model Eukaryotes Exclusively Based on Gene Relative Location Through Machine Learning

Supervised Gene Function Prediction Using Spectral Clustering on Gene Co-expression Networks

Gene function prediction in five model eukaryotes based on gene relative location through machine learning

Accurate and efficient gene function prediction using a multi-bacterial network

Bacterial community structure and gene function prediction in response to long-term running of dual graphene modified bioelectrode bioelectrochemical systems

Machine learning: A powerful tool for gene function prediction in plants

A Literature Review of Gene Function Prediction by Modeling Gene Ontology

LSTrAP-Cloud: A User-Friendly Cloud Computing Pipeline to Infer Coexpression Networks

Network aggregation improves gene function prediction of grapevine gene co-expression networks

Integrating multi-network topology for gene function prediction using deep neural networks
