Integrating multi-network topology for gene function prediction using deep neural networks

Abstract

Motivation: The emergence of abundant biological networks, made possible by advanced high-throughput techniques, helps describe and model the complex internal interactions among biological entities such as genes and proteins. Multiple networks provide rich information for inferring the function of genes or proteins. To extract functional patterns of genes from multiple heterogeneous networks, network embedding-based methods, which aim to capture non-linear, low-dimensional feature representations based on network biology, have recently achieved remarkable performance in gene function prediction. However, existing methods do not consider the information shared among different networks during the feature learning process.

Results: Taking the correlation among the networks into account, we design a novel semi-supervised autoencoder method to integrate multiple networks and generate a low-dimensional feature representation. We then apply a convolutional neural network to the integrated feature embedding to annotate unlabeled gene functions. We test our method on both yeast and human datasets and compare it with three state-of-the-art methods. The results demonstrate the superior performance of our method. We provide not only a comprehensive analysis of the performance of the newly proposed algorithm but also a tool for extracting gene features from multiple networks, which can be used in downstream machine learning tasks.

Availability: DeepMNE-CNN is freely available at https://github.com/xuehansheng/DeepMNE-CNN

Integrating multi-network topology for gene function prediction using deep neural networks

10.1101/532408 ◽

2019 ◽

Author(s):

Hansheng Xue ◽

Jiajie Peng ◽

Xuequn Shang

Keyword(s):

Neural Network ◽

Gene Function ◽

State Of The Art ◽

Feature Learning ◽

Function Prediction ◽

Feature Representation ◽

Superior Performance ◽

Gene Function Prediction ◽

Multiple Networks ◽

Low Dimensional

Abstract

Motivation: The emergence of abundant biological networks, made possible by advanced high-throughput techniques, helps describe and model the complex internal interactions among biological entities such as genes and proteins. Multiple networks provide rich information for inferring the function of genes or proteins. To extract functional patterns of genes from multiple heterogeneous networks, network embedding-based methods, which aim to capture non-linear, low-dimensional feature representations based on network biology, have recently achieved remarkable performance in gene function prediction. However, most existing methods do not consider the information shared among different networks during the feature learning process. We therefore propose DeepMNE-CNN, a novel multi-network embedding-based function prediction method built on a semi-supervised autoencoder and a feature convolutional neural network, which captures the complex topological structures of multiple networks and takes the correlation among them into account.

Results: We design a novel semi-supervised autoencoder method to integrate multiple networks and generate a low-dimensional feature representation. We then apply a convolutional neural network to the integrated feature embedding to annotate unlabeled gene functions. We test our method on both yeast and human datasets and compare it with four state-of-the-art methods. The results demonstrate the superior performance of our method over these four algorithms. Further exploration shows that both the semi-supervised autoencoder-based multi-network integration and the CNN-based feature learning contribute to function prediction.

Availability: DeepMNE-CNN is freely available at https://github.com/xuehansheng/DeepMNE-CNN

Automated gene function prediction through gene multifunctionality in biological networks

Neurocomputing ◽

10.1016/j.neucom.2015.04.007 ◽

2015 ◽

Vol 162 ◽

pp. 48-56 ◽

Cited By ~ 13

Author(s):

Marco Frasca

Keyword(s):

Gene Function ◽

Biological Networks ◽

Function Prediction ◽

Gene Function Prediction

Faculty Opinions recommendation of The art of gene function prediction.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1056759.508687 ◽

2006 ◽

Author(s):

Martin Noble

Keyword(s):

Gene Function ◽

Function Prediction ◽

Gene Function Prediction

Faculty Opinions recommendation of Network-Based Gene Function Prediction in Mouse and Other Model Vertebrates Using MouseNet Server.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.727562216.793535246 ◽

2017 ◽

Author(s):

John Hancock

Keyword(s):

Gene Function ◽

Function Prediction ◽

Gene Function Prediction

Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression

PLoS ONE ◽

10.1371/journal.pone.0134668 ◽

2015 ◽

Vol 10 (8) ◽

pp. e0134668 ◽

Cited By ~ 12

Author(s):

Sonja Lehtinen ◽

Jon Lees ◽

Jürg Bähler ◽

John Shawe-Taylor ◽

Christine Orengo

Keyword(s):

Least Squares ◽

Partial Least Squares ◽

Gene Function ◽

Partial Least Squares Regression ◽

Function Prediction ◽

Least Squares Regression ◽

Gene Function Prediction ◽

Functional Association ◽

Kernel Partial Least Squares

Using similarity learning to improve network-based gene function prediction

2012 IEEE International Conference on Bioinformatics and Biomedicine ◽

10.1109/bibm.2012.6392663 ◽

2012 ◽

Cited By ~ 1

Author(s):

Ngo Phuong Nhung ◽

Tu Minh Phuong

Keyword(s):

Gene Function ◽

Function Prediction ◽

Similarity Learning ◽

Gene Function Prediction

Gene Function Prediction and Functional Network: The Role of Gene Ontology

Intelligent Systems Reference Library - Data Mining: Foundations and Intelligent Paradigms ◽

10.1007/978-3-642-23151-3_7 ◽

2012 ◽

pp. 123-162 ◽

Cited By ~ 1

Author(s):

Erliang Zeng ◽

Chris Ding ◽

Kalai Mathee ◽

Lisa Schneper ◽

Giri Narasimhan

Keyword(s):

Gene Ontology ◽

Gene Function ◽

Function Prediction ◽

Functional Network ◽

Gene Function Prediction

A hierarchical multi-label classification method based on neural networks for gene function prediction

Biotechnology & Biotechnological Equipment ◽

10.1080/13102818.2018.1521302 ◽

2018 ◽

Vol 32 (6) ◽

pp. 1613-1621 ◽

Cited By ~ 4

Author(s):

Shou Feng ◽

Ping Fu ◽

Wenbin Zheng

Keyword(s):

Neural Networks ◽

Gene Function ◽

Function Prediction ◽

Classification Method ◽

Gene Function Prediction

Accurate and efficient gene function prediction using a multi-bacterial network

Bioinformatics ◽

10.1093/bioinformatics/btaa885 ◽

2020 ◽

Author(s):

Jeffrey N Law ◽

Shiv D Kale ◽

T M Murali

Keyword(s):

Gene Function ◽

Bacterial Species ◽

Heterogeneous Data ◽

Function Prediction ◽

Label Propagation ◽

Supplementary Information ◽

Gene Function Prediction ◽

Functional Annotations ◽

A Genome ◽

Multiple Species

Abstract

Motivation: Nearly 40% of the genes in sequenced genomes have no experimentally or computationally derived functional annotations. To fill this gap, we seek to develop methods for network-based gene function prediction that can integrate heterogeneous data for multiple species with experimentally based functional annotations and systematically transfer them to newly sequenced organisms on a genome-wide scale. However, the large sizes of such networks pose a challenge for the scalability of current methods.

Results: We develop a label propagation algorithm called FastSinkSource. By formally bounding its rate of progress, we decrease the running time by a factor of 100 without sacrificing accuracy. We systematically evaluate many approaches to construct multi-species bacterial networks and apply FastSinkSource and other state-of-the-art methods to these networks. We find that the most accurate and efficient approach is to pre-compute annotation scores for species with experimental annotations, and then to transfer them to other organisms. In this manner, FastSinkSource runs in under 3 min for 200 bacterial species.

Availability and implementation: An implementation of our framework and all data used in this research are available at https://github.com/Murali-group/multi-species-GOA-prediction.

Supplementary information: Supplementary data are available at Bioinformatics online.
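For readers unfamiliar with label propagation on gene networks, the general idea can be illustrated with a minimal sketch. This is a generic propagation scheme, not the FastSinkSource algorithm or the authors' code: genes with a known annotation are held at score 1, known non-members at score 0, and every other gene is repeatedly set to the weighted average of its neighbors' scores. The function name `propagate_labels`, the dictionary layout, and the toy four-gene network are all assumptions made for illustration.

```python
def propagate_labels(neighbors, positives, negatives, iterations=100):
    """Generic label propagation (illustrative only, not FastSinkSource).

    neighbors: dict mapping each node to a dict {neighbor: edge_weight};
               every neighbor must also appear as a key.
    positives: nodes fixed at score 1.0 (annotated with the function).
    negatives: nodes fixed at score 0.0 (known not to have the function).
    """
    scores = {node: 0.0 for node in neighbors}
    for node in positives:
        scores[node] = 1.0
    fixed = set(positives) | set(negatives)

    for _ in range(iterations):
        updated = dict(scores)
        for node, edges in neighbors.items():
            if node in fixed:
                continue  # labeled nodes keep their score
            total = sum(edges.values())
            if total == 0:
                continue  # isolated node: nothing to average
            # Weighted average of the neighbors' scores from the previous pass.
            updated[node] = sum(w * scores[nbr] for nbr, w in edges.items()) / total
        scores = updated
    return scores


# Hypothetical toy network: a path a - b - c - d with unit edge weights.
network = {
    "a": {"b": 1.0},
    "b": {"a": 1.0, "c": 1.0},
    "c": {"b": 1.0, "d": 1.0},
    "d": {"c": 1.0},
}
scores = propagate_labels(network, positives=["a"], negatives=["d"])
print(round(scores["b"], 3), round(scores["c"], 3))  # prints: 0.667 0.333
```

In this toy example the unlabeled gene closer to the annotated gene receives the higher score, which is the behavior that network-based function prediction relies on. A fixed iteration count is used here for simplicity; how many iterations are needed in practice is exactly the kind of question the abstract's bound on the rate of progress addresses.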
