scholarly journals Predicting genes associated with RNA methylation pathways using machine learning

2021 ◽  
Author(s):  
Georgia Tsagkogeorga ◽  
Helena Santos Rosa ◽  
Andrej Alendar ◽  
Dan Leggate ◽  
Oliver Rausch ◽  
...  

RNA methylation plays an important role in functional regulation of RNAs, and has thus attracted an increasing interest in biology and drug discovery. Here, we collected and collated transcriptomic, proteomic, structural and physical interaction data from the Harmonizome database, and applied supervised machine learning to predict novel genes associated with RNA methylation pathways in human. We selected five types of classifiers, which we trained and evaluated using cross-validation on multiple training sets. The best models reached 88% accuracy based on cross-validation, and an average 91% accuracy on the test set. Using protein-protein interaction data, we propose six molecular sub-networks linking model predictions to previously known RNA methylation genes, with roles in mRNA methylation, tRNA processing, rRNA processing, but also protein and chromatin modifications. Our study exemplifies how access to large omics datasets joined by machine learning methods can be used to predict gene function.

Author(s):  
Hugo Willy

Recent breakthroughs in high throughput experiments to determine protein-protein interaction have generated a vast amount of protein interaction data. However, most of the experiments could only answer the question of whether two proteins interact but not the question on the mechanisms by which proteins interact. Such understanding is crucial for understanding the protein interaction of an organism as a whole (the interactome) and even predicting novel protein interactions. Protein interaction usually occurs at some specific sites on the proteins and, given their importance, they are usually well conserved throughout the evolution of the proteins of the same family. Based on this observation, a number of works on finding protein patterns/motifs conserved in interacting proteins have emerged in the last few years. Such motifs are collectively termed as the interaction motifs. This chapter provides a review on the different approaches on finding interaction motifs with a discussion on their implications, potentials and possible areas of improvements in the future.


2011 ◽  
Vol 16 (2) ◽  
Author(s):  
Dariusz Plewczynski ◽  
Tomas Klingström

AbstractStudying the interactome is one of the exciting frontiers of proteomics, as shown lately at the recent bioinformatics conferences (for example ISMB 2010, or ECCB 2010). Distribution of data is facilitated by a large number of databases. Metamining databases have been created in order to allow researchers access to several databases in one search, but there are serious difficulties for end users to evaluate the metamining effort. Therefore we suggest a new standard, “Good Interaction Data Metamining Practice” (GIDMP), which could be easily automated and requires only very minor inclusion of statistical data on each database homepage. Widespread adoption of the GIDMP standard would provide users with: a standardized way to evaluate the statistics provided by each metamining database, thus enhancing the end-user experiencea stable contact point for each database, allowing the smooth transition of statisticsa fully automated system, enhancing time- and cost-effectiveness.The proposed information can be presented as a few hidden lines of text on the source database www page, and a constantly updated table for a metamining database included in the source/credits web page.


Yeast ◽  
2001 ◽  
Vol 18 (6) ◽  
pp. 523-531 ◽  
Author(s):  
Haretsugu Hishigaki ◽  
Kenta Nakai ◽  
Toshihide Ono ◽  
Akira Tanigami ◽  
Toshihisa Takagi

Sign in / Sign up

Export Citation Format

Share Document