From Biomedical Literature to Knowledge: Mining Protein-Protein Interactions

Forms and consequences of incompatibility

Mitonuclear Ecology ◽

10.1093/oso/9780198818250.003.0002 ◽

2019 ◽

pp. 20-48

Author(s):

Geoffrey E. Hill

Keyword(s):

Mitochondrial Dna ◽

Electron Transport ◽

Mitochondrial Dysfunction ◽

Protein Interactions ◽

Biomedical Literature ◽

Nuclear Genes ◽

Extensive Literature ◽

Protein Protein Interactions ◽

Growing Body ◽

Evolutionary Consequences

To understand the evolutionary consequences of poor coadaptation of mitochondrial and nuclear genes, it is necessary to consider in molecular detail the manifestations of mitochondrial dysfunction. Most considerations of mitochondrial dysfunction resulting from mitonuclear incompatibilities focus on protein–protein interactions in the electron transport system, but the interactions of mitochondrial and nuclear genes in enabling the transcription, translation, and replication of mitochondrial DNA can play an equally important role in mitonuclear coevolution and coadaptation. This chapter reviews the extensive literature on how mitochondrial dysfunction is the cause of many inherited human diseases and explains how this biomedical literature connects to a rapidly growing body of research on the evolution and maintenance of coadaptation of mitochondrial and nuclear genes among non-human eukaryotes. The goal of the chapter is to establish the fundamental importance of coadaptation between co-functioning mitochondrial and nuclear genes.

Download Full-text

UniProt-Related Documents (UniReD): assisting wet lab biologists in their quest on finding novel counterparts in a protein network

NAR Genomics and Bioinformatics ◽

10.1093/nargab/lqaa005 ◽

2020 ◽

Vol 2 (1) ◽

Cited By ~ 1

Author(s):

Theodosios Theodosiou ◽

Nikolaos Papanikolaou ◽

Maria Savvaki ◽

Giulia Bonetto ◽

Stella Maxouri ◽

...

Keyword(s):

Protein Interactions ◽

Computational Prediction ◽

Biomedical Literature ◽

Protein Network ◽

Protein Protein Interactions ◽

High Coverage ◽

Depth Study ◽

Experimental Approaches ◽

Wet Lab ◽

User Friendly

Abstract The in-depth study of protein–protein interactions (PPIs) is of key importance for understanding how cells operate. Therefore, in the past few years, many experimental as well as computational approaches have been developed for the identification and discovery of such interactions. Here, we present UniReD, a user-friendly, computational prediction tool which analyses biomedical literature in order to extract known protein associations and suggest undocumented ones. As a proof of concept, we demonstrate its usefulness by experimentally validating six predicted interactions and by benchmarking it against public databases of experimentally validated PPIs succeeding a high coverage. We believe that UniReD can become an important and intuitive resource for experimental biologists in their quest for finding novel associations within a protein network and a useful tool to complement experimental approaches (e.g. mass spectrometry) by producing sorted lists of candidate proteins for further experimental validation. UniReD is available at http://bioinformatics.med.uoc.gr/unired/

Download Full-text

HMNPPID—human malignant neoplasm protein–protein interaction database

Human Genomics ◽

10.1186/s40246-019-0223-5 ◽

2019 ◽

Vol 13 (S1) ◽

Author(s):

Qingqing Li ◽

Zhihao Yang ◽

Zhehuan Zhao ◽

Ling Luo ◽

Zhiheng Li ◽

...

Keyword(s):

Protein Interaction ◽

Protein Interactions ◽

Molecular Mechanisms ◽

Malignant Neoplasm ◽

Biomedical Literature ◽

Malignant Neoplasms ◽

Protein Protein Interactions ◽

Protein Protein Interaction ◽

Interaction Database ◽

Protein Interaction Database

Abstract Background Protein–protein interaction (PPI) information extraction from biomedical literature helps unveil the molecular mechanisms of biological processes. Especially, the PPIs associated with human malignant neoplasms can unveil the biology behind these neoplasms. However, such PPI database is not currently available. Results In this work, a database of protein–protein interactions associated with 171 kinds of human malignant neoplasms named HMNPPID is constructed. In addition, a visualization program, named VisualPPI, is provided to facilitate the analysis of the PPI network for a specific neoplasm. Conclusions HMNPPID can hopefully become an important resource for the research on PPIs of human malignant neoplasms since it provides readily available data for healthcare professionals. Thus, they do not need to dig into a large amount of biomedical literatures any more, which may accelerate the researches on the PPIs of malignant neoplasms.

Download Full-text

Using biomedical literature mining to consolidate the set of known human protein-protein interactions

10.3115/1641484.1641491 ◽

2005 ◽

Cited By ~ 3

Author(s):

Arun Ramani ◽

Edward Marcotte ◽

Razvan Bunescu ◽

Raymond Mooney

Keyword(s):

Protein Interactions ◽

Biomedical Literature ◽

Human Protein ◽

Literature Mining ◽

Protein Protein Interactions ◽

Biomedical Literature Mining

Download Full-text

An Image-Text Approach for Extracting Experimental Evidence of Protein-Protein Interactions in the Biomedical Literature

Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics - BCB'13 ◽

10.1145/2506583.2506585 ◽

2013 ◽

Author(s):

Luis D. Lopez ◽

Jingyi Yu ◽

Cecilia N. Arighi ◽

Manabu Torii ◽

K. Vijay-Shanker ◽

...

Keyword(s):

Experimental Evidence ◽

Protein Interactions ◽

Biomedical Literature ◽

Protein Protein Interactions

Download Full-text

Discovering novel protein–protein interactions by measuring the protein semantic similarity from the biomedical literature

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720014420086 ◽

2014 ◽

Vol 12 (06) ◽

pp. 1442008 ◽

Cited By ~ 4

Author(s):

Jung-Hsien Chiang ◽

Jiun-Huang Ju

Keyword(s):

Semantic Similarity ◽

Protein Interactions ◽

Similarity Measures ◽

Biomedical Literature ◽

Biological Research ◽

Protein Protein Interactions ◽

Automated Identification ◽

Learning Classifier ◽

Novel Method ◽

Novel Protein

Protein–protein interactions (PPIs) are involved in the majority of biological processes. Identification of PPIs is therefore one of the key aims of biological research. Although there are many databases of PPIs, many other unidentified PPIs could be buried in the biomedical literature. Therefore, automated identification of PPIs from biomedical literature repositories could be used to discover otherwise hidden interactions. Search engines, such as Google, have been successfully applied to measure the relatedness among words. Inspired by such approaches, we propose a novel method to identify PPIs through semantic similarity measures among protein mentions. We define six semantic similarity measures as features based on the page counts retrieved from the MEDLINE database. A machine learning classifier, Random Forest, is trained using the above features. The proposed approach achieve an averaged micro-F of 71.28% and an averaged macro-F of 64.03% over five PPI corpora, an improvement over the results of using only the conventional co-occurrence feature (averaged micro-F of 68.79% and an averaged macro-F of 60.49%). A relation-word reinforcement further improves the averaged micro-F to 71.3% and averaged macro-F to 65.12%. Comparing the results of the current work with other studies on the AIMed corpus (ranging from 77.58% to 85.1% in micro-F, 62.18% to 76.27% in macro-F), we show that the proposed approach achieves micro-F of 81.88% and macro-F of 64.01% without the use of sophisticated feature extraction. Finally, we manually examine the newly discovered PPI pairs based on a literature review, and the results suggest that our approach could extract novel protein–protein interactions.

Download Full-text

Detecting experimental techniques and selecting relevant documents for protein-protein interactions from biomedical literature

BMC Bioinformatics ◽

10.1186/1471-2105-12-s8-s11 ◽

2011 ◽

Vol 12 (S8) ◽

Cited By ~ 16

Author(s):

Xinglong Wang ◽

Rafal Rak ◽

Angelo Restificar ◽

Chikashi Nobata ◽

CJ Rupp ◽

...

Keyword(s):

Protein Interactions ◽

Biomedical Literature ◽

Experimental Techniques ◽

Protein Protein Interactions

Download Full-text

MINING GENE-DISEASE RELATIONSHIPS FROM BIOMEDICAL LITERATURE: WEIGHTING PROTEIN–PROTEIN INTERACTIONS AND CONNECTIVITY MEASURES

Biocomputing 2007 ◽

10.1142/9789812772435_0004 ◽

2006 ◽

Cited By ~ 3

Author(s):

GRACIELA GONZALEZ ◽

JUAN C. URIBE ◽

LUIS TARI ◽

COLLEEN BROPHY ◽

CHITTA BARAL

Keyword(s):

Protein Interactions ◽

Biomedical Literature ◽

Protein Protein Interactions

Download Full-text

A Hybrid Deep Learning Model for Protein–Protein Interactions Extraction from Biomedical Literature

Applied Sciences ◽

10.3390/app10082690 ◽

2020 ◽

Vol 10 (8) ◽

pp. 2690 ◽

Cited By ~ 1

Author(s):

Changqin Quan ◽

Zhiwei Luo ◽

Song Wang

Keyword(s):

Deep Learning ◽

Protein Interactions ◽

Short Term Memory ◽

Channel Model ◽

Biomedical Literature ◽

Hybrid Architecture ◽

Protein Protein Interactions ◽

Long Distance ◽

Proposed Model ◽

Bidirectional Lstm

The exponentially increasing size of biomedical literature and the limited ability of manual curators to discover protein–protein interactions (PPIs) in text has led to delays in keeping PPI databases updated with the current findings. The state-of-the-art text mining methods for PPI extraction are primarily based on deep learning (DL) models, and the performance of a DL-based method is mainly affected by the architecture of DL models and the feature embedding methods. In this study, we compared different architectures of DL models, including convolutional neural networks (CNN), long short-term memory (LSTM), and hybrid models, and proposed a hybrid architecture of a bidirectional LSTM+CNN model for PPI extraction. Pretrained word embedding and shortest dependency path (SDP) embedding are fed into a two-embedding channel model, such that the model is able to model long-distance contextual information and can capture the local features and structure information effectively. The experimental results showed that the proposed model is superior to the non-hybrid DL models, and the hybrid CNN+Bidirectional LSTM model works well for PPI extraction. The visualization and comparison of the hidden features learned by different DL models further confirmed the effectiveness of the proposed model.

Download Full-text

Towards extracting supporting information about predicted protein-protein interactions

10.1101/031591 ◽

2015 ◽

Author(s):

Adam Roth ◽

Sandeep Subramanian ◽

Madhavi Ganapathiraju

Keyword(s):

Semantic Similarity ◽

Protein Interactions ◽

Contextual Information ◽

Relation Extraction ◽

Angular Distance ◽

Biomedical Literature ◽

Protein Protein Interactions ◽

Real World Application ◽

Evidential Support ◽

Ppi Prediction

One of the goals of relation extraction is to identify protein-protein interactions (PPIs) in biomedical literature. Current systems are capturing binary relations and also the direction and type of an interaction. Besides assisting in the curation PPIs into databases, there has been little real-world application of these algorithms. We describe UPSITE, a text mining tool for extracting evidence in support of a hypothesized interaction. Given a predicted PPI, UPSITE uses a binary relation detector to check whether a PPI is found in abstracts in PubMed. If it is not found, UPSITE retrieves documents relevant to each of the two proteins separately, and extracts contextual information about biological events surrounding each protein, and calculates semantic similarity of the two proteins to provide evidential support for the predicted PPI. In evaluations, relation extraction achieved an Fscore of 0.88 on the HPRD50 corpus, and semantic similarity measured with angular distance was found to be statistically significant. With the development of PPI prediction algorithms, the burden of interpreting the validity and relevance of novel PPIs is on biologists. We suggest that presenting annotations of the two proteins in a PPI side-by-side and a score that quantifies their similarity lessens this burden to some extent.

Download Full-text