Automatic concept recognition using the Human Phenotype Ontology reference and test suite corpora

T. Groza; S. Kohler; S. Doelken; N. Collier; A. Oellrich; D. Smedley; F. M. Couto; G. Baynam; A. Zankl; P. N. Robinson

doi:10.1093/database/bav005

Database ◽

10.1093/database/bav005 ◽

2015 ◽

Vol 2015 (0) ◽

pp. bav005 ◽

Cited By ~ 31

Author(s):

T. Groza ◽

S. Kohler ◽

S. Doelken ◽

N. Collier ◽

A. Oellrich ◽

...

Keyword(s):

Human Phenotype Ontology ◽

Test Suite ◽

Phenotype Ontology ◽

Concept Recognition ◽

Human Phenotype

PhenoTagger: a hybrid method for phenotype concept recognition using human phenotype ontology

Bioinformatics ◽

10.1093/bioinformatics/btab019 ◽

2021 ◽

Author(s):

Ling Luo ◽

Shankai Yan ◽

Po-Ting Lai ◽

Daniel Veltri ◽

Andrew Oler ◽

...

Keyword(s):

Machine Learning ◽

Hybrid Method ◽

Human Phenotype Ontology ◽

Training Data ◽

Supplementary Information ◽

Training Dataset ◽

Biomedical Text ◽

Phenotype Ontology ◽

Concept Recognition ◽

Human Phenotype

Abstract

Motivation: Automatic phenotype concept recognition from unstructured text remains a challenging task in biomedical text mining research. Previous work on the task typically uses dictionary-based matching methods, which can achieve high precision but suffer from lower recall. Recently, machine learning-based methods have been proposed to identify biomedical concepts; through automatic feature learning they can recognize more unseen concept synonyms. However, most of these methods require large corpora of manually annotated data for model training, which are difficult to obtain because of the high cost of human annotation.

Results: In this article, we propose PhenoTagger, a hybrid method that combines dictionary-based and machine learning-based methods to recognize Human Phenotype Ontology (HPO) concepts in unstructured biomedical text. We first use all concepts and synonyms in HPO to construct a dictionary, which is then used to automatically build a distantly supervised training dataset for machine learning. Next, a deep learning model is trained to classify each candidate phrase (an n-gram from the input sentence) into a corresponding concept label. Finally, the dictionary-based and machine learning-based prediction results are combined for improved performance. Our method is validated on two HPO corpora, and the results show that PhenoTagger compares favorably to previous methods. In addition, to demonstrate the generalizability of our method and to investigate the effect of training on a different ontology, we retrained PhenoTagger using the disease ontology MEDIC for disease concept recognition. Experimental results on the NCBI disease corpus show that PhenoTagger, without requiring manually annotated training data, achieves performance competitive with state-of-the-art supervised methods.

Availability and implementation: The source code, API information and data for PhenoTagger are freely available at https://github.com/ncbi-nlp/PhenoTagger.

Supplementary information: Supplementary data are available at Bioinformatics online.
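The dictionary stage described in the abstract (building a lookup from HPO labels and synonyms, then checking every candidate n-gram of a sentence against it) can be illustrated with a minimal sketch. This is not PhenoTagger's code: the names `TOY_HPO`, `candidate_ngrams` and `tag` are hypothetical, the dictionary is a hand-written toy whose identifiers should be checked against a current HPO release, and the learned classifier and the step that combines both predictions are not shown.

```python
from typing import Dict, Iterator, List, Tuple

# Toy dictionary (assumption): lower-cased label or synonym -> HPO identifier.
# A real system would load every label and synonym from an HPO release.
TOY_HPO: Dict[str, str] = {
    "seizure": "HP:0001250",
    "seizures": "HP:0001250",
    "microcephaly": "HP:0000252",
    "small head": "HP:0000252",  # hypothetical synonym entry for illustration
}


def candidate_ngrams(tokens: List[str], max_len: int = 3) -> Iterator[Tuple[int, int, str]]:
    """Yield (start, end, phrase) for every n-gram of up to max_len tokens."""
    for start in range(len(tokens)):
        last_end = min(start + max_len, len(tokens))
        for end in range(start + 1, last_end + 1):
            yield start, end, " ".join(tokens[start:end])


def tag(sentence: str, dictionary: Dict[str, str] = TOY_HPO, max_len: int = 3) -> List[Tuple[str, str]]:
    """Return (phrase, concept_id) pairs found by exact dictionary lookup."""
    # Very rough tokenisation: lower-case, drop commas and full stops, split on spaces.
    tokens = sentence.lower().replace(",", " ").replace(".", " ").split()

    hits = [
        (start, end, phrase, dictionary[phrase])
        for start, end, phrase in candidate_ngrams(tokens, max_len)
        if phrase in dictionary
    ]

    # Prefer longer matches and discard any match that overlaps one already kept.
    hits.sort(key=lambda h: (-(h[1] - h[0]), h[0]))
    used_positions = set()
    kept = []
    for start, end, phrase, concept_id in hits:
        span = range(start, end)
        if used_positions.isdisjoint(span):
            used_positions.update(span)
            kept.append((start, phrase, concept_id))

    # Report matches in the order they appear in the sentence.
    kept.sort()
    return [(phrase, concept_id) for _, phrase, concept_id in kept]


if __name__ == "__main__":
    print(tag("The patient had seizures and a small head."))
```

Exact lookup of this kind only finds phrases that are already in the dictionary, which is the recall limitation the abstract attributes to dictionary-based methods and the reason a learned classifier over the same candidate n-grams is added in the hybrid approach.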

Predicting genes from phenotypes using Human Phenotype Ontology (HPO) terms

Molecular Genetics and Metabolism ◽

10.1016/s1096-7192(21)00318-8 ◽

2021 ◽

Vol 132 ◽

pp. S149

Author(s):

Anne Slavotinek ◽

Hannah Prasad ◽

Hannah Hoban ◽

Tiffany Yip ◽

Shannon Rego ◽

...

Keyword(s):

Human Phenotype Ontology ◽

Phenotype Ontology ◽

Human Phenotype

Unique insights from ClinicalTrials.gov by mining protein mutations and RSids in addition to applying the Human Phenotype Ontology v1 (protocols.io.bfacjiaw)

protocols.io ◽

10.17504/protocols.io.bfacjiaw ◽

2020 ◽

Author(s):

Shray Alag

Keyword(s):

Human Phenotype Ontology ◽

Phenotype Ontology ◽

Human Phenotype ◽

Protein Mutations

Biological and Medical Ontologies: Human Phenotype Ontology (HPO)

Encyclopedia of Bioinformatics and Computational Biology ◽

10.1016/b978-0-12-809633-8.20398-1 ◽

2019 ◽

pp. 848-857

Author(s):

Anna Bernasconi ◽

Marco Masseroli

Keyword(s):

Human Phenotype Ontology ◽

Phenotype Ontology ◽

Human Phenotype

Unique insights from ClinicalTrials.gov by mining protein mutations and RSids in addition to applying the Human Phenotype Ontology

PLoS ONE ◽

10.1371/journal.pone.0233438 ◽

2020 ◽

Vol 15 (5) ◽

pp. e0233438

Author(s):

Shray Alag

Keyword(s):

Human Phenotype Ontology ◽

Phenotype Ontology ◽

Human Phenotype ◽

Protein Mutations

Decomposing Phenotype Descriptions for the Human Skeletal Phenome

Biomedical Informatics Insights ◽

10.4137/bii.s10729 ◽

2013 ◽

Vol 6 ◽

pp. BII.S10729

Author(s):

Tudor Groza ◽

Jane Hunter ◽

Andreas Zankl

Keyword(s):

Experimental Study ◽

Intrinsic Value ◽

Human Phenotype Ontology ◽

Phenotype Ontology ◽

Human Phenotype ◽

Generic Concept ◽

Meta Model ◽

Processing Pipeline ◽

Skeletal Phenotype ◽

Automatic Decomposition

Over the course of the last few years there has been a significant amount of research performed on ontology-based formalization of phenotype descriptions. The intrinsic value and knowledge captured within such descriptions can only be expressed by taking advantage of their inner structure that implicitly combines qualities and anatomical entities. We present a meta-model (the Phenotype Fragment Ontology) and a processing pipeline that enable together the automatic decomposition and conceptualization of phenotype descriptions for the human skeletal phenome. We use this approach to showcase the usefulness of the generic concept of phenotype decomposition by performing an experimental study on all skeletal phenotype concepts defined in the Human Phenotype Ontology.

Annotating Diseases Using Human Phenotype Ontology Improves Prediction of Disease-Associated Long Non-coding RNAs

Journal of Molecular Biology ◽

10.1016/j.jmb.2018.05.006 ◽

2018 ◽

Vol 430 (15) ◽

pp. 2219-2230 ◽

Cited By ~ 8

Author(s):

Duc-Hau Le ◽

Lan T.M. Dao

Keyword(s):

Human Phenotype Ontology ◽

Phenotype Ontology ◽

Human Phenotype ◽

Non-coding RNAs

Integrating exome and whole genome analysis with the Human Phenotype Ontology for discovery of new genes in rare eye diseases

Acta Ophthalmologica ◽

10.1111/j.1755-3768.2019.5001 ◽

2019 ◽

Vol 97 (S263) ◽

Author(s):

Nikolas Pontikos

Keyword(s):

Genome Analysis ◽

Eye Diseases ◽

Human Phenotype Ontology ◽

Whole Genome ◽

Phenotype Ontology ◽

Whole Genome Analysis ◽

Human Phenotype ◽

New Genes

Human phenotype ontology annotation and cluster analysis to unravel genetic defects in 707 cases with unexplained bleeding and platelet disorders

Genome Medicine ◽

10.1186/s13073-015-0151-5 ◽

2015 ◽

Vol 7 (1) ◽

Cited By ~ 75

Author(s):

Sarah K Westbury ◽

Ernest Turro ◽

Daniel Greene ◽

Claire Lentaigne ◽

...

Keyword(s):

Cluster Analysis ◽

Human Phenotype Ontology ◽

Genetic Defects ◽

Phenotype Ontology ◽

Human Phenotype ◽

Platelet Disorders

The Human Phenotype Ontology: Semantic Unification of Common and Rare Disease

The American Journal of Human Genetics ◽

10.1016/j.ajhg.2015.05.020 ◽

2015 ◽

Vol 97 (1) ◽

pp. 111-124 ◽

Cited By ~ 125

Author(s):

Tudor Groza ◽

Sebastian Köhler ◽

Dawid Moldenhauer ◽

Nicole Vasilevsky ◽

Gareth Baynam ◽

...

Keyword(s):

Rare Disease ◽

Human Phenotype Ontology ◽

Phenotype Ontology ◽

Human Phenotype ◽

Semantic Unification
