Evolutionary algorithm based on different semantic similarity functions for synonym recognition in the biomedical domain

10.31219/osf.io/zkam5 ◽

2017 ◽

Author(s):

José M. Chaves-González ◽

Jorge Martinez-Gil

Keyword(s):

Evolutionary Algorithm ◽

Semantic Similarity ◽

Pearson Correlation ◽

Similarity Metrics ◽

Biomedical Domain ◽

Similarity Functions ◽

New Approach ◽

Domain Specific ◽

Similarity Ratings ◽

Very High

One of the most challenging problems in the semantic web field consists of computing the semantic similarity between different terms. The problem here is the lack of accurate domain-specific dictionaries, such as biomedical, financial or any other particular and dynamic field. In this article we propose a new approach which uses different existing semantic similarity methods to obtain precise results in the biomedical domain. Specifically, we have developed an evolutionary algorithm which uses information provided by different semantic similarity metrics. Our results have been validated against a variety of biomedical datasets and different collections of similarity functions. The proposed system provides very high quality results when compared against similarity ratings provided by human experts (in terms of Pearson correlation coefficient) surpassing the results of other relevant works previously published in the literature.

Download Full-text

Using MEDLINE as Standard Corpus for Measuring Semantic Similarity in the Biomedical Domain

Sixth IEEE Symposium on BioInformatics and BioEngineering (BIBE'06) ◽

10.1109/bibe.2006.253295 ◽

2006 ◽

Cited By ~ 6

Author(s):

Hisham Al-Mubaid ◽

Hoa Nguyen

Keyword(s):

Semantic Similarity ◽

Biomedical Domain

Download Full-text

A Cluster-Based Approach for Semantic Similarity in the Biomedical Domain

2006 International Conference of the IEEE Engineering in Medicine and Biology Society ◽

10.1109/iembs.2006.4398006 ◽

2006 ◽

Cited By ~ 6

Author(s):

Hisham Al-Mubaid ◽

Hoa A. Nguyen

Keyword(s):

Semantic Similarity ◽

Biomedical Domain

Download Full-text

Adapting Gloss Vector Semantic Relatedness Measure for Semantic Similarity Estimation: An Evaluation in the Biomedical Domain

Semantic Technology - Lecture Notes in Computer Science ◽

10.1007/978-3-319-14122-0_11 ◽

2014 ◽

pp. 129-145 ◽

Cited By ~ 4

Author(s):

Ahmad Pesaranghader ◽

Azadeh Rezaei ◽

Ali Pesaranghader

Keyword(s):

Semantic Similarity ◽

Semantic Relatedness ◽

Biomedical Domain ◽

Similarity Estimation

Download Full-text

A framework for unifying ontology-based semantic similarity measures: A study in the biomedical domain

Journal of Biomedical Informatics ◽

10.1016/j.jbi.2013.11.006 ◽

2014 ◽

Vol 48 ◽

pp. 38-53 ◽

Cited By ~ 76

Author(s):

Sébastien Harispe ◽

David Sánchez ◽

Sylvie Ranwez ◽

Stefan Janaqi ◽

Jacky Montmain

Keyword(s):

Semantic Similarity ◽

Similarity Measures ◽

Biomedical Domain

Download Full-text

Semantic Similarity Measures between Terms in the Biomedical Domain within frame work Unified Medical Language System (UMLS)

International Journal of Computer Applications Technology and Research ◽

10.7753/ijcatr0708.1007 ◽

2018 ◽

Vol 7 (8) ◽

pp. 331-340

Author(s):

Abdelhakeem M. B. Abdelrahman ◽

Dr. Ahmad Kayed

Keyword(s):

Semantic Similarity ◽

Similarity Measures ◽

Biomedical Domain ◽

Language System ◽

Unified Medical Language System ◽

Medical Language ◽

Frame Work

Download Full-text

HESML: a real-time semantic measures library for the biomedical domain with a reproducible survey

BMC Bioinformatics ◽

10.1186/s12859-021-04539-0 ◽

2022 ◽

Vol 23 (1) ◽

Author(s):

Juan J. Lastra-Díaz ◽

Alicia Lara-Clares ◽

Ana Garcia-Serrano

Keyword(s):

Real Time ◽

Semantic Similarity ◽

Similarity Measure ◽

Shortest Path ◽

State Of The Art ◽

Biomedical Domain ◽

Snomed Ct ◽

Shortest Path Algorithm ◽

Semantic Similarity Measure ◽

Current State

Abstract Background Ontology-based semantic similarity measures based on SNOMED-CT, MeSH, and Gene Ontology are being extensively used in many applications in biomedical text mining and genomics respectively, which has encouraged the development of semantic measures libraries based on the aforementioned ontologies. However, current state-of-the-art semantic measures libraries have some performance and scalability drawbacks derived from their ontology representations based on relational databases, or naive in-memory graph representations. Likewise, a recent reproducible survey on word similarity shows that one hybrid IC-based measure which integrates a shortest-path computation sets the state of the art in the family of ontology-based semantic measures. However, the lack of an efficient shortest-path algorithm for their real-time computation prevents both their practical use in any application and the use of any other path-based semantic similarity measure. Results To bridge the two aforementioned gaps, this work introduces for the first time an updated version of the HESML Java software library especially designed for the biomedical domain, which implements the most efficient and scalable ontology representation reported in the literature, together with a new method for the approximation of the Dijkstra’s algorithm for taxonomies, called Ancestors-based Shortest-Path Length (AncSPL), which allows the real-time computation of any path-based semantic similarity measure. Conclusions We introduce a set of reproducible benchmarks showing that HESML outperforms by several orders of magnitude the current state-of-the-art libraries in the three aforementioned biomedical ontologies, as well as the real-time performance and approximation quality of the new AncSPL shortest-path algorithm. Likewise, we show that AncSPL linearly scales regarding the dimension of the common ancestor subgraph regardless of the ontology size. Path-based measures based on the new AncSPL algorithm are up to six orders of magnitude faster than their exact implementation in large ontologies like SNOMED-CT and GO. Finally, we provide a detailed reproducibility protocol and dataset as supplementary material to allow the exact replication of all our experiments and results.

Download Full-text