scholarly journals Polysemy and Synonymy Detection in Ontology Engineering

Polysemy, when a single term has multiple meanings, and synonymy, when multiple terms have the same meaning, are common phenomena in linguistics as well as in scientific knowledge. In ontology engineering, it is vital to detect the synonyms annotations and the multiple inheritances because of polysemy. The persistence of these issues in the semantic description of a knowledge domain causes problematic interoperability and data processing. The disambiguation of the entities, properties and relationships sense in a semantic web ontology significantly improves linked data generation and information retrieval. We explore the synonymy and polysemy in the setting of a cardiology terminology generated from textbooks on the basis of field coverage, professionals’ associations’ recommendations and bibliometrics, for the building of a cardiologic ontology. From 56,134 terms collected we found that 67.7% were unique. The indexed terms included single words, compound words and multi-word expressions. The frequency of their appearances in the combined master index was calculated and used as a marker of their significance. To cope with the linguistic polysemy and synonymy of terms, we examined them in WordNet, MeSH and BioPortal, as well as by latent semantic analysis (LSA) through singular value decomposition (SVD). Through these approaches we managed to identify and decipher semantic associations and relationships between the terms. We proposed a roadmap for ontology building from scratch by utilizing intrinsic and extrinsic knowledge resources and reuse of metadata. We anticipate that this approach is applicable in ontology engineering of different knowledge domains for relationships setting and linked data contextualization

2018 ◽  
Vol 2 (2) ◽  
pp. 70-82 ◽  
Author(s):  
Binglu Wang ◽  
Yi Bu ◽  
Win-bin Huang

AbstractIn the field of scientometrics, the principal purpose for author co-citation analysis (ACA) is to map knowledge domains by quantifying the relationship between co-cited author pairs. However, traditional ACA has been criticized since its input is insufficiently informative by simply counting authors’ co-citation frequencies. To address this issue, this paper introduces a new method that reconstructs the raw co-citation matrices by regarding document unit counts and keywords of references, named as Document- and Keyword-Based Author Co-Citation Analysis (DKACA). Based on the traditional ACA, DKACA counted co-citation pairs by document units instead of authors from the global network perspective. Moreover, by incorporating the information of keywords from cited papers, DKACA captured their semantic similarity between co-cited papers. In the method validation part, we implemented network visualization and MDS measurement to evaluate the effectiveness of DKACA. Results suggest that the proposed DKACA method not only reveals more insights that are previously unknown but also improves the performance and accuracy of knowledge domain mapping, representing a new basis for further studies.


Author(s):  
Xiang Zhang ◽  
Erjing Lin ◽  
Yulian Lv

In this article, the authors propose a novel search model: Multi-Target Search (MT search in brief). MT search is a keyword-based search model on Semantic Associations in Linked Data. Each search contains multiple sub-queries, in which each sub-query represents a certain user need for a certain object in a group relationship. They first formularize the problem of association search, and then introduce their approach to discover Semantic Associations in large-scale Linked Data. Next, they elaborate their novel search model, the notion of Virtual Document they use to extract linguistic features, and the details of search process. The authors then discuss the way search results are organized and summarized. Quantitative experiments are conducted on DBpedia to validate the effectiveness and efficiency of their approach.


2020 ◽  
Vol 18 (01) ◽  
pp. 2050029 ◽  
Author(s):  
Feng Bai ◽  
Yi Wang

This paper presents a hybrid snapshot simulation methodology to accelerate the generation of high-quality data for proper orthogonal decomposition (POD) and reduced-order model (ROM) development. The entire span of the snapshot simulation is divided into multiple intervals, each simulated by either high-fidelity full-order model (FOM) or fast local ROM. The simulation then alternates between FOM and local ROM to accelerate snapshot data generation while maintaining the data fidelity and representation. Model switch is determined on-the-fly by evaluating several criteria that monitor the dominance of leading POD modes and ROM trajectory. The incremental singular value decomposition (iSVD) is employed to continuously update ROMs for enhanced accuracy and utilization. A global ROM broadly applicable to various online simulation is immediately available at the end of the simulation. The hybrid snapshot simulation demonstrates excellent accuracy ([Formula: see text] error) and 2.09–2.6[Formula: see text]X speedup relative to its traditional counterpart. The constructed ROMs also preserve salient accuracy ([Formula: see text] error). The results prove feasibility of the proposed method for robust and efficient snapshot data generation and ROM development.


Author(s):  
Anastasia Dimou ◽  
Gerald Haesendonck ◽  
Martin Vanbrabant ◽  
Laurens De Vocht ◽  
Ruben Verborgh ◽  
...  

2019 ◽  
Vol 46 (3) ◽  
pp. 419-433 ◽  
Author(s):  
Övünç Öztürk ◽  
Tuğba Özacar

This article is a proof-of-concept case study to evaluate the functionality of a block metaphor–based linked data generator. In this work, we chose to produce linked data repository of recipes, which provide a medium for people to share their regional and healthy recipes with the masses. However, the same approach can also be adapted easily to other domains. Therefore, the applicability of our approach extends well beyond the food domain that we are considering in this article. As a medium for information sharing and understanding between heterogeneous systems, ontologies will play an important role in the realisation of the Internet of things (IoT) vision. Therefore, an ontology-based recipe repository would also be one of the basic blocks of a smart kitchen environment. However, building ontologies is a challenging task, especially for users who are not conversant in the ontology building languages. This article proposes an approach that can be used even by non-experts and facilitates the sharing and searching of recipe data. In our case, we exploit the features of the block paradigm to publish recipes in Linked Data format. In this way, users do not have to know the OWL (Web Ontology Language) syntax and the text input is kept minimal. As far as we know, this article is the first study that produces linked data using Blockly in the literature. We also conducted a user-based evaluation of the proposed approach using the System Usability Scale (SUS) questionnaire.


2011 ◽  
Vol 2 (3) ◽  
pp. 21-31 ◽  
Author(s):  
Arup Sarkar ◽  
Ujjal Marjit ◽  
Utpal Biswas

Author(s):  
Carlo Schwarz

In this article, I present the lsemantica command, which implements latent semantic analysis in Stata. Latent semantic analysis is a machine learning algorithm for word and text similarity comparison and uses truncated singular value decomposition to derive the hidden semantic relationships between words and texts. lsemantica provides a simple command for latent semantic analysis as well as complementary commands for text similarity comparison.


2013 ◽  
Vol 2013 ◽  
pp. 1-8 ◽  
Author(s):  
Jengnan Tzeng

The singular value decomposition (SVD) is a fundamental matrix decomposition in linear algebra. It is widely applied in many modern techniques, for example, high- dimensional data visualization, dimension reduction, data mining, latent semantic analysis, and so forth. Although the SVD plays an essential role in these fields, its apparent weakness is the order three computational cost. This order three computational cost makes many modern applications infeasible, especially when the scale of the data is huge and growing. Therefore, it is imperative to develop a fast SVD method in modern era. If the rank of matrix is much smaller than the matrix size, there are already some fast SVD approaches. In this paper, we focus on this case but with the additional condition that the data is considerably huge to be stored as a matrix form. We will demonstrate that this fast SVD result is sufficiently accurate, and most importantly it can be derived immediately. Using this fast method, many infeasible modern techniques based on the SVD will become viable.


2020 ◽  
Vol 2020 ◽  
pp. 1-12
Author(s):  
Yongwei Liu ◽  
Xiaoshu Cao ◽  
Tao Li

Research on the influence of accessibility on land use and landscape patterns is one of the most important subfields in landscape ecology and transportation geography. In this review article, we use CiteSpace and VOSviewer to analyze relevant information, including the number of published papers, highly cited literature, high-frequency keywords, periodicals, and the leading countries conducting research on this particular field. Based on the mapping knowledge domain theory and summarizing method, this research, using an extensive review of the existing literature to analyze the influence of accessibility on land use and landscape patterns, the following conclusions have been reached: first, most of the relevant studies are conducted by applying theories on landscape ecology rather than on transportation geography, and the measure index of accessibility is relatively simple. Second, while accessibility has played a key role in analyzing the interactions between transportation, land use, and landscape patterns, studies on the long-term effect of transportation on land use and land patterns are extremely important. Also, different road types have been found to impose different effects. Third, research on the functional landscape in inner cities has become a significant research focus, particularly with the progress in big data. And fourth, improvements in data acquisition and processing have greatly benefited the field, specifically with recent advancements in GIS and RS technology. However, studies on landscape patterns with regional perspectives have largely been insufficient, especially those conducted over long time scales.


Author(s):  
Wouter Maroy ◽  
Anastasia Dimou ◽  
Dimitris Kontokostas ◽  
Ben De Meester ◽  
Ruben Verborgh ◽  
...  
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document