Polysemy and Synonymy Detection in Ontology Engineering

WSEAS TRANSACTIONS ON INFORMATION SCIENCE AND APPLICATIONS ◽

10.37394/23209.2020.17.14 ◽

2020 ◽

Vol 17 ◽

Keyword(s):

Linked Data ◽

Semantic Analysis ◽

Ontology Engineering ◽

Data Generation ◽

Knowledge Domain ◽

Knowledge Resources ◽

Knowledge Domains ◽

Single Term ◽

Semantic Associations ◽

Value Decomposition

Polysemy, when a single term has multiple meanings, and synonymy, when multiple terms have the same meaning, are common phenomena in linguistics as well as in scientific knowledge. In ontology engineering, it is vital to detect the synonyms annotations and the multiple inheritances because of polysemy. The persistence of these issues in the semantic description of a knowledge domain causes problematic interoperability and data processing. The disambiguation of the entities, properties and relationships sense in a semantic web ontology significantly improves linked data generation and information retrieval. We explore the synonymy and polysemy in the setting of a cardiology terminology generated from textbooks on the basis of field coverage, professionals’ associations’ recommendations and bibliometrics, for the building of a cardiologic ontology. From 56,134 terms collected we found that 67.7% were unique. The indexed terms included single words, compound words and multi-word expressions. The frequency of their appearances in the combined master index was calculated and used as a marker of their significance. To cope with the linguistic polysemy and synonymy of terms, we examined them in WordNet, MeSH and BioPortal, as well as by latent semantic analysis (LSA) through singular value decomposition (SVD). Through these approaches we managed to identify and decipher semantic associations and relationships between the terms. We proposed a roadmap for ontology building from scratch by utilizing intrinsic and extrinsic knowledge resources and reuse of metadata. We anticipate that this approach is applicable in ontology engineering of different knowledge domains for relationships setting and linked data contextualization

Download Full-text

Document- and Keyword-based Author Co-citation Analysis

Data and Information Management ◽

10.2478/dim-2018-0009 ◽

2018 ◽

Vol 2 (2) ◽

pp. 70-82 ◽

Cited By ~ 2

Author(s):

Binglu Wang ◽

Yi Bu ◽

Win-bin Huang

Keyword(s):

Citation Analysis ◽

Semantic Similarity ◽

Method Validation ◽

New Method ◽

Global Network ◽

Network Visualization ◽

Knowledge Domain ◽

Knowledge Domains ◽

Domain Mapping ◽

The Relationship

AbstractIn the field of scientometrics, the principal purpose for author co-citation analysis (ACA) is to map knowledge domains by quantifying the relationship between co-cited author pairs. However, traditional ACA has been criticized since its input is insufficiently informative by simply counting authors’ co-citation frequencies. To address this issue, this paper introduces a new method that reconstructs the raw co-citation matrices by regarding document unit counts and keywords of references, named as Document- and Keyword-Based Author Co-Citation Analysis (DKACA). Based on the traditional ACA, DKACA counted co-citation pairs by document units instead of authors from the global network perspective. Moreover, by incorporating the information of keywords from cited papers, DKACA captured their semantic similarity between co-cited papers. In the method validation part, we implemented network visualization and MDS measurement to evaluate the effectiveness of DKACA. Results suggest that the proposed DKACA method not only reveals more insights that are previously unknown but also improves the performance and accuracy of knowledge domain mapping, representing a new basis for further studies.

Download Full-text

Multi-Target Search on Semantic Associations in Linked Data

International Journal on Semantic Web and Information Systems ◽

10.4018/ijswis.2018010103 ◽

2018 ◽

Vol 14 (1) ◽

pp. 71-97 ◽

Cited By ~ 1

Author(s):

Xiang Zhang ◽

Erjing Lin ◽

Yulian Lv

Keyword(s):

Linked Data ◽

Large Scale ◽

Search Process ◽

Search Model ◽

Linguistic Features ◽

Target Search ◽

Search Results ◽

Semantic Associations ◽

Effectiveness And Efficiency ◽

Virtual Document

In this article, the authors propose a novel search model: Multi-Target Search (MT search in brief). MT search is a keyword-based search model on Semantic Associations in Linked Data. Each search contains multiple sub-queries, in which each sub-query represents a certain user need for a certain object in a group relationship. They first formularize the problem of association search, and then introduce their approach to discover Semantic Associations in large-scale Linked Data. Next, they elaborate their novel search model, the notion of Virtual Document they use to extract linguistic features, and the details of search process. The authors then discuss the way search results are organized and summarized. Quantitative experiments are conducted on DBpedia to validate the effectiveness and efficiency of their approach.

Download Full-text

Reduced-Order Modeling Based on Hybrid Snapshot Simulation

International Journal of Computational Methods ◽

10.1142/s0219876220500292 ◽

2020 ◽

Vol 18 (01) ◽

pp. 2050029 ◽

Cited By ~ 1

Author(s):

Feng Bai ◽

Yi Wang

Keyword(s):

Quality Data ◽

Data Generation ◽

Order Model ◽

High Quality Data ◽

Reduced Order ◽

Online Simulation ◽

Data Fidelity ◽

Value Decomposition ◽

Proper Orthogonal ◽

Traditional Counterpart

This paper presents a hybrid snapshot simulation methodology to accelerate the generation of high-quality data for proper orthogonal decomposition (POD) and reduced-order model (ROM) development. The entire span of the snapshot simulation is divided into multiple intervals, each simulated by either high-fidelity full-order model (FOM) or fast local ROM. The simulation then alternates between FOM and local ROM to accelerate snapshot data generation while maintaining the data fidelity and representation. Model switch is determined on-the-fly by evaluating several criteria that monitor the dominance of leading POD modes and ROM trajectory. The incremental singular value decomposition (iSVD) is employed to continuously update ROMs for enhanced accuracy and utilization. A global ROM broadly applicable to various online simulation is immediately available at the end of the simulation. The hybrid snapshot simulation demonstrates excellent accuracy ([Formula: see text] error) and 2.09–2.6[Formula: see text]X speedup relative to its traditional counterpart. The constructed ROMs also preserve salient accuracy ([Formula: see text] error). The results prove feasibility of the proposed method for robust and efficient snapshot data generation and ROM development.

Download Full-text

ILastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Semantics, Analytics, Visualization - Lecture Notes in Computer Science ◽

10.1007/978-3-030-01379-0_2 ◽

2018 ◽

pp. 15-32

Author(s):

Anastasia Dimou ◽

Gerald Haesendonck ◽

Martin Vanbrabant ◽

Laurens De Vocht ◽

Ruben Verborgh ◽

...

Keyword(s):

User Interface ◽

Linked Data ◽

Data Generation ◽

Scholarly Data

Download Full-text

A case study for block-based linked data generation: Recipes as jigsaw puzzles

Journal of Information Science ◽

10.1177/0165551519849518 ◽

2019 ◽

Vol 46 (3) ◽

pp. 419-433 ◽

Cited By ~ 2

Author(s):

Övünç Öztürk ◽

Tuğba Özacar

Keyword(s):

Linked Data ◽

Heterogeneous Systems ◽

Data Repository ◽

Data Generation ◽

Text Input ◽

Ontology Language ◽

Data Generator ◽

System Usability Scale ◽

The Masses

This article is a proof-of-concept case study to evaluate the functionality of a block metaphor–based linked data generator. In this work, we chose to produce linked data repository of recipes, which provide a medium for people to share their regional and healthy recipes with the masses. However, the same approach can also be adapted easily to other domains. Therefore, the applicability of our approach extends well beyond the food domain that we are considering in this article. As a medium for information sharing and understanding between heterogeneous systems, ontologies will play an important role in the realisation of the Internet of things (IoT) vision. Therefore, an ontology-based recipe repository would also be one of the basic blocks of a smart kitchen environment. However, building ontologies is a challenging task, especially for users who are not conversant in the ontology building languages. This article proposes an approach that can be used even by non-experts and facilitates the sharing and searching of recipe data. In our case, we exploit the features of the block paradigm to publish recipes in Linked Data format. In this way, users do not have to know the OWL (Web Ontology Language) syntax and the text input is kept minimal. As far as we know, this article is the first study that produces linked data using Blockly in the literature. We also conducted a user-based evaluation of the proposed approach using the System Usability Scale (SUS) questionnaire.

Download Full-text

Linked Data Generation for the University Data From Legacy Database

International journal of Web & Semantic Technology ◽

10.5121/ijwest.2011.2302 ◽

2011 ◽

Vol 2 (3) ◽

pp. 21-31 ◽

Cited By ~ 5

Author(s):

Arup Sarkar ◽

Ujjal Marjit ◽

Utpal Biswas

Keyword(s):

Linked Data ◽

Data Generation ◽

The University

Download Full-text

lsemantica: A command for text similarity based on latent semantic analysis

The Stata Journal Promoting communications on statistics and Stata ◽

10.1177/1536867x19830910 ◽

2019 ◽

Vol 19 (1) ◽

pp. 129-142 ◽

Cited By ~ 1

Author(s):

Carlo Schwarz

Keyword(s):

Latent Semantic Analysis ◽

Semantic Analysis ◽

Learning Algorithm ◽

Singular Value ◽

Machine Learning Algorithm ◽

Truncated Singular Value Decomposition ◽

Similarity Comparison ◽

Text Similarity ◽

Semantic Relationships ◽

Value Decomposition

In this article, I present the lsemantica command, which implements latent semantic analysis in Stata. Latent semantic analysis is a machine learning algorithm for word and text similarity comparison and uses truncated singular value decomposition to derive the hidden semantic relationships between words and texts. lsemantica provides a simple command for latent semantic analysis as well as complementary commands for text similarity comparison.

Download Full-text

Split-and-Combine Singular Value Decomposition for Large-Scale Matrix

Journal of Applied Mathematics ◽

10.1155/2013/683053 ◽

2013 ◽

Vol 2013 ◽

pp. 1-8 ◽

Cited By ~ 6

Author(s):

Jengnan Tzeng

Keyword(s):

Singular Value Decomposition ◽

Large Scale ◽

Semantic Analysis ◽

Computational Cost ◽

Matrix Decomposition ◽

Singular Value ◽

Fast Method ◽

The Matrix ◽

Scale Matrix ◽

Value Decomposition

The singular value decomposition (SVD) is a fundamental matrix decomposition in linear algebra. It is widely applied in many modern techniques, for example, high- dimensional data visualization, dimension reduction, data mining, latent semantic analysis, and so forth. Although the SVD plays an essential role in these fields, its apparent weakness is the order three computational cost. This order three computational cost makes many modern applications infeasible, especially when the scale of the data is huge and growing. Therefore, it is imperative to develop a fast SVD method in modern era. If the rank of matrix is much smaller than the matrix size, there are already some fast SVD approaches. In this paper, we focus on this case but with the additional condition that the data is considerably huge to be stored as a matrix form. We will demonstrate that this fast SVD result is sufficiently accurate, and most importantly it can be derived immediately. Using this fast method, many infeasible modern techniques based on the SVD will become viable.

Download Full-text

Influence of Accessibility on Land Use and Landscape Pattern Based on Mapping Knowledge Domains: Review and Implications

Journal of Advanced Transportation ◽

10.1155/2020/7985719 ◽

2020 ◽

Vol 2020 ◽

pp. 1-12

Author(s):

Yongwei Liu ◽

Xiaoshu Cao ◽

Tao Li

Keyword(s):

Land Use ◽

Landscape Ecology ◽

Relevant Information ◽

Landscape Patterns ◽

Knowledge Domain ◽

Knowledge Domains ◽

Significant Research ◽

Long Time ◽

Transportation Geography ◽

Conducting Research

Research on the influence of accessibility on land use and landscape patterns is one of the most important subfields in landscape ecology and transportation geography. In this review article, we use CiteSpace and VOSviewer to analyze relevant information, including the number of published papers, highly cited literature, high-frequency keywords, periodicals, and the leading countries conducting research on this particular field. Based on the mapping knowledge domain theory and summarizing method, this research, using an extensive review of the existing literature to analyze the influence of accessibility on land use and landscape patterns, the following conclusions have been reached: first, most of the relevant studies are conducted by applying theories on landscape ecology rather than on transportation geography, and the measure index of accessibility is relatively simple. Second, while accessibility has played a key role in analyzing the interactions between transportation, land use, and landscape patterns, studies on the long-term effect of transportation on land use and land patterns are extremely important. Also, different road types have been found to impose different effects. Third, research on the functional landscape in inner cities has become a significant research focus, particularly with the progress in big data. And fourth, improvements in data acquisition and processing have greatly benefited the field, specifically with recent advancements in GIS and RS technology. However, studies on landscape patterns with regional perspectives have largely been insufficient, especially those conducted over long time scales.

Download Full-text

Sustainable Linked Data Generation: The Case of DBpedia

Lecture Notes in Computer Science - The Semantic Web – ISWC 2017 ◽

10.1007/978-3-319-68204-4_28 ◽

2017 ◽

pp. 297-313

Author(s):

Wouter Maroy ◽

Anastasia Dimou ◽

Dimitris Kontokostas ◽

Ben De Meester ◽

Ruben Verborgh ◽

...

Keyword(s):

Linked Data ◽

Data Generation

Download Full-text