Automated Identification of Semantic Similarity between Concepts of Textual Business Rules

2021 ◽  
Vol 14 (1) ◽  
pp. 147-156
Author(s):  
Abdellatif Haj ◽  
Youssef Balouki ◽  
Taoufiq Gadi ◽  
...  

Business Rules (BR) are usually written by different stakeholders, which makes them prone to using different designations for the same concept. Such a problem can be the source of poorly orchestrated behaviors, yet the identification of synonyms is manual or neglected entirely in most approaches dealing with natural language Business Rules. In this paper, we present an automated approach to identify semantic similarity between terms in textual BR using Natural Language Processing and a knowledge-based algorithm refined with heuristics. Our method is unique in that it also identifies abbreviation/expansion pairs (a special case of synonymy), which cannot be detected with a dictionary alone. The results are then saved in a standard format (SBVR) for reusability. Our approach was applied to more than 160 BR statements divided into three cases, achieving an accuracy between 69% and 87%, which suggests it is an indispensable enhancement for other methods dealing with textual BR.
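
To make the idea concrete, the following is a minimal sketch (not the authors' implementation) of the two detection steps described above: a WordNet-based check for near-synonymous terms and a heuristic check for abbreviation/expansion pairs, which a dictionary lookup alone would miss. The term list and similarity threshold are illustrative assumptions.

```python
# Minimal sketch: pairing Business Rule terms that are likely synonyms via
# WordNet, plus a heuristic abbreviation/expansion check.
from itertools import combinations
from nltk.corpus import wordnet as wn  # requires: nltk.download('wordnet')

def wordnet_synonyms(term_a, term_b, threshold=0.8):
    """Return True if any sense pair of the two terms is highly similar."""
    for sa in wn.synsets(term_a):
        for sb in wn.synsets(term_b):
            score = sa.wup_similarity(sb)
            if score is not None and score >= threshold:
                return True
    return False

def is_abbreviation(short, long):
    """Heuristic: 'BR' matches 'business rules' by initial letters."""
    initials = "".join(w[0] for w in long.lower().split())
    return short.lower() == initials

terms = ["client", "customer", "BR", "business rules"]
for a, b in combinations(terms, 2):
    if wordnet_synonyms(a, b) or is_abbreviation(a, b) or is_abbreviation(b, a):
        print(f"possible same concept: {a!r} ~ {b!r}")
```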

Author(s):  
Saravanakumar Kandasamy ◽  
Aswani Kumar Cherukuri

Quantifying semantic similarity between concepts is an essential part of domains such as Natural Language Processing, Information Retrieval, and Question Answering, where it helps in understanding text and the relationships within it. Over the last few decades, many measures have been proposed that incorporate various corpus-based and knowledge-based resources. WordNet and Wikipedia are two such knowledge-based resources. WordNet's contribution to these domains is enormous due to its richness in defining a word and all of its relationships with others. In this paper, we propose an approach to quantify the similarity between concepts that exploits the synsets and the gloss definitions of different concepts using WordNet. Our method considers the gloss definitions, the contextual words that help define a word, the synsets of those contextual words, and the confidence of occurrence of a word in another word's definition when calculating similarity. Evaluation on several gold-standard benchmark datasets shows the effectiveness of our system in comparison with existing taxonomical and definitional measures.
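
As a rough illustration of the gloss-based idea described above (not the authors' exact scoring function), the sketch below measures the overlap between the WordNet gloss definitions of two words across their sense pairs; the stop-word list and normalization are assumptions.

```python
# Gloss-overlap similarity sketch using WordNet synset definitions.
# Requires NLTK data: nltk.download('wordnet'); nltk.download('punkt')
from nltk.corpus import wordnet as wn
from nltk.tokenize import word_tokenize

STOP = {"a", "an", "the", "of", "in", "to", "or", "and", "that", "is"}

def gloss_tokens(synset):
    """Content words from a synset's definition."""
    return {t.lower() for t in word_tokenize(synset.definition())
            if t.isalpha() and t.lower() not in STOP}

def gloss_overlap_similarity(word_a, word_b):
    """Max normalized gloss overlap across all sense pairs of two words."""
    best = 0.0
    for sa in wn.synsets(word_a):
        for sb in wn.synsets(word_b):
            ga, gb = gloss_tokens(sa), gloss_tokens(sb)
            if ga and gb:
                best = max(best, len(ga & gb) / min(len(ga), len(gb)))
    return best

print(gloss_overlap_similarity("car", "automobile"))
```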


AERA Open ◽  
2021 ◽  
Vol 7 ◽  
pp. 233285842110286
Author(s):  
Kylie L. Anglin ◽  
Vivian C. Wong ◽  
Arielle Boguslav

Though there is widespread recognition of the importance of implementation research, evaluators often face intense logistical, budgetary, and methodological challenges in their efforts to assess intervention implementation in the field. This article proposes a set of natural language processing techniques called semantic similarity as an innovative and scalable method of measuring implementation constructs. Semantic similarity methods are an automated approach to quantifying the similarity between texts. By applying semantic similarity to transcripts of intervention sessions, researchers can use the method to determine whether an intervention was delivered with adherence to a structured protocol, and the extent to which an intervention was replicated with consistency across sessions, sites, and studies. This article provides an overview of semantic similarity methods, describes their application within the context of educational evaluations, and provides a proof of concept using an experimental study of the impact of a standardized teacher coaching intervention.
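
A minimal sketch of the general approach, assuming the sentence-transformers package and an off-the-shelf embedding model rather than the article's exact pipeline: adherence can be approximated by the cosine similarity between a scripted protocol statement and each session transcript segment.

```python
# Illustrative only: scoring how closely session transcripts track a protocol
# script with sentence embeddings. Model name and texts are assumptions.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

protocol = "Begin by reviewing the lesson objectives with the teacher."
sessions = [
    "We started by going over the goals of today's lesson together.",
    "The coach spent the session discussing unrelated scheduling issues.",
]

protocol_emb = model.encode(protocol, convert_to_tensor=True)
session_embs = model.encode(sessions, convert_to_tensor=True)

# Higher cosine similarity suggests closer adherence to the scripted protocol.
for text, score in zip(sessions, util.cos_sim(protocol_emb, session_embs)[0]):
    print(f"{float(score):.2f}  {text}")
```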


2021 ◽  
Vol 54 (2) ◽  
pp. 1-37
Author(s):  
Dhivya Chandrasekaran ◽  
Vijay Mago

Estimating the semantic similarity between text data is one of the challenging and open research problems in the field of Natural Language Processing (NLP). The versatility of natural language makes it difficult to define rule-based methods for determining semantic similarity measures. To address this issue, various semantic similarity methods have been proposed over the years. This survey article traces the evolution of such methods beginning from traditional NLP techniques such as kernel-based methods to the most recent research work on transformer-based models, categorizing them based on their underlying principles as knowledge-based, corpus-based, deep neural network–based methods, and hybrid methods. Discussing the strengths and weaknesses of each method, this survey provides a comprehensive view of existing systems in place for new researchers to experiment and develop innovative ideas to address the issue of semantic similarity.
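
As a point of reference for one of the surveyed families, the snippet below shows a tiny corpus-based baseline (TF-IDF vectors with cosine similarity); the example texts are illustrative and the snippet is not drawn from the survey itself.

```python
# Corpus-based baseline: TF-IDF vectors compared with cosine similarity.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

texts = [
    "The cat sat on the mat.",
    "A cat was sitting on a rug.",
    "Stock prices fell sharply today.",
]
tfidf = TfidfVectorizer().fit_transform(texts)
print(cosine_similarity(tfidf[0], tfidf[1:]))  # first text vs. the other two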


Author(s):  
Azleena Mohd Kassim ◽  
Yu-N Cheah

Information Technology (IT) is often employed to put knowledge management policies into operation. However, many of these tools require human intervention when it comes to deciding how the knowledge is to be managed. The Semantic Web may be an answer to this issue, but many Semantic Web tools are not readily available to the regular IT user. Another problem is that typical efforts to apply or reuse knowledge via a search mechanism do not necessarily link to other pages that are relevant. Blogging systems appear to address some of these challenges, but the browsing experience can be further enhanced by providing links to other relevant posts. In this chapter, the authors present a semantic blogging tool called SEMblog to identify, organize, and reuse knowledge based on the Semantic Web and ontologies. The SEMblog methodology brings together technologies such as Natural Language Processing (NLP), Semantic Web representations, and the ubiquity of the blogging environment to produce a more intuitive way to manage knowledge, especially in the areas of knowledge identification, organization, and reuse. Based on detailed comparisons with other similar systems, the uniqueness of SEMblog lies in its ability to automatically generate keywords and semantic links.
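
A hypothetical illustration of the two capabilities highlighted above, keyword generation and semantic linking, is sketched below; it uses plain TF-IDF rather than SEMblog's actual NLP and ontology machinery, and the post texts and threshold are invented for the example.

```python
# Sketch: derive keywords per post with TF-IDF and link similar posts.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

posts = {
    "post1": "Ontologies make blog posts easier to organize and reuse.",
    "post2": "Reusing knowledge from older blog posts needs good organization.",
    "post3": "Today I tried a new coffee recipe at home.",
}

vec = TfidfVectorizer(stop_words="english")
matrix = vec.fit_transform(posts.values())
terms = vec.get_feature_names_out()
names = list(posts)

# "Keywords" here are simply each post's three highest-weighted terms.
for name, row in zip(names, matrix.toarray()):
    top = row.argsort()[-3:][::-1]
    print(name, "keywords:", [terms[i] for i in top if row[i] > 0])

# Link any two posts whose TF-IDF vectors exceed a similarity threshold.
sims = cosine_similarity(matrix)
for i in range(len(names)):
    for j in range(i + 1, len(names)):
        if sims[i, j] > 0.1:
            print(f"semantic link: {names[i]} <-> {names[j]}")
```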


Author(s):  
Iraj Mantegh ◽  
Nazanin S. Darbandi

Robotic alternatives to many manual operations fall short in application due to the difficulty of capturing the manual skill of an expert operator. One of the main problems to be solved if robots are to become flexible enough for various manufacturing needs is that of end-user programming. An end-user with little or no technical expertise in robotics needs to be able to communicate a manufacturing task to the robot efficiently. This paper proposes a new method for robot task planning using concepts from Artificial Intelligence. Our method is based on a hierarchical knowledge representation and propositional logic, which allows an expert user to incrementally integrate process and geometric parameters with the robot commands. The objective is to provide an intelligent, programmable agent such as a robot with a knowledge base about the attributes of human behaviors in order to facilitate the commanding process. The focus of this work is on robot programming for manufacturing applications. Industrial manipulators work with low-level programming languages. This work presents a new method based on Natural Language Processing (NLP) that allows a user to generate robot programs using a natural language lexicon and task information. This will enable a manufacturing operator (for example, in painting) who may be unfamiliar with robot programming to easily employ the agent for manufacturing tasks.
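
A schematic sketch of the lexicon-driven translation step, with an assumed toy lexicon and command set rather than the paper's knowledge base, might look like this:

```python
# Toy sketch: map a natural-language instruction to robot primitives
# through a hand-written lexicon (all names here are assumptions).
LEXICON = {
    "paint": "SPRAY_ON",
    "move": "MOVE_TO",
    "stop": "SPRAY_OFF",
}

def instruction_to_program(sentence):
    """Translate one sentence into a list of robot commands via the lexicon."""
    program = []
    for word in sentence.lower().split():
        action = LEXICON.get(word.strip(".,"))
        if action:
            program.append(action)
    return program

print(instruction_to_program("Move to the panel and paint the left side."))
# ['MOVE_TO', 'SPRAY_ON']
```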


2020 ◽  
Vol 10 (8) ◽  
pp. 2824
Author(s):  
Yu-Hsiang Su ◽  
Ching-Ping Chao ◽  
Ling-Chien Hung ◽  
Sheng-Feng Sung ◽  
Pei-Ju Lee

Electronic medical records (EMRs) have been used extensively in most medical institutions in Taiwan for more than a decade. However, information overload associated with the rapid accumulation of large amounts of clinical narratives has threatened the effective use of EMRs. This situation is further worsened by the practice of "copying and pasting", which leads to a great deal of redundant information in clinical notes. This study aimed to apply natural language processing techniques to address this problem. New information in longitudinal clinical notes was identified based on a bigram language model. The accuracy of automated identification of new information was evaluated using expert annotations as the reference standard. A two-stage cross-over user experiment was conducted to evaluate the impact of highlighting new information on task demands, task performance, and perceived workload. The automated method identified new information with an F1 score of 0.833. The user experiment found a significant decrease in perceived workload together with significantly higher task performance. In conclusion, automated identification of new information in clinical notes is feasible and practical. Highlighting new information enables healthcare professionals to grasp key information from clinical notes with less perceived workload.
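
A simplified sketch of the bigram idea (not the study's validated pipeline) is shown below: sentences in a new note are flagged when most of their bigrams were never seen in earlier notes; the threshold and example notes are assumptions.

```python
# Flag likely-new sentences by the fraction of previously unseen bigrams.
from nltk import bigrams, word_tokenize  # requires NLTK 'punkt' data

def note_bigrams(text):
    return set(bigrams(word_tokenize(text.lower())))

earlier_notes = "Patient admitted with chest pain. Started on aspirin."
new_note = "Patient admitted with chest pain. Troponin elevated this morning."

seen = note_bigrams(earlier_notes)
for sentence in new_note.split(". "):
    grams = note_bigrams(sentence)
    novelty = len(grams - seen) / len(grams) if grams else 0.0
    if novelty > 0.5:  # arbitrary threshold for this illustration
        print("NEW:", sentence)
```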


Author(s):  
KOH TOH TZU

At the end of last year, researchers at the Institute of Systems Science (ISS) began considering a more ambitious project as part of the institute's multilingual programming objective. This project examines the domain of Chinese business letter writing. With the problem defined as generating Chinese letters to meet business needs, investigations suggest an intersection of three possible approaches: knowledge engineering, form processing, and natural language processing. This paper reports some of the findings and documents the design and implementation issues that have arisen and been tackled as prototyping work progresses.


2010 ◽  
Vol 16 (4) ◽  
pp. 417-437 ◽  
Author(s):  
TIM VAN DE CRUYS

Distributional similarity methods have proven to be a valuable tool for the induction of semantic similarity. Until now, most algorithms have used two-way co-occurrence data to compute the meaning of words. Co-occurrence frequencies, however, need not be pairwise. One can easily imagine situations where it is desirable to investigate co-occurrence frequencies of three modes and beyond. This paper investigates tensor factorization methods to build a model of three-way co-occurrences. The approach is applied to the problem of selectional preference induction and automatically evaluated in a pseudo-disambiguation task. The results show that tensor factorization, and non-negative tensor factorization in particular, is a promising tool for Natural Language Processing (NLP).
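
As a toy illustration of the technique (not the paper's data or settings), the snippet below factorizes a small three-way co-occurrence tensor with non-negative tensor factorization using the TensorLy library; the counts are randomly generated stand-ins for verb-subject-object frequencies.

```python
# Non-negative tensor factorization of a 3-mode co-occurrence tensor.
import numpy as np
import tensorly as tl
from tensorly.decomposition import non_negative_parafac

# Fake three-way co-occurrence counts: 4 verbs x 3 subjects x 3 objects.
counts = np.random.default_rng(0).poisson(2.0, size=(4, 3, 3)).astype(float)
tensor = tl.tensor(counts)

# Decompose into rank-2 non-negative factors, one factor matrix per mode.
weights, factors = non_negative_parafac(tensor, rank=2, n_iter_max=200)
verb_f, subj_f, obj_f = factors
print(verb_f.shape, subj_f.shape, obj_f.shape)  # (4, 2) (3, 2) (3, 2)
```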

