Representing Knowledge in Controlled Natural Language: A Case Study

Author(s):  
Rolf Schwitter
2021 ◽  
Vol 26 (4) ◽  
Author(s):  
Alvaro Veizaga ◽  
Mauricio Alferez ◽  
Damiano Torre ◽  
Mehrdad Sabetzadeh ◽  
Lionel Briand

AbstractNatural language (NL) is pervasive in software requirements specifications (SRSs). However, despite its popularity and widespread use, NL is highly prone to quality issues such as vagueness, ambiguity, and incompleteness. Controlled natural languages (CNLs) have been proposed as a way to prevent quality problems in requirements documents, while maintaining the flexibility to write and communicate requirements in an intuitive and universally understood manner. In collaboration with an industrial partner from the financial domain, we systematically develop and evaluate a CNL, named Rimay, intended at helping analysts write functional requirements. We rely on Grounded Theory for building Rimay and follow well-known guidelines for conducting and reporting industrial case study research. Our main contributions are: (1) a qualitative methodology to systematically define a CNL for functional requirements; this methodology is intended to be general for use across information-system domains, (2) a CNL grammar to represent functional requirements; this grammar is derived from our experience in the financial domain, but should be applicable, possibly with adaptations, to other information-system domains, and (3) an empirical evaluation of our CNL (Rimay) through an industrial case study. Our contributions draw on 15 representative SRSs, collectively containing 3215 NL requirements statements from the financial domain. Our evaluation shows that Rimay is expressive enough to capture, on average, 88% (405 out of 460) of the NL requirements statements in four previously unseen SRSs from the financial domain.


2020 ◽  
Vol 44 (12) ◽  
Author(s):  
Ishita Dasgupta ◽  
Demi Guo ◽  
Samuel J. Gershman ◽  
Noah D. Goodman
Keyword(s):  

Author(s):  
Jacqueline Peng ◽  
Mengge Zhao ◽  
James Havrilla ◽  
Cong Liu ◽  
Chunhua Weng ◽  
...  

Abstract Background Natural language processing (NLP) tools can facilitate the extraction of biomedical concepts from unstructured free texts, such as research articles or clinical notes. The NLP software tools CLAMP, cTAKES, and MetaMap are among the most widely used tools to extract biomedical concept entities. However, their performance in extracting disease-specific terminology from literature has not been compared extensively, especially for complex neuropsychiatric disorders with a diverse set of phenotypic and clinical manifestations. Methods We comparatively evaluated these NLP tools using autism spectrum disorder (ASD) as a case study. We collected 827 ASD-related terms based on previous literature as the benchmark list for performance evaluation. Then, we applied CLAMP, cTAKES, and MetaMap on 544 full-text articles and 20,408 abstracts from PubMed to extract ASD-related terms. We evaluated the predictive performance using precision, recall, and F1 score. Results We found that CLAMP has the best performance in terms of F1 score followed by cTAKES and then MetaMap. Our results show that CLAMP has much higher precision than cTAKES and MetaMap, while cTAKES and MetaMap have higher recall than CLAMP. Conclusion The analysis protocols used in this study can be applied to other neuropsychiatric or neurodevelopmental disorders that lack well-defined terminology sets to describe their phenotypic presentations.


Author(s):  
Ekaterina Savitskaya ◽  

In the field of cognitive linguistics it is accepted that, before developing its capacity for abstract and theoretical thought, the human mind went through the stage of reflecting reality through concrete images and thus has inherited old cognitive patterns. Even abstract notions of the modern civilization are based on traditional concrete images, and it is all fixed in natural language units. By way of illustration, the author analyzes the cognitive pattern “сleanness / dirtiness” as a constituent part of the English linguoculture, looking at the whole range of its verbal realization and demonstrating its influence on language-based thinking and modeling of reality. Comparing meanings of language units with their inner forms enabled the author to establish the connection between abstract notions and concrete images within cognitive patterns. Using the method of internal comparison and applying the results of etymological reconstruction of language units’ inner form made it possible to see how the world is viewed by representatives of the English linguoculture. Apparently, in the English linguoculture images of cleanness / dirtiness symbolize mainly two thematic areas: that of morality and that of renewal. Since every ethnic group has its own axiological dominants (key values) that determine the expressiveness of verbal invectives, one can draw the conclusion that people perceive and comprehend world fragments through the prism of mental stereo-types fixed in the inner form of language units. Sometimes, in relation to specific language units, a conflict arises between the inner form which retains traditional thinking and a meaning that reflects modern reality. Still, linguoculture is a constantly evolving entity, and its de-velopment entails breaking established stereotypes and creating new ones. Linguistically, the victory of the new over the old is manifested in the “dying out” of the verbal support for pre-vious cognitive patterns, which leads to “reprogramming” (“recoding”) of linguoculture rep-resentatives’ mentality.


Sign in / Sign up

Export Citation Format

Share Document