Incremental Ontology Population and Enrichment through Semantic-based Text Mining

Saira Gillani; Andrea Ko

doi:10.4018/ijswis.2015070103

Incremental Ontology Population and Enrichment through Semantic-based Text Mining

International Journal on Semantic Web and Information Systems ◽

10.4018/ijswis.2015070103 ◽

2015 ◽

Vol 11 (3) ◽

pp. 44-66 ◽

Cited By ~ 9

Author(s):

Saira Gillani ◽

Andrea Ko

Keyword(s):

Text Mining ◽

Domain Knowledge ◽

Learning System ◽

Domain Specific ◽

Automatic Categorization ◽

New Concepts ◽

Domain Specific Knowledge ◽

E Learning ◽

Ontology Population ◽

It Audit

Higher education and professional trainings often apply innovative e-learning systems, where ontologies are used for structuring domain knowledge. To provide up-to-date knowledge for the students, ontology has to be maintained regularly. It is especially true for IT audit and security domain, because technology is changing fast. However manual ontology population and enrichment is a complex task that require professional experience involving a lot of efforts. The authors' paper deals with the challenges and possible solutions for semi-automatic ontology enrichment and population. ProMine has two main contributions; one is the semantic-based text mining approach for automatically identifying domain-specific knowledge elements; the other is the automatic categorization of these extracted knowledge elements by using Wiktionary. ProMine ontology enrichment solution was applied in IT audit domain of an e-learning system. After ten cycles of the application ProMine, the number of automatically identified new concepts are tripled and ProMine categorized new concepts with high precision and recall.

Download Full-text

Ontology Maintenance Through Semantic Text Mining

Innovations, Developments, and Applications of Semantic Web and Information Systems - Advances in Web Technologies and Engineering ◽

10.4018/978-1-5225-5042-6.ch013 ◽

2018 ◽

pp. 350-371 ◽

Cited By ~ 1

Author(s):

Andrea Ko ◽

Saira Gillani

Keyword(s):

Text Mining ◽

Learning System ◽

Professional Experience ◽

Complex Task ◽

Domain Specific ◽

New Concepts ◽

Domain Specific Knowledge ◽

E Learning ◽

Ontology Population ◽

It Audit

Manual ontology population and enrichment is a complex task that require professional experience involving a lot of efforts. The authors' paper deals with the challenges and possible solutions for semi-automatic ontology enrichment and population. ProMine has two main contributions; one is the semantic-based text mining approach for automatically identifying domain-specific knowledge elements; the other is the automatic categorization of these extracted knowledge elements by using Wiktionary. ProMine ontology enrichment solution was applied in IT audit domain of an e-learning system. After seven cycles of the application ProMine, the number of automatically identified new concepts are significantly increased and ProMine categorized new concepts with high precision and recall.

Download Full-text

Cross-Fertilizing Deep Web Analysis and Ontology Enrichment

10.31219/osf.io/b3fvz ◽

2017 ◽

Author(s):

Marilena Oita ◽

Antoine Amarilli ◽

Pierre Senellart

Keyword(s):

Domain Knowledge ◽

Deep Web ◽

Web Pages ◽

Complete Understanding ◽

Specific Knowledge ◽

Domain Specific ◽

Domain Specific Knowledge ◽

Web Crawlers ◽

New Perspective ◽

The Impact

Deep Web databases, whose content is presented as dynamically-generated Web pages hidden behind forms, have mostly been left unindexed by search engine crawlers. In order to automatically explore this mass of information, many current techniques assume the existence of domain knowledge, which is costly to create and maintain. In this article, we present a new perspective on form understanding and deep Web data acquisition that does not require any domain-specific knowledge. Unlike previous approaches, we do not perform the various steps in the process (e.g., form understanding, record identification, attribute labeling) independently but integrate them to achieve a more complete understanding of deep Web sources. Through information extraction techniques and using the form itself for validation, we reconcile input and output schemas in a labeled graph which is further aligned with a generic ontology. The impact of this alignment is threefold: first, the resulting semantic infrastructure associated with the form can assist Web crawlers when probing the form for content indexing; second, attributes of response pages are labeled by matching known ontology instances, and relations between attributes are uncovered; and third, we enrich the generic ontology with facts from the deep Web.

Download Full-text

The REA Pattern, Knowledge Structures, and Conceptual Modeling Performance

Journal of Information Systems ◽

10.2308/jis.2005.19.2.57 ◽

2005 ◽

Vol 19 (2) ◽

pp. 57-77 ◽

Cited By ~ 11

Author(s):

Gregory J. Gerard

Keyword(s):

Domain Knowledge ◽

Conceptual Models ◽

Conceptual Modeling ◽

Knowledge Structure ◽

Specific Pattern ◽

Accounting Information Systems ◽

Domain Specific ◽

Domain Specific Knowledge ◽

Business Setting ◽

Structured Knowledge

Most database textbooks on conceptual modeling do not cover domainspecific patterns. The texts emphasize notation, apparently assuming that notation enables individuals to correctly model domain-specific knowledge acquired from experience. However, the domain knowledge acquired may not aid in the construction of conceptual models if it is not structured to support conceptual modeling. This study uses the Resources Events Agents (REA) pattern as an example of a domain-specific pattern that can be encoded as a knowledge structure for conceptual modeling of accounting information systems (AIS), and tests its effects on the accuracy of conceptual modeling in a familiar business setting. Fifty-three undergraduate and forty-six graduate students completed recall tasks designed to measure REA knowledge structure. The accuracy of participants' conceptual models was positively related to REA knowledge structure. Results suggest it is insufficient to know only conceptual modeling notation because structured knowledge of domain-specific patterns reduces design errors.

Download Full-text

Research on feature-based opinion mining using topic maps

The Electronic Library ◽

10.1108/el-11-2014-0197 ◽

2016 ◽

Vol 34 (3) ◽

pp. 435-456 ◽

Cited By ~ 2

Author(s):

Lixin Xia ◽

Zhongyi Wang ◽

Chen Chen ◽

Shanshan Zhai

Keyword(s):

Domain Knowledge ◽

Opinion Mining ◽

Specific Knowledge ◽

Content Type ◽

Topic Maps ◽

Domain Specific ◽

Product Features ◽

Domain Specific Knowledge ◽

Feature Based ◽

Integrate Domain

Purpose Opinion mining (OM), also known as “sentiment classification”, which aims to discover common patterns of user opinions from their textual statements automatically or semi-automatically, is not only useful for customers, but also for manufacturers. However, because of the complexity of natural language, there are still some problems, such as domain dependence of sentiment words, extraction of implicit features and others. The purpose of this paper is to propose an OM method based on topic maps to solve these problems. Design/methodology/approach Domain-specific knowledge is key to solve problems in feature-based OM. On the one hand, topic maps, as an ontology framework, are composed of topics, associations, occurrences and scopes, and can represent a class of knowledge representation schemes. On the other hand, compared with ontology, topic maps have many advantages. Thus, it is better to integrate domain-specific knowledge into OM based on topic maps. This method can make full use of the semantic relationships among feature words and sentiment words. Findings In feature-level OM, most of the existing research associate product features and opinions by their explicit co-occurrence, or use syntax parsing to judge the modification relationship between opinion words and product features within a review unit. They are mostly based on the structure of language units without considering domain knowledge. Only few methods based on ontology incorporate domain knowledge into feature-based OM, but they only use the “is-a” relation between concepts. Therefore, this paper proposes feature-based OM using topic maps. The experimental results revealed that this method can improve the accuracy of the OM. The findings of this study not only advance the state of OM research but also shed light on future research directions. Research limitations/implications To demonstrate the “feature-based OM using topic maps” applications, this work implements a prototype that helps users to find their new washing machines. Originality/value This paper presents a new method of feature-based OM using topic maps, which can integrate domain-specific knowledge into feature-based OM effectively. This method can improve the accuracy of the OM greatly. The proposed method can be applied across various application domains, such as e-commerce and e-government.

Download Full-text

Workflows Científicos com Apoio de Bases de Conhecimento em Tempo Real

10.5753/bresci.2016.9123 ◽

2020 ◽

Author(s):

Victor S. Bursztyn ◽

Jonas Dias ◽

Marta Mattoso

Keyword(s):

Knowledge Base ◽

Domain Knowledge ◽

Large Scale ◽

Workflow Engine ◽

Workflow Execution ◽

Human In The Loop ◽

Domain Specific ◽

Political Sciences ◽

Provenance Data ◽

Domain Specific Knowledge

One major challenge in large-scale experiments is the analytical capacity to contrast ongoing results with domain knowledge. We approach this challenge by constructing a domain-specific knowledge base, which is queried during workflow execution. We introduce K-Chiron, an integrated solution that combines a state-of-the-art automatic knowledge base construction (KBC) system to Chiron, a well-established workflow engine. In this work we experiment in the context of Political Sciences to show how KBC may be used to improve human-in-the-loop (HIL) support in scientific experiments. While HIL in traditional domain expert supervision is done offline, in K-Chiron it is done online, i.e. at runtime. We achieve results in less laborious ways, to the point of enabling a breed of experiments that could be unfeasible with traditional HIL. Finally, we show how provenance data could be leveraged with KBC to enable further experimentation in more dynamic settings.

Download Full-text

A General Method for Transferring Explicit Knowledge into Language Model Pretraining

Security and Communication Networks ◽

10.1155/2021/7115167 ◽

2021 ◽

Vol 2021 ◽

pp. 1-8

Author(s):

Ruiqing Yan ◽

Lanchang Sun ◽

Fang Wang ◽

Xiaoming Zhang

Keyword(s):

Domain Knowledge ◽

Explicit Knowledge ◽

Language Model ◽

Background Knowledge ◽

Knowledge Bases ◽

Language Models ◽

Domain Specific ◽

Text Understanding ◽

Domain Specific Knowledge ◽

General Method

Recently, pretrained language models, such as Bert and XLNet, have rapidly advanced the state of the art on many NLP tasks. They can model implicit semantic information between words in the text. However, it is solely at the token level without considering the background knowledge. Intuitively, background knowledge influences the efficacy of text understanding. Inspired by this, we focus on improving model pretraining by leveraging external knowledge. Different from recent research that optimizes pretraining models by knowledge masking strategies, we propose a simple but general method to transfer explicit knowledge with pretraining. To be specific, we first match knowledge facts from a knowledge base (KB) and then add a knowledge injunction layer to a transformer directly without changing its architecture. This study seeks to find the direct impact of explicit knowledge on model pretraining. We conduct experiments on 7 datasets using 5 knowledge bases in different downstream tasks. Our investigation reveals promising results in all the tasks. The experiment also verifies that domain-specific knowledge is superior to open-domain knowledge in domain-specific task, and different knowledge bases have different performances in different tasks.

Download Full-text

Cross-Disciplinary Collaboration

Performing Knowledge ◽

10.1093/oso/9780190653545.003.0002 ◽

2020 ◽

pp. 21-32

Author(s):

Daphne Leong

Keyword(s):

Intercultural Communication ◽

Domain Knowledge ◽

Mutual Influence ◽

Specific Knowledge ◽

Domain Specific ◽

Cross Domain ◽

Domain Specific Knowledge ◽

Epistemic Objects

This chapter describes the things and people that facilitate collaboration across disciplines: shared items, shared objectives, and shared agents. (These concepts draw from literature on collaboration in the sciences and from research on intercultural communication.) Shared items function differently from discipline to discipline, while being identifiable across disciplines. Shared objectives comprise activity objects, the prospective outcomes of collaboration, and epistemic objects, knowledge sought. Shared agents function within and across two or more disciplines. In this book, shared items are represented primarily by scores (and recordings), activity objects by the book’s chapters, epistemic objects by interpretations of pieces and of analysis-performance relations, and shared agents by scholar-performers or performer-scholars. Mechanisms and processes of collaboration are briefly described: strategies for collaborating when views diverge, and degrees of collaborative convergence (working in parallel, translating or mediating knowledge for mutual influence, transforming domain-specific knowledge into new cross-domain knowledge).

Download Full-text

Exploring User Feedback of a E-Learning System: A Text Mining Approach

Human Interface and the Management of Information. Information and Interaction for Learning, Culture, Collaboration and Business, - Lecture Notes in Computer Science ◽

10.1007/978-3-642-39226-9_21 ◽

2013 ◽

pp. 182-191 ◽

Cited By ~ 3

Author(s):

Wen-Bin Yu ◽

Ronaldo Luna

Keyword(s):

Text Mining ◽

User Feedback ◽

Learning System ◽

System A ◽

E Learning

Download Full-text

An Effective Assessment of Knowledge Sharing and E-Learning Portals

International Journal of Web-Based Learning and Teaching Technologies ◽

10.4018/ijwltt.2015040101 ◽

2015 ◽

Vol 10 (2) ◽

pp. 1-12 ◽

Cited By ~ 1

Author(s):

D. Venkata Subramanian ◽

Angelina Geetha ◽

P. Shankar

Keyword(s):

Knowledge Sharing ◽

Industrialized Countries ◽

Specific Knowledge ◽

Web Based ◽

Domain Specific ◽

Domain Specific Knowledge ◽

E Learning ◽

Effective Assessment ◽

Process Trends ◽

Domain Independent

In recent years, most of the companies have increasingly realized the importance of the knowledge sharing portal and E-Learning portals to provide competitive knowledge for their employees. The knowledge stored in these portals varies from technical, process and project knowledge functional or domain specific knowledge to face the competitiveness among other companies or organizations, especially in industrialized countries. More than three-fourths of organizations have focused on their investment in technology and process trends that encourage user collaboration through Knowledge sharing and e-Learning Portals. There are many number of challenges in evaluating the effectiveness of the E-Learning Portals and Knowledge Portals. The primary goal of this paper is to illustrate how a domain independent multi-dimensional metric model and metric database can be built to assess the effectiveness of the Web Based Knowledge and E-Learning Portals.

Download Full-text

An ADOxx Based Environment for Problem Based Learning in Manufacturing Systems Design

MATEC Web of Conferences ◽

10.1051/matecconf/201929014003 ◽

2019 ◽

Vol 290 ◽

pp. 14003 ◽

Cited By ~ 1

Author(s):

Ion Dan Mironescu

Keyword(s):

Domain Knowledge ◽

Manufacturing Systems ◽

Learning Experience ◽

Problem Based Learning ◽

Systems Design ◽

Learning By Doing ◽

Design Solution ◽

Learning Material ◽

Domain Specific ◽

Domain Specific Knowledge

The Problem Based Learning (PBL) as student centred approach and learning-by-doing method is suited for the modern higher education. However, the first contact with the method can be overwhelming for the students, in the absence of prior domain knowledge. The preparation of the learning material can be time and resource consuming for the teacher. The goal of the research was the implementation of an environment that should enhance the learning experience for the student and reduce the implementation burden for the teacher. The environment is based on the ADOxx platform and allows the collaboration of the learner teams and the teacher-learner interaction on three levels. The Metamodeling level supports the development of the domain-specific language used in the modelling of the manufacturing system; this activity stimulates and directs the gathering and consolidation of domain-specific knowledge. The modelling level allows the development of alternative design solution using models of the factory components. The Simulation level allows the analysis of these variants. The environment supports the teacher in developing instructional scaffolding and uses cases to ease the learners the first time contact with PBL. The functionality of the environment is presented using the case of designing a flexible food production line.

Download Full-text