Semi-Automatic Ontology Development
Published by IGI Global (ISBN 9781466601888, 9781466601895)
Total documents: 10 (five years: 0) · H-index: 2 (five years: 0)

Latest Publications

Author(s): Mariana Damova, Atanas Kiryakov, Maurice Grinberg, Michael K. Bergman, Frédérick Giasson, ...

The chapter introduces the process of developing two upper-level ontologies, PROTON and UMBEL, into reference ontologies and integrating them in the so-called Reference Knowledge Stack (RKS). It is argued that the RKS is an important step in the effort of the Linked Open Data (LOD) project to transform the Web into a global data space with diverse real data available for review and analysis. The RKS is intended to make interoperability between published datasets far more efficient than it is now. The approach discussed in the chapter consists of developing reference layers of upper-level ontologies by mapping them to selected LOD schemata and assigning instance data to them, so that they cover a reasonable portion of the LOD datasets. The chapter presents the methods (manual and semi-automatic) used in the creation of the RKS and gives examples that illustrate its advantages for managing highly heterogeneous data and its usefulness in real-life, knowledge-intensive applications.
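As a rough sketch of what a reference-layer mapping looks like in practice, the snippet below uses Python's rdflib to subsume a LOD schema class under upper-level reference classes; the PROTON, UMBEL, and DBpedia IRIs are illustrative assumptions, not the actual RKS mapping files.

```python
from rdflib import Graph, Namespace
from rdflib.namespace import OWL, RDFS

# Illustrative namespaces; the concrete RKS mapping files are far larger.
PROTON = Namespace("http://proton.semanticweb.org/protontop#")
UMBEL = Namespace("http://umbel.org/umbel/rc/")
DBO = Namespace("http://dbpedia.org/ontology/")

g = Graph()
g.bind("proton", PROTON)
g.bind("umbel", UMBEL)
g.bind("dbo", DBO)

# Mapping a LOD schema class under the upper-level reference classes means
# instance data from the mapped dataset becomes queryable through a single
# reference vocabulary.
g.add((DBO.Person, RDFS.subClassOf, PROTON.Person))
g.add((DBO.Person, OWL.equivalentClass, UMBEL.Person))

print(g.serialize(format="turtle"))
```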


Author(s): Andreea Diosteanu, Armando Stellato, Andrea Turbati

In this chapter, the authors present Service Oriented Data Acquisition (SODA), a service-deployable open-source platform for retrieving and dynamically aggregating information extraction and knowledge acquisition software components. The motivation for creating such a system came from the observed gap between the wide availability of information analysis components for different frameworks (such as UIMA [Ferrucci & Lally, 2004] and GATE [Cunningham, Maynard, Bontcheva, & Tablan, 2002]) and the difficulty of discovering, retrieving, and integrating these components and embedding them into software systems for knowledge feeding. Surveying the research area, the authors found only a few solutions to this problem, all of which fall short in platform independence, collaboration, flexibility, and, most of all, openness. The solution they propose targets different kinds of users, from application developers, who benefit from a semantic repository of interconnectable information extraction and ontology feeding components, to end users, who can plug and play these components through SODA-compliant clients.
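The chapter describes SODA's architecture rather than its code; the toy registry below is a hypothetical illustration of the discover-and-aggregate idea only, with all class and method names invented for the sketch.

```python
from dataclasses import dataclass, field

@dataclass
class ComponentDescriptor:
    """Metadata describing a deployable information extraction component."""
    name: str
    framework: str                              # e.g. "UIMA" or "GATE"
    inputs: set = field(default_factory=set)    # annotation types consumed
    outputs: set = field(default_factory=set)   # annotation types produced

class Registry:
    """Toy semantic repository: publish, discover, and chain components."""

    def __init__(self):
        self._components = []

    def publish(self, desc):
        self._components.append(desc)

    def discover(self, needed_output):
        return [c for c in self._components if needed_output in c.outputs]

    def chain(self, start_input, goal_output):
        """Greedily aggregate components until the goal type is produced."""
        available, pipeline = {start_input}, []
        progress = True
        while goal_output not in available and progress:
            progress = False
            for c in self._components:
                if c not in pipeline and c.inputs <= available:
                    pipeline.append(c)
                    available |= c.outputs
                    progress = True
        return pipeline if goal_output in available else None

registry = Registry()
registry.publish(ComponentDescriptor("tokenizer", "GATE",
                                     {"text"}, {"tokens"}))
registry.publish(ComponentDescriptor("entity-tagger", "UIMA",
                                     {"tokens"}, {"entities"}))
print([c.name for c in registry.chain("text", "entities")])
# -> ['tokenizer', 'entity-tagger']
```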


Author(s): Christian M. Meyer, Iryna Gurevych

To construct their ontology OntoWiktionary, the authors present a two-step approach that involves (1) harvesting structured knowledge from Wiktionary and (2) ontologizing this knowledge, i.e., forming ontological concepts and relationships from the harvested entries. They evaluate their approach with human judgments and find the new ontology to be of good overall quality. To encourage further research in this field, the authors make the final OntoWiktionary publicly available and suggest integrating this novel resource with the Linked Data cloud as well as other existing ontology projects.
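As a hedged illustration of step (2), the toy function below merges word senses connected by synonym links into ontological concepts; the sense IDs, glosses, and data layout are invented and do not reflect the authors' actual pipeline.

```python
from itertools import count

# Invented sense IDs and glosses standing in for harvested Wiktionary data.
senses = {
    "boat/1": {"gloss": "a small vessel for travel on water",
               "synonyms": {"watercraft/1"}},
    "watercraft/1": {"gloss": "a vehicle that travels on water",
                     "synonyms": {"boat/1"}},
    "bank/1": {"gloss": "an institution that handles money",
               "synonyms": set()},
}

def ontologize(senses):
    """Merge senses connected by synonymy into one ontological concept."""
    concept_of, concepts, fresh = {}, {}, count(1)
    for sid in senses:
        if sid in concept_of:
            continue
        cid = "concept:%d" % next(fresh)
        stack = [sid]
        while stack:                    # flood-fill the synonym graph
            s = stack.pop()
            if s in concept_of:
                continue
            concept_of[s] = cid
            concepts.setdefault(cid, []).append(s)
            stack.extend(senses.get(s, {"synonyms": set()})["synonyms"])
    return concepts

print(ontologize(senses))
# -> {'concept:1': ['boat/1', 'watercraft/1'], 'concept:2': ['bank/1']}
```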


Author(s): Feten Baccar Ben Amar, Bilel Gargouri, Abdelmajid Ben Hamadou

In this chapter, the authors propose an approach for generating domain ontologies from LMF-standardized dictionaries (ISO 24613). First, it systematically derives the core of the target ontology from the explicit information in the LMF dictionary structure. Second, it enriches this core by exploiting textual sources with guided semantic fields available in the definitions and examples of lexical entries. The originality of this work lies not only in the use of a single, finely structured source containing multi-domain lexical knowledge at the morphological, syntactic, and semantic levels, which lends itself to ontological interpretation, but also in providing ontological elements with linguistic grounding. The approach also addresses the quality issue, which is of major importance in ontology engineering: a validation stage is integrated alongside the extraction modules to maintain the consistency of the generated ontologies. Furthermore, the approach was applied to a case study in the field of astronomy, with experiments carried out on the Arabic language. This choice is explained both by the great shortage of work on Arabic ontology development and by the availability of an LMF-standardized Arabic dictionary within the research team.
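A minimal sketch of the first step, under the simplifying assumption that every sense of a domain-flagged entry yields one candidate class (real LMF encodes these values as feat elements rather than attributes, so the fragment below is illustrative only):

```python
import xml.etree.ElementTree as ET

# Simplified LMF fragment; the attribute form keeps the sketch short.
LMF_SAMPLE = """
<LexicalResource>
  <Lexicon>
    <LexicalEntry>
      <Lemma writtenForm="star"/>
      <Sense id="star_astro">
        <Definition text="a luminous celestial body" domain="astronomy"/>
      </Sense>
    </LexicalEntry>
  </Lexicon>
</LexicalResource>
"""

def derive_core(lmf_xml, domain):
    """One candidate class per in-domain sense, with its gloss kept as
    linguistic grounding (e.g. an rdfs:comment on the generated class)."""
    root = ET.fromstring(lmf_xml)
    classes = []
    for entry in root.iter("LexicalEntry"):
        lemma = entry.find("Lemma").get("writtenForm")
        for sense in entry.iter("Sense"):
            definition = sense.find("Definition")
            if definition is not None and definition.get("domain") == domain:
                classes.append((sense.get("id"), lemma,
                                definition.get("text")))
    return classes

print(derive_core(LMF_SAMPLE, "astronomy"))
# -> [('star_astro', 'star', 'a luminous celestial body')]
```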


Author(s): Ernesto William De Luca

In this chapter, the author presents his approach to aggregating and maintaining multilingual Linked Data. He describes lexical resources and lexical Linked Data, presenting a hybridization that ports the largest lexical resource, EuroWordNet, to the Linked Open Data cloud and interlinks it with other lexical resources. He also presents LexiRes, an RDF/OWL tool that makes it possible to navigate this lexical information, helping authors of existing lexical resources delete or restructure concepts using automatic merging methods. The chapter concludes with a discussion of personalizing information according to user preferences, filtering relevant information while taking the user's multilingual background into account.
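As an illustrative sketch of the interlinking idea (not the LexiRes implementation), the snippet below publishes a multilingual synset as a SKOS concept and links it to a counterpart in another resource; all IRIs are example-only assumptions.

```python
from rdflib import Graph, Literal, Namespace
from rdflib.namespace import RDF, SKOS

# Example-only IRIs; real EuroWordNet/WordNet identifiers differ.
EWN = Namespace("http://example.org/eurowordnet/")
PWN = Namespace("http://example.org/wordnet/")

g = Graph()
g.bind("skos", SKOS)

# Publish a synset as a SKOS concept with labels in two languages ...
g.add((EWN["synset-dog-n"], RDF.type, SKOS.Concept))
g.add((EWN["synset-dog-n"], SKOS.prefLabel, Literal("dog", lang="en")))
g.add((EWN["synset-dog-n"], SKOS.prefLabel, Literal("Hund", lang="de")))

# ... and interlink it with the matching concept in another resource;
# such links are what make cross-resource navigation and restructuring
# tools possible.
g.add((EWN["synset-dog-n"], SKOS.exactMatch, PWN["synset-02084071-n"]))

print(g.serialize(format="turtle"))
```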


Author(s): Davide Eynard, Matteo Matteucci, Fabio Marfia

Ontologies are the basic building block of modern knowledge-based systems; however, the effort and expertise required to develop them often prevent their widespread adoption. In this chapter, the authors present a tool for the automatic discovery of basic ontologies, which they call seed ontologies, starting from a corpus of documents related to a specific domain of knowledge. These seed ontologies are not meant for direct use; rather, they bootstrap the knowledge acquisition process by providing a selection of relevant terms and fundamental relationships. The tool is modular and allows the integration of different methods and strategies for indexing the corpus, selecting relevant terms, and discovering hierarchies and other relationships among terms. Like any induction process, ontology learning from text is prone to errors, so the authors do not expect a 100% correct ontology; according to their evaluation, the result is closer to 80%, but this should be enough for a domain expert to complete the work with limited effort and in a short time.
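A toy sketch of two such pluggable steps, with an invented corpus and thresholds: candidate terms are selected by corpus frequency, and is-a links are guessed with a lexical-head heuristic ("neural network" is-a "network"). This illustrates the idea, not the authors' tool.

```python
import re
from collections import Counter

STOP = {"a", "an", "the", "is", "of", "for"}

corpus = [
    "a neural network is a network of artificial neurons",
    "a convolutional neural network is a neural network for images",
]

def candidate_terms(corpus, max_len=3):
    """Count stopword-free n-grams as candidate domain terms."""
    counts = Counter()
    for doc in corpus:
        tokens = re.findall(r"[a-z]+", doc.lower())
        for n in range(1, max_len + 1):
            for i in range(len(tokens) - n + 1):
                ngram = tokens[i:i + n]
                if not set(ngram) & STOP:
                    counts[" ".join(ngram)] += 1
    return counts

def head_hierarchy(terms):
    """Guess is-a links from lexical heads: 'neural network' is-a 'network'."""
    links = []
    for term in terms:
        words = term.split()
        for k in range(1, len(words)):
            head = " ".join(words[k:])
            if head in terms:
                links.append((term, head))
                break
    return links

terms = {t for t, c in candidate_terms(corpus).items() if c >= 2}
print(head_hierarchy(terms))   # -> [('neural network', 'network')]
```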


Author(s): Silvana Hartmann, György Szarvas, Iryna Gurevych

Collecting the specialized vocabulary of a particular domain (its terminology) is an important initial step in creating formalized domain knowledge representations (ontologies). Terminology extraction (TE) aims to automate this process by collecting the relevant domain vocabulary from existing lexical resources or collections of domain texts. In this chapter, the authors address the extraction of multiword terminology, as multiword terms are very frequent in terminology but typically poorly represented in standard lexical resources. They present their method for mining multiword terminology from Wikipedia, along with the freely available terminology resource they extracted using it. Terminology extraction based on Wikipedia exploits the advantages of a huge, multilingual, domain-transcending knowledge source whose large-scale structural information can identify potential multiword units without the need for linguistic processing tools. Thus, although evaluated on English, the proposed method is in principle applicable to all languages in Wikipedia.
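The structural cue can be illustrated with a short sketch: wiki link markup already segments multiword units, so candidates can be mined with plain string processing. The wikitext below is invented, and the real method also exploits titles, categories, and cross-language links at Wikipedia scale.

```python
import re

# Invented wikitext pages standing in for a Wikipedia dump.
pages = {
    "Supernova": "A [[supernova]] is a powerful [[stellar explosion]] "
                 "observed with a [[space telescope]].",
    "Planet": "A [[planet]] orbits a star; see [[planetary system]].",
}

def multiword_terms(pages):
    """Link anchors already segment multiword units: no parser required."""
    terms = set()
    for text in pages.values():
        for anchor in re.findall(r"\[\[([^\]|#]+)", text):
            anchor = anchor.strip().lower()
            if " " in anchor:               # keep only multiword anchors
                terms.add(anchor)
    return terms

print(sorted(multiword_terms(pages)))
# -> ['planetary system', 'space telescope', 'stellar explosion']
```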


Author(s): Francesca Fallucchi, Fabio Massimo Zanzotto

The authors propose probabilistic models for ontology learning that expand existing ontologies by taking into account both corpus-extracted evidence and the structure of the generated ontologies. The model exploits structural properties of target relations, such as transitivity, during learning. They then propose two extensions of the probabilistic models: a model for learning from a generic domain that can be exploited to extract new information in a specific domain, and an incremental ontology learning system that puts human validation in the learning loop. The latter provides a graphical user interface and a human-computer interaction workflow supporting the incremental learning loop.
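The role of transitivity can be sketched as follows (a simplified stand-in, not the authors' probabilistic model; all scores and the 0.5 threshold are invented): weak direct evidence for isa(a, c) is boosted when a likely two-step path a -> b -> c exists.

```python
# Corpus-extracted confidences that the isa relation holds; values invented.
evidence = {
    ("dog", "mammal"): 0.9,
    ("mammal", "animal"): 0.8,
    ("dog", "animal"): 0.3,   # weak direct evidence ...
}

def transitive_boost(evidence):
    """If isa(a,b) and isa(b,c) are likely, raise belief in isa(a,c)."""
    scores = dict(evidence)
    for (a, b1), p1 in evidence.items():
        for (b2, c), p2 in evidence.items():
            if b1 == b2 and a != c:
                key = (a, c)
                scores[key] = max(scores.get(key, 0.0), p1 * p2)
    return scores

accepted = {pair for pair, p in transitive_boost(evidence).items() if p > 0.5}
print(sorted(accepted))
# ('dog', 'animal') is rescued by the dog -> mammal -> animal path (0.72)
```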


Author(s): Ivan Bedini, Benjamin Nguyen, Christopher Matheus, Peter F. Patel-Schneider, Aidan Boran

One of the promises of the Semantic Web is to support applications that easily and seamlessly deal with heterogeneous data. Most data on the Web, however, is in the Extensible Markup Language (XML) format, and using XML requires applications to understand the format of each data source they access. Achieving the benefits of the Semantic Web involves transforming XML into the Semantic Web languages OWL (the Web Ontology Language) and RDF (the Resource Description Framework), a process that generally has manual or only semi-automatic components. In this chapter, the authors present a set of patterns that enable the automatic transformation of XML Schema into RDF and OWL, allowing much XML data to be used directly in the Semantic Web. They focus on a possible logical representation of XML Schema and present an implementation, including a comparison with related work.
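One classic pattern of this family, sketched below under illustrative names: an xsd:complexType becomes an owl:Class and its child elements become properties with that class as domain. The chapter's pattern catalogue is broader than this single rule.

```python
import xml.etree.ElementTree as ET
from rdflib import Graph, Namespace
from rdflib.namespace import OWL, RDF, RDFS

XS = "{http://www.w3.org/2001/XMLSchema}"
EX = Namespace("http://example.org/schema#")   # illustrative target IRI

XSD_SAMPLE = """
<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:complexType name="Book">
    <xs:sequence>
      <xs:element name="title" type="xs:string"/>
      <xs:element name="author" type="xs:string"/>
    </xs:sequence>
  </xs:complexType>
</xs:schema>
"""

g = Graph()
for ctype in ET.fromstring(XSD_SAMPLE).iter(XS + "complexType"):
    cls = EX[ctype.get("name")]
    g.add((cls, RDF.type, OWL.Class))           # complexType -> owl:Class
    for elem in ctype.iter(XS + "element"):
        prop = EX[elem.get("name")]
        g.add((prop, RDF.type, OWL.DatatypeProperty))
        g.add((prop, RDFS.domain, cls))         # child element -> property

print(g.serialize(format="turtle"))
```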


Author(s): Elias Iosif, Georgios Petasis, Vangelis Karkaletsis

The authors present an ontology-based information extraction process that operates in a bootstrapping framework. The novelty of this approach lies in continuously extracting semantics from textual content in order to evolve the underlying ontology, while the evolved ontology in turn enhances the information extraction mechanism. This process was implemented in the context of the R&D project BOEMIE, and the BOEMIE system was evaluated on the athletics domain.
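The bootstrapping loop can be sketched schematically (this is not the BOEMIE system; the documents, extraction pattern, and fixed-point criterion are invented for illustration):

```python
import re

docs = [
    "Usain Bolt won the 100m final.",
    "The 100m final was won by Usain Bolt; Justin Gatlin was second.",
    "Justin Gatlin later raced against Asafa Powell.",
]

ontology = {"athletes": {"Usain Bolt"}}   # seed instance

def extract(docs, known):
    """Find new candidate names in documents that mention known athletes."""
    found = set()
    for doc in docs:
        if any(name in doc for name in known):
            # Hypothetical pattern: capitalized bigrams as candidate names.
            found |= set(re.findall(r"[A-Z][a-z]+ [A-Z][a-z]+", doc))
    return found - known

while True:                                # iterate to a fixed point
    new = extract(docs, ontology["athletes"])
    if not new:
        break
    ontology["athletes"] |= new            # the ontology evolution step

print(sorted(ontology["athletes"]))
# -> ['Asafa Powell', 'Justin Gatlin', 'Usain Bolt']
```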

