A Semantic Knowledge-Based Framework for Information Extraction and Exploration

2021 ◽  
Vol 13 (2) ◽  
pp. 85-109
Author(s):  
Abduladem Aljamel ◽  
Taha Osman ◽  
Dhavalkumar Thakker

The availability of online documents that describe domain-specific information provides an opportunity in employing a knowledge-based approach in extracting information from web data. This research proposes a novel comprehensive semantic knowledge-based framework that helps to transform unstructured data to be easily exploited by data scientists. The resultant sematic knowledgebase is reasoned to infer new facts and classify events that might be of importance to end users. The target use case for the framework implementation was the financial domain, which represents an important class of dynamic applications that require the modelling of non-binary relations. Such complex relations are becoming increasingly common in the era of linked open data. This research in modelling and reasoning upon such relations is a further contribution of the proposed semantic framework, where non-binary relations are semantically modelled by adapting the semantic reasoning axioms to fit the intermediate resources in the N-ary relations requirements.

Author(s):  
Jose María Alvarez Rodríguez ◽  
José Emilio Labra Gayo ◽  
Patricia Ordoñez de Pablos

The aim of this chapter is to present a proposal and a case study to describe the information about organizations in a standard way using the Linked Data approach. Several models and ontologies have been provided in order to formalize the data, structure and behaviour of organizations. Nevertheless, these tries have not been fully accepted due to some factors: (1) missing pieces to define the status of the organization; (2) tangled parts to specify the structure (concepts and relations) between the elements of the organization; 3) lack of text properties, and other factors. These divergences imply a set of incomplete approaches to formalize data and information about organizations. Taking into account the current trends of applying semantic web technologies and linked data to formalize, aggregate, and share domain specific information, a new model for organizations taking advantage of these initiatives is required in order to overcome existing barriers and exploit the corporate information in a standard way. This work is especially relevant in some senses to: (1) unify existing models to provide a common specification; (2) apply semantic web technologies and the Linked Data approach; (3) provide access to the information via standard protocols, and (4) offer new services that can exploit this information to trace the evolution and behaviour of the organization over time. Finally, this work is interesting to improve the clarity and transparency of some scenarios in which organizations play a key role, like e-procurement, e-health, or financial transactions.


Author(s):  
Poonam Jatwani ◽  
Pradeep Tomar ◽  
Vandana Dhingra

Web documents display information in the form of natural language text which is not understandable by machines. To search specific information from sea of web documents has become very challenging as it shows many unwanted non relevant documents along with relevant documents. To retrieve relevant information semantic knowledge can be stored in the domain specific ontology which helps in understanding user’s need to retrieve relevant information. Intensive research has been going on in the field of text processing to develop ontologies using NLP technique. The proposed technique is another effort in this direction. In this method to extract syntactic structure we have used Stanford parser which complete tokenization of text, parsing as well as morphological analysis. Semantic rules are defined manually to identify valid concepts and relation among them. Once concepts, properties and relationship among concepts are identified, extracted information is visualized in the form of ontology.


2013 ◽  
Vol 07 (04) ◽  
pp. 427-453 ◽  
Author(s):  
SHIMA DASTGHEIB ◽  
ARSHAM MESBAH ◽  
KRYS KOCHUT

Domain-specific ontologies have become integral components of numerous semantic- and knowledge-based applications. However, creating such ontologies and populating them with correct individuals is a difficult and time-consuming process. Recently, a vast amount of knowledge has become available as part of the Linked Open Data (LOD) project, which includes data sets in multiple areas. In this paper, we present mOntage, a novel ontology design and population framework, which allows a domain expert to easily define a domain ontology schema and specify the ontology's classes and properties in terms of the subsets of the existing LOD data sources. The classes and properties of the ontology being created can be defined either directly, in terms of existing LOD-available classes and properties, or can be newly constructed by the domain expert. The definitions, called maps, are encoded as part of the ontology itself, effectively converting it into a self-populating ontology. Finally, a dedicated software system automatically populates the ontology with instances obtained from the selected LOD sources by executing suitable SPARQL queries. We illustrate our framework by creating Cancer Treatment ontology in the area of biomedicine.


2019 ◽  
Vol 9 (14) ◽  
pp. 2852
Author(s):  
Malik Nabeel Ahmed Awan ◽  
Sharifullah Khan ◽  
Khalid Latif ◽  
Asad Masood Khattak

In modern society, people are heavily reliant on information available online through various channels, such as websites, social media, and web portals. Examples include searching for product prices, news, weather, and jobs. This paper focuses on an area of information extraction in e-recruitment, or job searching, which is increasingly used by a large population of users in across the world. Given the enormous volume of information related to job descriptions and users’ profiles, it is complicated to appropriately match a user’s profile with a job description, and vice versa. Existing information extraction techniques are unable to extract contextual entities. Thus, they fall short of extracting domain-specific information entities and consequently affect the matching of the user profile with the job description. The work presented in this paper aims to extract entities from job descriptions using a domain-specific dictionary. The extracted information entities are enriched with knowledge using Linked Open Data. Furthermore, job context information is expanded using a job description domain ontology based on the contextual and knowledge information. The proposed approach appropriately matches users’ profiles/queries and job descriptions. The proposed approach is tested using various experiments on data from real life jobs’ portals. The results show that the proposed approach enriches extracted data from job descriptions, and can help users to find more relevant jobs.


Procedia CIRP ◽  
2021 ◽  
Vol 97 ◽  
pp. 373-378
Author(s):  
Sharath Chandra Akkaladevi ◽  
Matthias Plasch ◽  
Michael Hofmann ◽  
Andreas Pichler

Land ◽  
2021 ◽  
Vol 10 (1) ◽  
pp. 33
Author(s):  
Palmyra Repette ◽  
Jamile Sabatini-Marques ◽  
Tan Yigitcanlar ◽  
Denilson Sell ◽  
Eduardo Costa

Since the advent of the second digital revolution, the exponential advancement of technology is shaping a world with new social, economic, political, technological, and legal circumstances. The consequential disruptions force governments and societies to seek ways for their cities to become more humane, ethical, inclusive, intelligent, and sustainable. In recent years, the concept of City-as-a-Platform was coined with the hope of providing an innovative approach for addressing the aforementioned disruptions. Today, this concept is rapidly gaining popularity, as more and more platform thinking applications become available to the city context—so-called platform urbanism. These platforms used for identifying and addressing various urbanization problems with the assistance of open data, participatory innovation opportunity, and collective knowledge. With these developments in mind, this study aims to tackle the question of “How can platform urbanism support local governance efforts in the development of smarter cities?” Through an integrative review of journal articles published during the last decade, the evolution of City-as-a-Platform was analyzed. The findings revealed the prospects and constraints for the realization of transformative and disruptive impacts on the government and society through the platform urbanism, along with disclosing the opportunities and challenges for smarter urban development governance with collective knowledge through platform urbanism.


Author(s):  
Beniamino Di Martino ◽  
Dario Branco ◽  
Luigi Colucci Cante ◽  
Salvatore Venticinque ◽  
Reinhard Scholten ◽  
...  

AbstractThis paper proposes a semantic framework for Business Model evaluation and its application to a real case study in the context of smart energy and sustainable mobility. It presents an ontology based representation of an original business model and examples of inferential rules for knowledge extraction and automatic population of the ontology. The real case study belongs to the GreenCharge European Project, that in these last years is proposing some original business models to promote sustainable e-mobility plans. An original OWL Ontology contains all relevant Business Model concepts referring to GreenCharge’s domain, including a semantic description of TestCards, survey results and inferential rules.


Author(s):  
Yufei Li ◽  
Xiaoyong Ma ◽  
Xiangyu Zhou ◽  
Pengzhen Cheng ◽  
Kai He ◽  
...  

Abstract Motivation Bio-entity Coreference Resolution focuses on identifying the coreferential links in biomedical texts, which is crucial to complete bio-events’ attributes and interconnect events into bio-networks. Previously, as one of the most powerful tools, deep neural network-based general domain systems are applied to the biomedical domain with domain-specific information integration. However, such methods may raise much noise due to its insufficiency of combining context and complex domain-specific information. Results In this paper, we explore how to leverage the external knowledge base in a fine-grained way to better resolve coreference by introducing a knowledge-enhanced Long Short Term Memory network (LSTM), which is more flexible to encode the knowledge information inside the LSTM. Moreover, we further propose a knowledge attention module to extract informative knowledge effectively based on contexts. The experimental results on the BioNLP and CRAFT datasets achieve state-of-the-art performance, with a gain of 7.5 F1 on BioNLP and 10.6 F1 on CRAFT. Additional experiments also demonstrate superior performance on the cross-sentence coreferences. Supplementary information Supplementary data are available at Bioinformatics online.


2004 ◽  
Vol 02 (01) ◽  
pp. 215-239 ◽  
Author(s):  
TOLGA CAN ◽  
YUAN-FANG WANG

We present a new method for conducting protein structure similarity searches, which improves on the efficiency of some existing techniques. Our method is grounded in the theory of differential geometry on 3D space curve matching. We generate shape signatures for proteins that are invariant, localized, robust, compact, and biologically meaningful. The invariancy of the shape signatures allows us to improve similarity searching efficiency by adopting a hierarchical coarse-to-fine strategy. We index the shape signatures using an efficient hashing-based technique. With the help of this technique we screen out unlikely candidates and perform detailed pairwise alignments only for a small number of candidates that survive the screening process. Contrary to other hashing based techniques, our technique employs domain specific information (not just geometric information) in constructing the hash key, and hence, is more tuned to the domain of biology. Furthermore, the invariancy, localization, and compactness of the shape signatures allow us to utilize a well-known local sequence alignment algorithm for aligning two protein structures. One measure of the efficacy of the proposed technique is that we were able to perform structure alignment queries 36 times faster (on the average) than a well-known method while keeping the quality of the query results at an approximately similar level.


Sign in / Sign up

Export Citation Format

Share Document