Application of Semantic Web Technologies for creating Linked Open Data in Information Retrieval

2018 ◽  
Vol 56 (1) ◽  
pp. 55
Author(s):  
Nirupama R. Warrier ◽  
J Shivarama

Author(s):  
Cecilia Avila-Garzon

Advances in semantic web technologies have sharply increased the volume of linked data published on the web. In this regard, linked open data (LOD) has long been a topic of great interest in a wide range of fields (e.g. open government, business, culture, and education). This article reports the results of a systematic literature review on LOD. A total of 250 articles were reviewed to provide a general overview of the current applications, technologies, and methodologies for LOD. The main findings include: i) most of the studies conducted so far focus on the use of semantic web technologies and tools applied to contexts such as biology, social sciences, libraries, research, and education; ii) there is a lack of research on a standardized methodology for managing LOD; and iii) plenty of tools can be used for managing LOD, but most of them lack user-friendly interfaces for querying datasets.


Semantic Web ◽  
2021 ◽  
pp. 1-21
Author(s):  
Gustavo Candela ◽  
Pilar Escobar ◽  
María Dolores Sáez ◽  
Manuel Marco-Such

Cultural heritage institutions are exploring Semantic Web technologies to publish and enrich their catalogues. Several initiatives, such as Labs, are based on the creative and innovative reuse of the materials published by cultural heritage institutions. In this way, quality has become a crucial aspect to identify and reuse a dataset for research. In this article, we propose a methodology to create Shape Expressions definitions in order to validate LOD datasets published by libraries. The methodology was then applied to four use cases based on datasets published by relevant institutions. It intends to encourage institutions to use ShEx to validate LOD datasets as well as to promote the reuse of LOD, made openly available by libraries.


Author(s):  
Juan-Antonio PASTOR-SÁNCHEZ

This paper shows how the ISO-25964 standard has been applied to represent the UNESCO thesaurus through semantic web technologies. Building on the work done by the UNESKOS project, it analyzes the joint use of the SKOS and ISO-THES ontologies to represent thesauri according to the data model of the ISO-25964 standard. The result is an RDF dataset, accessible as Linked Open Data, that uses SKOS and ISO-THES together with the properties of a vocabulary developed in the context of the UNESKOS project. The conclusions point, among other aspects, to the need for a review of SKOS and for the adoption of appropriate technologies that facilitate future work and research lines focused on the alignment of vocabularies.
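
As a rough illustration of what representing a thesaurus concept in RDF with SKOS can look like, the sketch below serializes one concept as N-Triples. It is not taken from the paper: the `http://example.org/thesaurus/` namespace, the concept identifiers, and the labels are hypothetical, and only core SKOS terms (`skos:Concept`, `skos:prefLabel`, `skos:broader`) are used, not ISO-THES.

```python
# Minimal sketch: emit N-Triples for one thesaurus concept using core SKOS terms.
# BASE, the concept ids, and the labels are hypothetical placeholders.
SKOS = "http://www.w3.org/2004/02/skos/core#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
BASE = "http://example.org/thesaurus/"


def iri(value):
    """Wrap an IRI in angle brackets, as N-Triples requires."""
    return f"<{value}>"


def literal(text, lang):
    """Build a language-tagged literal, escaping backslashes and quotes."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"@{lang}'


def concept_triples(concept_id, labels, broader=None):
    """Return N-Triples lines describing one SKOS concept."""
    subject = iri(BASE + concept_id)
    lines = [f"{subject} {iri(RDF_TYPE)} {iri(SKOS + 'Concept')} ."]
    # One preferred label per language, in a stable order.
    for lang, text in sorted(labels.items()):
        lines.append(f"{subject} {iri(SKOS + 'prefLabel')} {literal(text, lang)} .")
    # Optional hierarchical link to a broader concept.
    if broader is not None:
        lines.append(f"{subject} {iri(SKOS + 'broader')} {iri(BASE + broader)} .")
    return lines


triples = concept_triples(
    "concept2", {"en": "Libraries", "es": "Bibliotecas"}, broader="concept1"
)
print("\n".join(triples))
```

A multilingual thesaurus maps naturally onto this pattern, since each language contributes its own `skos:prefLabel` literal for the same concept IRI.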


Author(s):  
Jose María Alvarez Rodríguez ◽  
José Emilio Labra Gayo ◽  
Patricia Ordoñez de Pablos

The aim of this chapter is to present a proposal and a case study for describing information about organizations in a standard way using the Linked Data approach. Several models and ontologies have been proposed to formalize the data, structure, and behaviour of organizations. Nevertheless, these attempts have not been fully accepted due to several factors: (1) missing pieces to define the status of the organization; (2) tangled parts to specify the structure (concepts and relations) between the elements of the organization; (3) lack of text properties; and other factors. These divergences result in a set of incomplete approaches to formalizing data and information about organizations. Given the current trend of applying semantic web technologies and linked data to formalize, aggregate, and share domain-specific information, a new model for organizations that takes advantage of these initiatives is required in order to overcome existing barriers and exploit corporate information in a standard way. This work is especially relevant because it aims to: (1) unify existing models to provide a common specification; (2) apply semantic web technologies and the Linked Data approach; (3) provide access to the information via standard protocols; and (4) offer new services that can exploit this information to trace the evolution and behaviour of the organization over time. Finally, this work can help improve the clarity and transparency of scenarios in which organizations play a key role, such as e-procurement, e-health, or financial transactions.


Author(s):  
Mounira Chkiwa ◽  
Anis Jedidi ◽  
Faiez Gargouri

In this paper, the authors present an overall description of their information retrieval system, which combines the Semantic Web and fuzzy logic in order to benefit from the advantages of both in the information retrieval domain. The system is designed for children; for this reason, the semantic/fuzzy combination must operate in the background of the information retrieval process, because such users cannot be expected to handle semantic web technologies or fuzzy logic commands. The authors present the different services offered by their system and how they use the Semantic Web and fuzzy logic to develop them. Evaluation tests of the system using universal measures clearly show its efficiency.


2020 ◽  
Vol 1 (1) ◽  
pp. 428-444 ◽  
Author(s):  
Silvio Peroni ◽  
David Shotton

OpenCitations is an infrastructure organization for open scholarship dedicated to the publication of open citation data as Linked Open Data using Semantic Web technologies, thereby providing a disruptive alternative to traditional proprietary citation indexes. Open citation data are valuable for bibliometric analysis, increasing the reproducibility of large-scale analyses by enabling publication of the source data. Following brief introductions to the development and benefits of open scholarship and to Semantic Web technologies, this paper describes OpenCitations and its data sets, tools, services, and activities. These include the OpenCitations Data Model; the SPAR (Semantic Publishing and Referencing) Ontologies; OpenCitations’ open software of generic applicability for searching, browsing, and providing REST APIs over resource description framework (RDF) triplestores; Open Citation Identifiers (OCIs) and the OpenCitations OCI Resolution Service; the OpenCitations Corpus (OCC), a database of open downloadable bibliographic and citation data made available in RDF under a Creative Commons public domain dedication; and the OpenCitations Indexes of open citation data, of which the first and largest is COCI, the OpenCitations Index of Crossref Open DOI-to-DOI Citations, which currently contains over 624 million bibliographic citations and is receiving considerable usage by the scholarly community.


2019 ◽  
Vol 19 (01) ◽  
pp. e05
Author(s):  
Marcos Daniel Zarate ◽  
Carlos Buckle ◽  
Renato Mazzanti ◽  
Gustavo Samec

Scientific publication services are changing drastically: researchers demand intelligent search services to discover and relate scientific publications. Publishers need to incorporate semantic information to better organize their digital assets and make publications more discoverable. In this paper, we present ongoing work to publish a subset of the scientific publications of CONICET Digital as Linked Open Data. The objective of this work is to improve the retrieval and reuse of data through Semantic Web technologies and Linked Data in the domain of scientific publications. To achieve these goals, Semantic Web standards and reference RDF schemas have been taken into account (Dublin Core, FOAF, VoID, etc.). The conversion and publication process is guided by the methodological guidelines for publishing government linked data. We also outline how these data can be linked to other datasets (DBLP, Wikidata, and DBpedia) on the web of data. Finally, we show some examples of queries that answer questions that CONICET Digital did not originally support.
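
To make the idea of such queries concrete, here is a minimal sketch, assuming a tiny in-memory set of triples described with Dublin Core and FOAF properties. The `http://example.org/` resources, titles, and names are hypothetical and are not CONICET Digital data; the plain-Python join stands in for the kind of question a graph query language would express, and the paper's own queries are not reproduced here.

```python
# Minimal sketch: publications as (subject, predicate, object) triples and a
# simple pattern-matching join. All example.org resources are hypothetical.
DCT = "http://purl.org/dc/terms/"
FOAF = "http://xmlns.com/foaf/0.1/"
EX = "http://example.org/"

triples = {
    (EX + "pub/1", DCT + "title", "Paper A"),
    (EX + "pub/1", DCT + "creator", EX + "person/1"),
    (EX + "pub/2", DCT + "title", "Paper B"),
    (EX + "pub/2", DCT + "creator", EX + "person/1"),
    (EX + "pub/2", DCT + "creator", EX + "person/2"),
    (EX + "person/1", FOAF + "name", "Author One"),
    (EX + "person/2", FOAF + "name", "Author Two"),
}


def match(s=None, p=None, o=None):
    """Return all triples matching the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


def titles_by_author(name):
    """Join name -> person -> publication -> title, returning sorted titles."""
    titles = []
    for person, _, _ in match(p=FOAF + "name", o=name):
        for pub, _, _ in match(p=DCT + "creator", o=person):
            titles.extend(title for _, _, title in match(s=pub, p=DCT + "title"))
    return sorted(titles)


print(titles_by_author("Author One"))
```

Because authors are resources rather than strings, the same person IRI can later be linked to an external dataset without changing the publication records.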


Author(s):  
Torsten Priebe

The goal of this chapter is to show how Semantic Web technologies can help build integrative enterprise knowledge portals. Three main areas are identified: content management and metadata, global searching, and the integration of external content and applications. For these three areas, the state of the art as well as current research results are discussed. In particular, a metadata-based information retrieval approach and a context-based portlet integration approach are presented. These have been implemented in a research prototype, which is introduced in the Internet session at the end of the chapter.


2020 ◽  
pp. 17-33
Author(s):  
Olexander V. Palagin ◽  
Mykola G. Petrenko

Introduction. Numerous applications and tools now implement information retrieval technologies over various text sources in accordance with specified parameters. However, the search results are provided to the user for each search parameter individually and are not related to each other. The application of Semantic Web technologies for multi-parameter, related information retrieval across various sources is at an initial stage of development in Ukraine. A separate problem is the multimedia presentation of search results and their comparison with the conceptual structure of the domain of interest (Knowledge Domain) with the goal of extracting new knowledge. From this point of view, it is relevant for scientific research to process the scientific publications of a single author, of the authors of a scientific unit, and of an academic institute as a whole, using Semantic Web technologies, multimedia presentation of information, and effective support for the process of extracting new knowledge. Purpose. To design the architecture and functioning algorithms of an instrumental complex for processing databases of scientific publications, and to develop examples of using a formal description of a scientific article with a number of queries. Methods. The methods and models used in this work are based on Semantic Web information technologies focused on the development and use of subject ontologies. Ontologies are the basic components of these technologies, both for conducting scientific research and for creating large databases, including databases of the authors' scientific publications. Results. The architecture of the instrumental complex for processing databases of scientific publications and the algorithms for its functioning at the preparatory and main stages have been developed. Examples of queries to the database of scientific publications that demonstrate the performance of IR are given.
Conclusion. The article discusses the architecture of the instrumental complex for processing databases of scientific publications and the algorithms for its functioning at the preparatory and main stages. The steps of the preparatory phase, which are carried out by the knowledge engineer, are examined in detail. The creation of two ontology models of the scientific article, presented with their corresponding ontographs, is highlighted: the CRF-model describes the concepts contained in the article, and the OWL-model describes the structural components of the article. Finally, examples of queries to the databases of scientific publications are presented, demonstrating the performance of the instrumental complex. Further work should expand the use, in the development of IR, of technologies such as cognitive semantics and graphics and multimedia presentation of information, focused on the effective support of the processes of extraction and/or generation of new knowledge.

