Modern Users of Libraries and the Linked Open Data Environment

Olga A. Lavrenova; Andrey A. Vinberg

doi:10.25281/0869-608x-2020-69-3-243-260

Modern Users of Libraries and the Linked Open Data Environment

Bibliotekovedenie [Library and Information Science (Russia)] ◽

10.25281/0869-608x-2020-69-3-243-260 ◽

2020 ◽

Vol 69 (3) ◽

pp. 243-260

Author(s):

Olga A. Lavrenova ◽

Andrey A. Vinberg

Keyword(s):

Classification System ◽

Query Language ◽

Open Data ◽

Basic Research ◽

Linked Open Data ◽

Knowledge Organization ◽

Global Network ◽

Knowledge Organization System ◽

Description Framework ◽

State Library

The goal of any library is to ensure high quality and general availability of information retrieval tools. The paper describes the project implemented by the Russian State Library (RSL) to present Library Bibliographic Classification as a Networked Knowledge Organization System. The project goal is to support content and provide tools for ensuring system’s interoperability with other resources of the same nature (i.e. with Linked Data Vocabularies) in the global network environment. The project was partially supported by the Russian Foundation for Basic Research (RFBR).The RSL General Classified Catalogue (GCC) was selected as the main data source for the Classification system of knowledge organization. The meaning of each classification number is expressed by complete string of wordings (captions), rather than the last level caption alone. Data converted to the Resource Description Framework (RDF) files based on the standard set of properties defined in the Simple Knowledge Organization System (SKOS) model was loaded into the semantic storage for subsequent data processing using the SPARQL query language. In order to enrich user queries for search of resources, the RSL has published its Classification System in the form of Linked Open Data (https://lod.rsl.ru) for searching in the RSL electronic catalogue. Currently, the work is underway to enable its smooth integration with other LOD vocabularies. The SKOS mapping tags are used to differentiate the types of connections between SKOS elements (concepts) existing in different concept schemes, for example, UDC, MeSH, authority data.The conceptual schemes of the leading classifications are fundamentally different from each other. Establishing correspondence between concepts is possible only on the basis of lexical and structural analysis to compute the concept similarity as a combination of attributes.The authors are looking forward to working with libraries in Russia and other countries to create a common space of Linked Open Data vocabularies.

Library Bibliographic Classification as a traditional knowledge organization system in the linked open data environment

Scientific and Technical Libraries ◽

10.33186/1027-3689-2017-4-44-60 ◽

2017 ◽

pp. 44-60 ◽

Cited By ~ 2

Author(s):

Olga Lavrenova ◽

Vasili Pavlov

Keyword(s):

Data Structure ◽

Semantic Web ◽

Traditional Knowledge ◽

Open Data ◽

Linked Open Data ◽

Knowledge Organization ◽

Knowledge Model ◽

Library Catalog ◽

Knowledge Organization System ◽

Data Environment

The task of the project is to introduce classification knowledge model as Linked Open Data, LOD, and to provide access to it from Semantic Web through standard network instruments. The reviewed system of RSL systematic catalog (Library Bibliographic Classification) subject divisions is accepted as a source for classification knowledge model The authors examine SKOS-based data structure and the software. Operation principles in terms of the search in e-library catalog and RSL traditional collections are discussed.

Documentary languages and knowledge organization systems in the context of the semantic web

Transinformação ◽

10.1590/s0103-37862013000200005 ◽

2013 ◽

Vol 25 (2) ◽

pp. 145-150 ◽

Cited By ~ 1

Author(s):

Marilda Lopes Ginez de Lara

Keyword(s):

Semantic Web ◽

Open Data ◽

Linked Open Data ◽

Knowledge Organization ◽

Initial Condition ◽

Web Based ◽

Simple Knowledge Organization System ◽

Knowledge Organization Systems ◽

Knowledge Organization System ◽

Conceptual Problems

The aim of this study was to discuss the need for formal documentary languages as a condition for it to function in the Semantic Web. Based on a bibliographic review, Linked Open Data is presented as an initial condition for the operationalization of the Semantic Web, similar to the movement of Linked Open Vocabularies that aimed to promote interoperability among vocabularies. We highlight the Simple Knowledge Organization System format by analyzing its main characteristics and presenting the new standard ISO 25964-1/2:2011/2012 -Thesauri and interoperability with other vocabularies, that revises previous recommendations, adding requirements for the interoperability and mapping of vocabularies. We discuss conceptual problems in the formalization of vocabularies and the need to invest critically in its operationalization, suggesting alternatives to harness the mapping of vocabularies.

TOWARDS AN EFFICIENT RDF DATASET SLICING

International Journal of Semantic Computing ◽

10.1142/s1793351x13400151 ◽

2013 ◽

Vol 07 (04) ◽

pp. 455-477 ◽

Cited By ~ 2

Author(s):

EDGARD MARX ◽

TOMMASO SORU ◽

SAEEDEH SHEKARPOUR ◽

SÖREN AUER ◽

AXEL-CYRILLE NGONGA NGOMO ◽

...

Keyword(s):

Information Needs ◽

Query Language ◽

Open Data ◽

Linked Open Data ◽

Connected Subgraph ◽

Triple Store ◽

Subgraph Pattern ◽

Order Of Magnitude ◽

Efficient Processing ◽

Description Framework

Over the last years, a considerable amount of structured data has been published on the Web as Linked Open Data (LOD). Despite recent advances, consuming and using Linked Open Data within an organization is still a substantial challenge. Many of the LOD datasets are quite large and despite progress in Resource Description Framework (RDF) data management their loading and querying within a triple store is extremely time-consuming and resource-demanding. To overcome this consumption obstacle, we propose a process inspired by the classical Extract-Transform-Load (ETL) paradigm. In this article, we focus particularly on the selection and extraction steps of this process. We devise a fragment of SPARQL Protocol and RDF Query Language (SPARQL) dubbed SliceSPARQL, which enables the selection of well-defined slices of datasets fulfilling typical information needs. SliceSPARQL supports graph patterns for which each connected subgraph pattern involves a maximum of one variable or Internationalized resource identifier (IRI) in its join conditions. This restriction guarantees the efficient processing of the query against a sequential dataset dump stream. Furthermore, we evaluate our slicing approach on three different optimization strategies. Results show that dataset slices can be generated an order of magnitude faster than by using the conventional approach of loading the whole dataset into a triple store.

A Hybrid Approach Combining R*-Tree and k-d Trees to Improve Linked Open Data Query Performance

Applied Sciences ◽

10.3390/app11052405 ◽

2021 ◽

Vol 11 (5) ◽

pp. 2405

Author(s):

Yuxiang Sun ◽

Tianyi Zhao ◽

Seulgi Yoon ◽

Yongju Lee

Keyword(s):

Flash Memory ◽

Query Language ◽

Hybrid Approach ◽

Open Data ◽

Main Memory ◽

Linked Open Data ◽

Index Structure ◽

Identification Algorithm ◽

Distributed Computing Systems ◽

Query Performance

Semantic Web has recently gained traction with the use of Linked Open Data (LOD) on the Web. Although numerous state-of-the-art methodologies, standards, and technologies are applicable to the LOD cloud, many issues persist. Because the LOD cloud is based on graph-based resource description framework (RDF) triples and the SPARQL query language, we cannot directly adopt traditional techniques employed for database management systems or distributed computing systems. This paper addresses how the LOD cloud can be efficiently organized, retrieved, and evaluated. We propose a novel hybrid approach that combines the index and live exploration approaches for improved LOD join query performance. Using a two-step index structure combining a disk-based 3D R*-tree with the extended multidimensional histogram and flash memory-based k-d trees, we can efficiently discover interlinked data distributed across multiple resources. Because this method rapidly prunes numerous false hits, the performance of join query processing is remarkably improved. We also propose a hot-cold segment identification algorithm to identify regions of high interest. The proposed method is compared with existing popular methods on real RDF datasets. Results indicate that our method outperforms the existing methods because it can quickly obtain target results by reducing unnecessary data scanning and reduce the amount of main memory required to load filtering results.

Chapter 1. The Need for Knowledge Organization. Introduction to the book Linking Knowledge: Linked Open Data for Knowledge Organization

10.5771/9783956506611-1 ◽

2021 ◽

pp. 1-23

Author(s):

Andrea Scharnhorst ◽

Richard P. Smiraglia

Keyword(s):

Open Data ◽

Linked Open Data ◽

Knowledge Organization

Using linked open data to enhance the discoverability, functionality and impact of Emblematica Online

Library Hi Tech ◽

10.1108/lht-11-2016-0126 ◽

2017 ◽

Vol 35 (1) ◽

pp. 159-178

Author(s):

Timothy W. Cole ◽

Myung-Ja K. Han ◽

Maria Janina Sarol ◽

Monika Biel ◽

David Maus

Keyword(s):

Open Data ◽

Resource Discovery ◽

Linked Open Data ◽

Content Type ◽

Domain Specific ◽

Special Collections ◽

Online Portal ◽

Emblem Books ◽

Description Framework ◽

Considerable Work

Purpose Early Modern emblem books are primary sources for scholars studying the European Renaissance. Linked Open Data (LOD) is an approach for organizing and modeling information in a data-centric manner compatible with the emerging Semantic Web. The purpose of this paper is to examine ways in which LOD methods can be applied to facilitate emblem resource discovery, better reveal the structure and connectedness of digitized emblem resources, and enhance scholar interactions with digitized emblem resources. Design/methodology/approach This research encompasses an analysis of the existing XML-based Spine (emblem-specific) metadata schema; the design of a new, domain-specific, Resource Description Framework compatible ontology; the mapping and transformation of metadata from Spine to both the new ontology and (separately) to the pre-existing Schema.org ontology; and the (experimental) modification of the Emblematica Online portal as a proof of concept to illustrate enhancements supported by LOD. Findings LOD is viable as an approach for facilitating discovery and enhancing the value to scholars of digitized emblem books; however, metadata must first be enriched with additional uniform resource identifiers and the workflow upgrades required to normalize and transform existing emblem metadata are substantial and still to be fully worked out. Practical implications The research described demonstrates the feasibility of transforming existing, special collections metadata to LOD. Although considerable work and further study will be required, preliminary findings suggest potential benefits of LOD for both users and libraries. Originality/value This research is unique in the context of emblem studies and adds to the emerging body of work examining the application of LOD best practices to library special collections.

How Agricultural Digital Innovation Can Benefit from Semantics: The Case of the AGROVOC Multilingual Thesaurus

Engineering Proceedings ◽

10.3390/engproc2021009017 ◽

2021 ◽

Vol 9 (1) ◽

pp. 17

Author(s):

Esther Mietzsch ◽

Daniel Martini ◽

Kristin Kolshus ◽

Andrea Turbati ◽

Imma Subirats

Keyword(s):

Knowledge Organization ◽

Structural Basis ◽

Uniform Resource Identifier ◽

Food And Agriculture ◽

Food And Agriculture Organization ◽

Knowledge Organization Systems ◽

Knowledge Organization System ◽

Recent Developments ◽

Description Framework ◽

Areas Of Interest

AGROVOC is the multilingual thesaurus managed and published by the Food and Agriculture Organization of the United Nations (FAO). Its content is available in more than 40 languages and covers all the FAO’s areas of interest. The structural basis is a resource description framework (RDF) and simple knowledge organization system (SKOS). More than 39,000 concepts identified by a uniform resource identifier (URI) and 800,000 terms are related through a hierarchical system and aligned to knowledge organization systems. This paper aims to illustrate the recent developments in the context of AGROVOC and to present use cases where it has contributed to enhancing the interoperability of data shared by different information systems.

Evaluating the quality of linked open data in digital libraries

Journal of Information Science ◽

10.1177/0165551520930951 ◽

2020 ◽

pp. 016555152093095

Author(s):

Gustavo Candela ◽

Pilar Escobar ◽

Rafael C Carrasco ◽

Manuel Marco-Such

Keyword(s):

Digital Libraries ◽

Open Data ◽

Quality Measures ◽

Linked Open Data ◽

Data Sets ◽

Design And Implementation ◽

Bibliographic Data ◽

Description Framework ◽

Resource Description

Cultural heritage institutions have recently started to share their metadata as Linked Open Data (LOD) in order to disseminate and enrich them. The publication of large bibliographic data sets as LOD is a challenge that requires the design and implementation of custom methods for the transformation, management, querying and enrichment of the data. In this report, the methodology defined by previous research for the evaluation of the quality of LOD is analysed and adapted to the specific case of Resource Description Framework (RDF) triples containing standard bibliographic information. The specified quality measures are reported in the case of four highly relevant libraries.

Modelo de dados abertos conectados para informação legislativa

Informação & Sociedade: Estudos ◽

10.22478/ufpb.1809-4783.2018v28n2.37979 ◽

2018 ◽

Vol 28 (2) ◽

Author(s):

Mariana Baptista Brandt ◽

Silvana Aparecida Borsetti Gregorio Vidotti ◽

José Eduardo Santarem Segundo

Keyword(s):

Resource Description Framework ◽

Linked Data ◽

World Wide ◽

Open Data ◽

Linked Open Data ◽

Dublin Core ◽

Description Framework ◽

E Mail ◽

Resource Description ◽

Rdf Schema

A presente pesquisa objetiva propor um modelo de dados abertos conectados (linked open data - LOD), para um conjunto de dados abertos legislativos da Câmara dos Deputados. Para tanto, procede-se à revisão de literatura sobre os conceitos de dados abertos, dados abertos governamentais, dados conectados (linked data), e dados abertos conectados (linked open data), seguido de pesquisa aplicada, com a modelagem de dados legislativos no modelo LOD. Para esta pesquisa foi selecionado o conjunto de dados "Deputados", que contém informações como partido político, unidade federativa, e-mail, legislatura, entre outras, sobre os parlamentares. Desse modo, observa-se que a estruturação do conjunto de dados em RDF (Resource Description Framework) é possível com reuso de vocabulários e padrões já estabelecidos na Web Semântica como Dublin Core, Friend of a Friend (FOAF), RDF e RDF Schema, além de vocabulários de áreas correlatas, como a Ontologia da Câmara dos Deputados italiana e a da Assembleia Nacional Francesa. Conforme recomendação do padrão Linked Data, os recursos foram relacionados também a outros conjuntos de LOD para enriquecimento semântico, como as bases Geonames e DBpedia. O estudo que permite concluir que a disponibilização dos dados governamentais, em especial, dados legislativos, pode ser feita seguindo as recomendações da W3C (World Wide Web Consortium) e, assim, integrar os dados legislativos à Web de Dados e ampliar as possibilidades de reuso e aplicações dos dados em ações de transparência e fiscalização, aproximando os cidadãos do Congresso e de seus representantes.

BioHackathon series in 2013 and 2014: improvements of semantic interoperability in life science data and services

F1000Research ◽

10.12688/f1000research.18238.1 ◽

2019 ◽

Vol 8 ◽

pp. 1677

Author(s):

Toshiaki Katayama ◽

Shuichi Kawashima ◽

Gos Micklem ◽

Shin Kawano ◽

Jin-Dong Kim ◽

...

Keyword(s):

Service Discovery ◽

Query Language ◽

Life Sciences ◽

Open Data ◽

Semantic Interoperability ◽

Science Data ◽

Rdf Data ◽

Description Framework ◽

Machine Readable

Publishing databases in the Resource Description Framework (RDF) model is becoming widely accepted to maximize the syntactic and semantic interoperability of open data in life sciences. Here we report advancements made in the 6th and 7th annual BioHackathons which were held in Tokyo and Miyagi respectively. This review consists of two major sections covering: 1) improvement and utilization of RDF data in various domains of the life sciences and 2) meta-data about these RDF data, the resources that store them, and the service quality of SPARQL Protocol and RDF Query Language (SPARQL) endpoints. The first section describes how we developed RDF data, ontologies and tools in genomics, proteomics, metabolomics, glycomics and by literature text mining. The second section describes how we defined descriptions of datasets, the provenance of data, and quality assessment of services and service discovery. By enhancing the harmonization of these two layers of machine-readable data and knowledge, we improve the way community wide resources are developed and published. Moreover, we outline best practices for the future, and prepare ourselves for an exciting and unanticipatable variety of real world applications in coming years.