Preparing Controlled Vocabularies for Linked Data: Benefits and Challenges

Controlled vocabularies used in cultural heritage organizations (galleries, libraries, archives, and museums) are a helpful way to standardize terminology but can also result in misrepresentation or exclusion of systemically marginalized groups. Library of Congress Subject Headings (LCSH) is one example of a widely used yet problematic controlled vocabulary for subject headings. In some cases, systemically marginalized groups are creating controlled vocabularies that better reflect their terminology. When a widely used vocabulary like LCSH and a controlled vocabulary from a marginalized community are both available as linked data, it is possible to incorporate the terminology from the marginalized community as an overlay or replacement for outdated or absent terms from more widely used vocabularies. This paper provides a use case for examining how the Homosaurus, an LGBTQ+ linked data controlled vocabulary, can provide an augmented and updated search experience to mitigate bias within a system that only uses LCSH for subject headings.

Download Full-text

Thesaurus and subject heading lists as Linked Data

Transinformação ◽

10.1590/2318-0889202133e200077 ◽

2021 ◽

Vol 33 ◽

Author(s):

Everton Rodrigues Barbosa ◽

Moisés Lima Dutra ◽

Angel Freddy Godoy Viera ◽

Douglas Dyllon Jeronimo de Macedo

Keyword(s):

Data Model ◽

Linked Data ◽

Open Data ◽

Controlled Vocabulary ◽

Conversion Process ◽

Controlled Vocabularies ◽

Knowledge Organization Systems ◽

Subject Headings ◽

Data Environment ◽

Type Of Institution

Abstract Most libraries put a lot of effort into developing subject headings or thesauri, which are used to index and retrieve information. Nevertheless, in the library field, controlled vocabularies are associated to authority records as authority files. In order to become findable by search engines, these authority files should be modelled on semantic vocabularies. This research proposes an authority-record conversion process for publishing thesauri and subject headings as linked data, by using the Simple Knowledge Organization Systems data model. To this purpose, we undertook a bibliographic and documentary research on the World Wide Web Consortium recommendation guidelines, which were used to produce a set of procedures and technologies to support the conversion proposal. This research provides evidences that controlled vocabularies are an important resource for improving information retrieval on the web. The proposed conversion process works as a quick guide for controlled vocabulary integration and reuse among users and systems on the linked data environment. Although the proposal was originally intended for a library setting, it can be applied and tested in another type of institution, such as documentation centres, museums, or cultural heritage archives. It can also be used in other linked open data projects.

Download Full-text

Enrichment Ontology via Linked Data

Advances in Computational Intelligence and Robotics - Advanced Metaheuristic Methods in Big Data Retrieval and Analytics ◽

10.4018/978-1-5225-7338-8.ch005 ◽

2019 ◽

pp. 91-103

Author(s):

Salvia Praga

Keyword(s):

Linked Data ◽

Point Of View ◽

Controlled Vocabularies ◽

Ontology Construction ◽

External Resources ◽

Automatic Construction

The automatic construction of ontologies from texts is usually based on the text itself, and the domain described is limited to the content of the text. In order to design semantically richer ontologies, the authors propose to extend the classical methods of ontology construction (1) by taking into account the text from the point of view of its structure and its content to build a first nucleus ontology and (2) enriching the ontology obtained by exploiting external resources (general texts and controlled vocabularies of the same domain). This chapter describes how these different resources are analyzed and exploited using linked data properties.

Download Full-text

Social Semantic Search

International Journal on Semantic Web and Information Systems ◽

10.4018/ijswis.2017100108 ◽

2017 ◽

Vol 13 (4) ◽

pp. 155-180 ◽

Cited By ~ 6

Author(s):

Laurens De Vocht ◽

Selver Softic ◽

Ruben Verborgh ◽

Erik Mannens ◽

Martin Ebner

Keyword(s):

Linked Data ◽

Structured Data ◽

Semantic Search ◽

Semantic Model ◽

Controlled Vocabularies ◽

Interactive Search ◽

Web Of Data ◽

High Degree ◽

Different Sources ◽

The Web

When researchers formulate search queries to find relevant content on the Web, those queries typically consist of keywords that can only be matched in the content or its metadata. The Web of Data extends this functionality by bringing structure and giving well-defined meaning to the content and it enables humans and machines to work together using controlled vocabularies. Due the high degree of mismatches between the structure of the content and the vocabularies in different sources, searching over multiple heterogeneous repositories of structured data is considered challenging. Therefore, the authors present a semantic search engine for researchers facilitating search in research related Linked Data. To facilitate high-precision interactive search, they annotated and interlinked structured research data with ontologies from various repositories in an effective semantic model. Furthermore, the authors' system is adaptive as researchers can synchronize using new social media accounts and efficiently explore new datasets.

Download Full-text

Merging Controlled Vocabularies through Semantic Alignment Based on Linked Data

Communications in Computer and Information Science - Metadata and Semantics Research ◽

10.1007/978-3-319-03437-9_32 ◽

2013 ◽

pp. 330-341

Author(s):

Ioannis Papadakis ◽

Konstantinos Kyprianos

Keyword(s):

Linked Data ◽

Controlled Vocabularies ◽

Semantic Alignment

Download Full-text

CALLING IT WHAT IT IS. THESAURI IN THE FLANDERS HERITAGE AGENCY: HISTORY, IMPORTANCE, USE AND TECHNOLOGICAL ADVANCES

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iv-2-w2-151-2017 ◽

2017 ◽

Vol IV-2/W2 ◽

pp. 151-156 ◽

Cited By ~ 1

Author(s):

S. Mortier ◽

K. Van Daele ◽

L. Meganck

Keyword(s):

Inventory Management ◽

Management System ◽

Linked Data ◽

Essential Feature ◽

Equivalence Relations ◽

Controlled Vocabularies ◽

Inventory Management System ◽

Technological Advances ◽

Core Part ◽

Oriented System

Heritage organizations in Flanders started using thesauri fairly recently compared to other countries. This paper starts with examining the historical use of thesauri and controlled vocabularies in computer systems by the Flemish Government dealing with immovable cultural heritage. Their evolution from simple, flat, controlled lists to actual thesauri with scope notes, hierarchical and equivalence relations and links to other thesauri will be discussed. An explanation will be provided for the evolution in our approach to controlled vocabularies, and how they radically changed querying and the way data is indexed in our systems.<br><br> Technical challenges inherent to complex thesauri and how to overcome them will be outlined. These issues being solved, thesauri have become an essential feature of the Flanders Heritage inventory management system. The number of vocabularies rose over the years and became an essential tool for integrating heritage from different disciplines.<br><br> As a final improvement, thesauri went from being a core part of one application (the inventory management system) to forming an essential part of a new general resource oriented system architecture for Flanders Heritage influenced by Linked Data. For this purpose, a generic SKOS based editor was created. Due to the SKOS model being generic enough to be used outside of Flanders Heritage, the decision was made early on to develop this editor as an open source project called Atramhasis and share it with the larger heritage world.

Download Full-text

Positioning Libraries for a New Bibliographic Universe

Library Resources and Technical Services ◽

10.5860/lrts.58n4.233 ◽

2014 ◽

Vol 58 (4) ◽

pp. 233 ◽

Cited By ~ 2

Author(s):

Kristin E. Martin ◽

Kavita Mundle

Keyword(s):

Linked Data ◽

English Language ◽

Major Theme ◽

Functional Requirements ◽

Controlled Vocabularies ◽

Practical Applications ◽

Bibliographic Records ◽

History Of ◽

Resource Description ◽

Language Literature

This paper surveys the English-language literature on cataloging and classification published during 2011 and 2012, covering both theory and application. A major theme of the literature centered on Resource Description and Access (RDA), as the period covered in this review includes the conclusion of the RDA test, revisions to RDA, and the implementation decision. Explorations in the theory and practical applications of the Functional Requirements for Bibliographic Records (FRBR), upon which RDA is organized, are also heavily represented. Library involvement with linked data through the creation of prototypes and vocabularies are explored further during the period. Other areas covered in the review include: classification, controlled vocabularies and name authority, evaluation and history of cataloging, special formats cataloging, cataloging and discovery services, non-AACR2/RDA metadata, cataloging workflows, and the education and careers of catalogers.

Download Full-text

Information Sharing Pipeline

10.31219/osf.io/hbwf8 ◽

2018 ◽

Author(s):

Violeta Ilik ◽

Lukas Koster

Keyword(s):

Information Sharing ◽

Real Time ◽

Linked Data ◽

Information Channel ◽

Controlled Vocabularies ◽

Subject Headings ◽

Shared Information ◽

Real Time Information ◽

Time Information ◽

Exchange Data

In this paper we discuss a proposal for creating an information sharing pipeline/real-time information channel, where all stakeholders would be able to engage in exchange/verification of information about entities in real time. The entities in question include personal and organizational names as well as subject headings from different controlled vocabularies. The proposed solution is a shared information pipeline where all stakeholders/agents would be able to share and exchange data about entities in real time. Three W3C recommended protocols are considered as potential solutions: the Linked Data Notifications protocol, the ActivityPub protocol, and the WebSub protocol. We compare and explore the three protocols for the purpose of identifying the best way to create an information sharing pipeline that would provide access to most up to date information to all stakeholders.

Download Full-text

Linked Conservation Data: the Adoption and Use of Vocabularies in the Field of Heritage Conservation for Publishing Conservation Records as Linked Data

KNOWLEDGE ORGANIZATION ◽

10.5771/0943-7444-2021-4-282 ◽

2021 ◽

Vol 48 (4) ◽

pp. 282-290

Author(s):

Athanasios Velios ◽

Kristen St.John

Keyword(s):

Decision Making ◽

Linked Data ◽

Structured Data ◽

Third Party ◽

Heritage Conservation ◽

Free Text ◽

Controlled Vocabularies ◽

Partial Coverage ◽

Party Platforms ◽

Treatment Techniques

One of the fundamental roles of memory organisations is to safe-keep collections and this includes activities around their preservation and conservation. Conservators produce documentation records of their work to assist future interpretation of objects and to explain decision making for conservation. This documentation may exist as structured data or free text and in both cases they require vocabularies that can be understood widely in the domain. This paper describes a survey of conservation professionals which allowed us to compile the vocabularies used in the domain. It includes an analysis of the vocabularies with key findings: a) the overlapping terms with multiple definitions, b) the partial coverage of the domain which is lacking controlled vocabularies for condition types and treatment techniques and c) the limited formats in which vocabularies are published, making them difficult to use within Linked Data implementations. The paper then describes an approach to improve the vocabulary landscape in conservation by providing guidelines for encoding and aligning vocabularies as well as considering third party platforms for sharing vocabularies in a sustainable way. The paper concludes with a summary of our findings and recommendations.

Download Full-text

The Linked Data Enterprise as Enabler for both Intra- and Inter-organizational Business Data Integration and Usage

2020 43rd International Convention on Information, Communication and Electronic Technology (MIPRO) ◽

10.23919/mipro48935.2020.9245320 ◽

2020 ◽

Author(s):

A M. Tjoa

Keyword(s):

Data Integration ◽

Linked Data ◽

Business Data

Download Full-text