Preparing Controlled Vocabularies for Linked Data: Benefits and Challenges

2015 ◽  
Vol 15 (3-4) ◽  
pp. 177-190 ◽  
Author(s):  
Silvia B. Southwick ◽  
Cory K. Lampert ◽  
Richard Southwick
2021 ◽  
Vol 40 (3) ◽  
Author(s):  
Juliet Hardesty ◽  
Allison Nolan

Controlled vocabularies used in cultural heritage organizations (galleries, libraries, archives, and museums) are a helpful way to standardize terminology but can also result in misrepresentation or exclusion of systemically marginalized groups. Library of Congress Subject Headings (LCSH) is one example of a widely used yet problematic controlled vocabulary for subject headings. In some cases, systemically marginalized groups are creating controlled vocabularies that better reflect their terminology. When a widely used vocabulary like LCSH and a controlled vocabulary from a marginalized community are both available as linked data, it is possible to incorporate the terminology from the marginalized community as an overlay or replacement for outdated or absent terms from more widely used vocabularies. This paper provides a use case for examining how the Homosaurus, an LGBTQ+ linked data controlled vocabulary, can provide an augmented and updated search experience to mitigate bias within a system that only uses LCSH for subject headings.


2021 ◽  
Vol 33 ◽  
Author(s):  
Everton Rodrigues Barbosa ◽  
Moisés Lima Dutra ◽  
Angel Freddy Godoy Viera ◽  
Douglas Dyllon Jeronimo de Macedo

Abstract Most libraries put a lot of effort into developing subject headings or thesauri, which are used to index and retrieve information. Nevertheless, in the library field, controlled vocabularies are associated to authority records as authority files. In order to become findable by search engines, these authority files should be modelled on semantic vocabularies. This research proposes an authority-record conversion process for publishing thesauri and subject headings as linked data, by using the Simple Knowledge Organization Systems data model. To this purpose, we undertook a bibliographic and documentary research on the World Wide Web Consortium recommendation guidelines, which were used to produce a set of procedures and technologies to support the conversion proposal. This research provides evidences that controlled vocabularies are an important resource for improving information retrieval on the web. The proposed conversion process works as a quick guide for controlled vocabulary integration and reuse among users and systems on the linked data environment. Although the proposal was originally intended for a library setting, it can be applied and tested in another type of institution, such as documentation centres, museums, or cultural heritage archives. It can also be used in other linked open data projects.


Author(s):  
Salvia Praga

The automatic construction of ontologies from texts is usually based on the text itself, and the domain described is limited to the content of the text. In order to design semantically richer ontologies, the authors propose to extend the classical methods of ontology construction (1) by taking into account the text from the point of view of its structure and its content to build a first nucleus ontology and (2) enriching the ontology obtained by exploiting external resources (general texts and controlled vocabularies of the same domain). This chapter describes how these different resources are analyzed and exploited using linked data properties.


2017 ◽  
Vol 13 (4) ◽  
pp. 155-180 ◽  
Author(s):  
Laurens De Vocht ◽  
Selver Softic ◽  
Ruben Verborgh ◽  
Erik Mannens ◽  
Martin Ebner

When researchers formulate search queries to find relevant content on the Web, those queries typically consist of keywords that can only be matched in the content or its metadata. The Web of Data extends this functionality by bringing structure and giving well-defined meaning to the content and it enables humans and machines to work together using controlled vocabularies. Due the high degree of mismatches between the structure of the content and the vocabularies in different sources, searching over multiple heterogeneous repositories of structured data is considered challenging. Therefore, the authors present a semantic search engine for researchers facilitating search in research related Linked Data. To facilitate high-precision interactive search, they annotated and interlinked structured research data with ontologies from various repositories in an effective semantic model. Furthermore, the authors' system is adaptive as researchers can synchronize using new social media accounts and efficiently explore new datasets.


Author(s):  
S. Mortier ◽  
K. Van Daele ◽  
L. Meganck

Heritage organizations in Flanders started using thesauri fairly recently compared to other countries. This paper starts with examining the historical use of thesauri and controlled vocabularies in computer systems by the Flemish Government dealing with immovable cultural heritage. Their evolution from simple, flat, controlled lists to actual thesauri with scope notes, hierarchical and equivalence relations and links to other thesauri will be discussed. An explanation will be provided for the evolution in our approach to controlled vocabularies, and how they radically changed querying and the way data is indexed in our systems.<br><br> Technical challenges inherent to complex thesauri and how to overcome them will be outlined. These issues being solved, thesauri have become an essential feature of the Flanders Heritage inventory management system. The number of vocabularies rose over the years and became an essential tool for integrating heritage from different disciplines.<br><br> As a final improvement, thesauri went from being a core part of one application (the inventory management system) to forming an essential part of a new general resource oriented system architecture for Flanders Heritage influenced by Linked Data. For this purpose, a generic SKOS based editor was created. Due to the SKOS model being generic enough to be used outside of Flanders Heritage, the decision was made early on to develop this editor as an open source project called Atramhasis and share it with the larger heritage world.


2014 ◽  
Vol 58 (4) ◽  
pp. 233 ◽  
Author(s):  
Kristin E. Martin ◽  
Kavita Mundle

This paper surveys the English-language literature on cataloging and classification published during 2011 and 2012, covering both theory and application. A major theme of the literature centered on Resource Description and Access (RDA), as the period covered in this review includes the conclusion of the RDA test, revisions to RDA, and the implementation decision. Explorations in the theory and practical applications of the Functional Requirements for Bibliographic Records (FRBR), upon which RDA is organized, are also heavily represented. Library involvement with linked data through the creation of prototypes and vocabularies are explored further during the period. Other areas covered in the review include: classification, controlled vocabularies and name authority, evaluation and history of cataloging, special formats cataloging, cataloging and discovery services, non-AACR2/RDA metadata, cataloging workflows, and the education and careers of catalogers.


2018 ◽  
Author(s):  
Violeta Ilik ◽  
Lukas Koster

In this paper we discuss a proposal for creating an information sharing pipeline/real-time information channel, where all stakeholders would be able to engage in exchange/verification of information about entities in real time. The entities in question include personal and organizational names as well as subject headings from different controlled vocabularies. The proposed solution is a shared information pipeline where all stakeholders/agents would be able to share and exchange data about entities in real time. Three W3C recommended protocols are considered as potential solutions: the Linked Data Notifications protocol, the ActivityPub protocol, and the WebSub protocol. We compare and explore the three protocols for the purpose of identifying the best way to create an information sharing pipeline that would provide access to most up to date information to all stakeholders.


2021 ◽  
Vol 48 (4) ◽  
pp. 282-290
Author(s):  
Athanasios Velios ◽  
Kristen St.John

One of the fundamental roles of memory organisations is to safe-keep collections and this includes activities around their preservation and conservation. Conservators produce documentation records of their work to assist future interpretation of objects and to explain decision making for conservation. This documentation may exist as structured data or free text and in both cases they require vocabularies that can be understood widely in the domain. This paper describes a survey of conservation professionals which allowed us to compile the vocabularies used in the domain. It includes an analysis of the vocabularies with key findings: a) the overlapping terms with multiple definitions, b) the partial coverage of the domain which is lacking controlled vocabularies for condition types and treatment techniques and c) the limited formats in which vocabularies are published, making them difficult to use within Linked Data implementations. The paper then describes an approach to improve the vocabulary landscape in conservation by providing guidelines for encoding and aligning vocabularies as well as considering third party platforms for sharing vocabularies in a sustainable way. The paper concludes with a summary of our findings and recommendations.


Sign in / Sign up

Export Citation Format

Share Document