The strongest link: libraries and linked data

Linked Data, Towards Realizing the Web of Data

Information Retrieval and Management ◽

10.4018/978-1-5225-5191-1.ch013 ◽

2018 ◽

pp. 292-312

Author(s):

Leila Zemmouchi-Ghomari

Keyword(s):

Semantic Web ◽

Linked Data ◽

Web Applications ◽

Online Data ◽

Web Of Data ◽

Promising Solution ◽

Basic Ideas ◽

Online Sources ◽

Significant Effort ◽

The Web

Data play a central role in the effectiveness and efficiency of web applications, such as the Semantic Web. However, data are distributed across a very large number of online sources, so significant effort is needed to integrate them for proper use. A promising solution to this issue is the linked data initiative, which is based on four principles for publishing web data and which fosters interlinked, structured online data rather than the existing web of documents. This paper surveys the basic ideas, techniques, and applications of the linked data initiative. The authors also discuss some open Linked Data issues and potential tracks for addressing them.

Web Data Quality

International Journal on Semantic Web and Information Systems ◽

10.4018/ijswis.2014040101 ◽

2014 ◽

Vol 10 (2) ◽

pp. 1-6 ◽

Cited By ~ 9

Author(s):

Amrapali Zaveri ◽

Andrea Maurino ◽

Laure Berti-Equille

Keyword(s):

Semantic Web ◽

Data Quality ◽

Linked Data ◽

Web Data ◽

Semantic Web Technologies ◽

Future Directions ◽

Web Technologies ◽

Current State ◽

Web Of Data ◽

The Web

The standardization and adoption of Semantic Web technologies have resulted in an unprecedented volume of data being published as Linked Data (LD). However, the “publish first, refine later” philosophy leads to various quality problems in the underlying data, such as incompleteness, inconsistency and semantic ambiguities. In this article, we describe the current state of Data Quality in the Web of Data along with details of the three papers accepted for the International Journal on Semantic Web and Information Systems' (IJSWIS) Special Issue on Web Data Quality. Additionally, we identify new challenges that are specific to the Web of Data and provide insights into the current progress and future directions for each of those challenges.

Wikidata

Information Technology and Libraries ◽

10.6017/ital.v38i2.10886 ◽

2019 ◽

Vol 38 (2) ◽

pp. 72-81 ◽

Cited By ~ 1

Author(s):

Theo Van Veen

Keyword(s):

Linked Data ◽

Control Mechanism ◽

Critical Mass ◽

Named Entities ◽

Authority Control ◽

Barriers To Access ◽

Web Of Data ◽

The Web

Library catalogues may be connected to the linked data cloud through various types of thesauri. For name authority thesauri in particular I would like to suggest a fundamental break with the current distributed linked data paradigm: to make a transition from a multitude of different identifiers to using a single, universal identifier for all relevant named entities, in the form of the Wikidata identifier. Wikidata (https://wikidata.org) seems to be evolving into a major authority hub that is lowering barriers to accessing the web of data for everyone. Using the Wikidata identifier of notable entities as a common identifier for connecting resources has significant benefits compared to traversing the ever-growing linked data cloud. Once the use of Wikidata reaches a critical mass, it could even serve as an authority control mechanism for some institutions.
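
The hub idea in this abstract can be illustrated with a minimal sketch, assuming two catalogues whose authority records each carry an optional Wikidata identifier. The catalogue contents, local identifiers, and function names below are hypothetical and are not taken from the article; only Q42 (Douglas Adams) is a real Wikidata identifier.

```python
# Hypothetical authority records from two catalogues. The local ids and the
# "wikidata" field name are assumptions made for this illustration.
LIBRARY_A = {
    "a-001": {"name": "Adams, Douglas", "wikidata": "Q42"},
    "a-002": {"name": "Unmatched, Author", "wikidata": None},
}
LIBRARY_B = {
    "b-917": {"name": "Douglas Adams", "wikidata": "Q42"},
}


def index_by_hub(catalogue):
    """Map each hub identifier to the local identifiers that carry it."""
    index = {}
    for local_id, record in catalogue.items():
        qid = record.get("wikidata")
        if qid:  # records without a hub identifier cannot be connected
            index.setdefault(qid, []).append(local_id)
    return index


def connect(cat_a, cat_b):
    """Pair records from two catalogues that share a hub identifier."""
    index_b = index_by_hub(cat_b)
    pairs = []
    for qid, ids_a in index_by_hub(cat_a).items():
        for id_a in ids_a:
            for id_b in index_b.get(qid, []):
                pairs.append((id_a, id_b, qid))
    return pairs


def entity_uri(qid):
    """Build the Wikidata entity URI for an identifier such as 'Q42'."""
    return f"http://www.wikidata.org/entity/{qid}"


print(connect(LIBRARY_A, LIBRARY_B))
```

With a shared identifier, connecting two catalogues reduces to a dictionary lookup; no chain of pairwise links between the catalogues has to be followed.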

Semantic Web Standards for Publishing and Integrating Open Data

Standards and Standardization ◽

10.4018/978-1-4666-8111-8.ch001 ◽

2015 ◽

pp. 1-20 ◽

Cited By ~ 1

Author(s):

Axel Polleres ◽

Simon Steyskal

Keyword(s):

Semantic Web ◽

World Wide ◽

Open Data ◽

Standard Data ◽

Web Standards ◽

Web Of Data ◽

Structured Information ◽

Potential Risks ◽

Rdf Data ◽

The Web

The World Wide Web Consortium (W3C) as the main standardization body for Web standards has set a particular focus on publishing and integrating Open Data. In this chapter, the authors explain various standards from the W3C's Semantic Web activity and the—potential—role they play in the context of Open Data: RDF, as a standard data format for publishing and consuming structured information on the Web; the Linked Data principles for interlinking RDF data published across the Web and leveraging a Web of Data; RDFS and OWL to describe vocabularies used in RDF and for describing mappings between such vocabularies. The authors conclude with a review of current deployments of these standards on the Web, particularly within public Open Data initiatives, and discuss potential risks and challenges.
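
As a rough illustration of what RDF publishing and cross-dataset linking can look like, the sketch below models statements as subject, predicate, object tuples and serializes them as N-Triples lines. The example.org and example.com URIs, the `Literal` helper, and the function name are hypothetical; the `rdfs:label` and `owl:sameAs` URIs are the standard W3C vocabulary terms. This is a simplified sketch, not a complete N-Triples writer (for instance, it does not handle language tags, datatypes, or blank nodes).

```python
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


class Literal(str):
    """Marks an object value as a plain string literal rather than a URI."""


def to_ntriples(triples):
    """Serialize (subject, predicate, object) tuples as N-Triples lines."""
    lines = []
    for subject, predicate, obj in triples:
        if isinstance(obj, Literal):
            # Escape backslashes and double quotes inside the literal.
            escaped = obj.replace("\\", "\\\\").replace('"', '\\"')
            rendered = f'"{escaped}"'
        else:
            rendered = f"<{obj}>"
        lines.append(f"<{subject}> <{predicate}> {rendered} .")
    return "\n".join(lines)


# Hypothetical open-data resource: a label, plus a link to another dataset.
triples = [
    ("http://example.org/org/city", RDFS_LABEL, Literal("City administration")),
    ("http://example.org/org/city", OWL_SAME_AS, "http://example.com/other/city"),
]

print(to_ntriples(triples))
```

The second statement is the kind of link the Linked Data principles call for: it points from a resource in one dataset to a resource published elsewhere.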

Data Linking for the Semantic Web

International Journal on Semantic Web and Information Systems ◽

10.4018/jswis.2011070103 ◽

2011 ◽

Vol 7 (3) ◽

pp. 46-76 ◽

Cited By ~ 59

Author(s):

Alfio Ferrara ◽

Andriy Nikolov ◽

François Scharffe

Keyword(s):

Graph Theory ◽

Natural Language Processing ◽

Semantic Web ◽

Language Processing ◽

Linked Data ◽

Background Information ◽

Web Of Data ◽

Comprehensive Survey ◽

Term Data ◽

The Web

By specifying that published datasets must link to other existing datasets, the 4th linked data principle ensures a Web of data and not just a set of unconnected data islands. The authors propose in this paper the term data linking to name the problem of finding equivalent resources on the Web of linked data. To perform data linking, many techniques have been developed, with roots in statistics, databases, natural language processing, and graph theory. The authors begin this paper by providing background information and terminological clarifications related to data linking. Then a comprehensive survey of the various techniques available for data linking is provided. These techniques are classified along the three criteria of granularity, type of evidence, and source of the evidence. Finally, the authors survey eleven recent tools performing data linking and classify them according to the surveyed techniques.
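
To make the data linking problem concrete, here is a minimal sketch of one simple approach: comparing normalised labels with a string-similarity score and keeping the best candidate above a threshold. It is not one of the techniques or tools surveyed in the paper; the URIs, labels, function names, and the 0.9 threshold are assumptions chosen for illustration.

```python
import re
from difflib import SequenceMatcher


def normalise(label):
    """Lower-case, drop punctuation, and sort tokens so word order is ignored."""
    tokens = re.findall(r"[a-z0-9]+", label.lower())
    return " ".join(sorted(tokens))


def similarity(a, b):
    """Similarity of two labels in [0, 1], computed on their normalised forms."""
    return SequenceMatcher(None, normalise(a), normalise(b)).ratio()


def link(source, target, threshold=0.9):
    """Return (source_uri, target_uri) pairs whose labels are similar enough."""
    links = []
    for s_uri, s_label in source.items():
        best_uri, best_score = None, 0.0
        for t_uri, t_label in target.items():
            score = similarity(s_label, t_label)
            if score > best_score:
                best_uri, best_score = t_uri, score
        if best_uri is not None and best_score >= threshold:
            links.append((s_uri, best_uri))
    return links


# Hypothetical datasets: URI -> label.
source = {
    "http://example.org/a/1": "Adams, Douglas",
    "http://example.org/a/2": "Austen, Jane",
}
target = {"http://example.org/b/9": "Douglas Adams"}

print(link(source, target))
```

Each resulting pair is a candidate equivalence between two resources. A label-only comparison like this will miss or confuse many real cases, which is one reason approaches that draw on other kinds of evidence exist.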

Where the Semantic Web and Web 2.0 Meet Format Risk Management: P2 Registry

International Journal of Digital Curation ◽

10.2218/ijdc.v6i1.180 ◽

2011 ◽

Vol 6 (1) ◽

pp. 165-182 ◽

Cited By ~ 3

Author(s):

David Tarrant ◽

Steve Hitchcock ◽

Leslie Carr

Keyword(s):

Risk Management ◽

Semantic Web ◽

Linked Data ◽

Digital Preservation ◽

File Format ◽

Multiple Sources ◽

Single Institution ◽

Digital Format ◽

Machine Readable ◽

The Web

The Web is increasingly becoming a platform for linked data. This means making connections and adding value to data on the Web. As more data becomes openly available and more people are able to use the data, it becomes more powerful. An example is file format registries and the evaluation of format risks. Here the requirement for information is now greater than the effort that any single institution can put into gathering and collating this information. Recognising that more is better, the creators of PRONOM, JHOVE, GDFR and others are joining to lead a new initiative: the Unified Digital Format Registry. Ahead of this effort, a new RDF-based framework for structuring and facilitating file format data from multiple sources, including PRONOM, has demonstrated it is able to produce more links, and thus provide more answers to digital preservation questions - about format risks, applications, viewers and transformations - than the native data alone. This paper will describe this registry, P2, and its services, show how it can be used, and provide examples where it delivers more answers than the contributing resources. The P2 Registry is a reference platform to allow and encourage publication of preservation data, and also an exemplar of what can be achieved if more data is published openly online as simple machine-readable documents. This approach calls for the active participation of the digital preservation community to contribute data by simply publishing it openly on the Web as linked data.

Data Linking for the Semantic Web

Semantic Web ◽

10.4018/978-1-4666-3610-1.ch008 ◽

2013 ◽

pp. 169-200 ◽

Cited By ~ 15

Author(s):

Alfio Ferrara ◽

Andriy Nikolov ◽

François Scharffe

Keyword(s):

Graph Theory ◽

Natural Language Processing ◽

Semantic Web ◽

Language Processing ◽

Linked Data ◽

Background Information ◽

Web Of Data ◽

Comprehensive Survey ◽

Term Data ◽

The Web

By specifying that published datasets must link to other existing datasets, the 4th linked data principle ensures a Web of data and not just a set of unconnected data islands. The authors propose in this paper the term data linking to name the problem of finding equivalent resources on the Web of linked data. To perform data linking, many techniques have been developed, with roots in statistics, databases, natural language processing, and graph theory. The authors begin this paper by providing background information and terminological clarifications related to data linking. Then a comprehensive survey of the various techniques available for data linking is provided. These techniques are classified along the three criteria of granularity, type of evidence, and source of the evidence. Finally, the authors survey eleven recent tools performing data linking and classify them according to the surveyed techniques.

Transformation Approach of Open Web Data to Linked Open Data

Journal of Web Engineering ◽

10.13052/jwe1540-9589.2052 ◽

2021 ◽

Author(s):

Amina Meherehera ◽

Imane Mekideche ◽

Leila Zemmouchi-Ghomari ◽

Abdessamed Réda Ghomari

Keyword(s):

Linked Data ◽

Open Data ◽

Linked Open Data ◽

Web Data ◽

Semantic Web Technologies ◽

Automatic Interpretation ◽

Transformation Approach ◽

Web Of Data ◽

Machine Readable ◽

The Web

A large amount of the data available on the Web, open data in particular, comes in heterogeneous formats and is not machine-readable. One promising solution to the problems of heterogeneity and automatic interpretation is the Linked Data initiative, which aims to provide unified practices for publishing data on the Web and linking it contextually, using World Wide Web Consortium standards and Semantic Web technologies. Linked data promotes the Web’s transformation from a web of documents to a web of data, ensuring that machines and software agents can interpret the semantics of data correctly and can therefore infer new facts and return relevant web data search results. This paper presents an automatic, generic transformation approach that converts several input formats of open web data to linked open data. This work aims to contribute actively to the movement of publishing data compliant with linked data principles.

Abstraction of linked data’s world

Visión electrónica ◽

10.14483/22484728.14397 ◽

2019 ◽

Vol 13 (1) ◽

pp. 57-74

Author(s):

Jhon Francined Herrera-Cubides ◽

Paulo Alonso Gaona-García ◽

Carlos Enrique Montenegro-Marín ◽

Salvador Sánchez-Alonso ◽

David Martin-Moncunill

Keyword(s):

Semantic Web ◽

Linked Data ◽

Basic Principles ◽

Problem Situations ◽

Data Process ◽

Web Of Data ◽

The Web

Linked Data, as a strategy of the Semantic Web, is based on the application of some basic principles that contribute to the growth of the Web, thus allowing the transition from the Web of Documents to the Web of Data. The process developed by Linked Data is supported by different scenarios, which interact in order to link resources on the Web. Some of these scenarios have a solid technological background, while others pose challenges when they are implemented. This paper aims to identify and present a generic abstraction of Linked Data, in order to identify problem situations that restrict the Linked Data process.
