Data Integration for Open Data on the Web

Enabling Web-scale data integration in biomedicine through Linked Open Data

npj Digital Medicine ◽

10.1038/s41746-019-0162-5 ◽

2019 ◽

Vol 2 (1) ◽

Cited By ~ 3

Author(s):

Maulik R. Kamdar ◽

Javier D. Fernández ◽

Axel Polleres ◽

Tania Tudorache ◽

Mark A. Musen

Keyword(s):

Data Integration ◽

Biomedical Research ◽

Semantic Processing ◽

Open Data ◽

Heterogeneous Data ◽

Research Community ◽

Linked Open Data ◽

Biomedical Data ◽

Semantic Web Technologies ◽

The Web

Abstract The biomedical data landscape is fragmented with several isolated, heterogeneous data and knowledge sources, which use varying formats, syntaxes, schemas, and entity notations, existing on the Web. Biomedical researchers face severe logistical and technical challenges to query, integrate, analyze, and visualize data from multiple diverse sources in the context of available biomedical knowledge. Semantic Web technologies and Linked Data principles may aid toward Web-scale semantic processing and data integration in biomedicine. The biomedical research community has been one of the earliest adopters of these technologies and principles to publish data and knowledge on the Web as linked graphs and ontologies, hence creating the Life Sciences Linked Open Data (LSLOD) cloud. In this paper, we provide our perspective on some opportunities proffered by the use of LSLOD to integrate biomedical data and knowledge in three domains: (1) pharmacology, (2) cancer research, and (3) infectious diseases. We will discuss some of the major challenges that hinder the wide-spread use and consumption of LSLOD by the biomedical research community. Finally, we provide a few technical solutions and insights that can address these challenges. Eventually, LSLOD can enable the development of scalable, intelligent infrastructures that support artificial intelligence methods for augmenting human intelligence to achieve better clinical outcomes for patients, to enhance the quality of biomedical research, and to improve our understanding of living systems.

Download Full-text

Applications of Semantic Web in integrating open data and bibliographic records: a development example of an infomediary of Taiwanese indigenous people

The Electronic Library ◽

10.1108/el-09-2020-0258 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Han-Yu Sung ◽

Yu-Liang Chi

Keyword(s):

Semantic Web ◽

Data Integration ◽

Indigenous People ◽

Open Data ◽

Heterogeneous Data ◽

Data Sources ◽

Content Type ◽

Bibliographic Records ◽

Open Government Data ◽

The Web

Purpose This study aims to develop a Web-based application system called Infomediary of Taiwanese Indigenous Peoples (ITIP) that can help individuals comprehend the society and culture of indigenous people. The ITIP is based on the use of Semantic Web technologies to integrate a number of data sources, particularly including the bibliographic records of a museum. Moreover, an ontology model was developed to help users search cultural collections by topic concepts. Design/methodology/approach Two issues were identified that needed to be addressed: the integration of heterogeneous data sources and semantic-based information retrieval. Two corresponding methods were proposed: SPARQL federated queries were designed for data integration across the Web and ontology-driven queries were designed to semantically search by knowledge inference. Furthermore, to help users perform searches easily, three searching interfaces, namely, ethnicity, region and topic, were developed to take full advantage of the content available on the Web. Findings Most open government data provides structured but non-resource description framework data, Semantic Web consumers, therefore, require additional data conversion before the data can be used. On the other hand, although the library, archive and museum (LAM) community has produced some emerging linked data, very few data sets are released to the general public as open data. The Semantic Web’s vision of “web of data” remains challenging. Originality/value This study developed data integration from various institutions, including those of the LAM community. The development was conducted based on the mode of non-institution members (i.e. institutional outsiders). The challenges encountered included uncertain data quality and the absence of institutional participation.

Download Full-text

The read–write Linked Data Web

Philosophical Transactions of The Royal Society A Mathematical Physical and Engineering Sciences ◽

10.1098/rsta.2012.0513 ◽

2013 ◽

Vol 371 (1987) ◽

pp. 20120513 ◽

Cited By ~ 15

Author(s):

Tim Berners-Lee ◽

Kieron O’Hara

Keyword(s):

Future Development ◽

Linked Data ◽

Open Data ◽

Linked Open Data ◽

The Future ◽

The Web

This paper discusses issues that will affect the future development of the Web, either increasing its power and utility, or alternatively suppressing its development. It argues for the importance of the continued development of the Linked Data Web, and describes the use of linked open data as an important component of that. Second, the paper defends the Web as a read–write medium, and goes on to consider how the read–write Linked Data Web could be achieved.

Download Full-text

Data integration over the Web

Data & Knowledge Engineering ◽

10.1016/s0169-023x(02)00139-8 ◽

2003 ◽

Vol 44 (3) ◽

pp. 265-266 ◽

Cited By ~ 3

Author(s):

Z. Bellahsene

Keyword(s):

Data Integration ◽

The Web

Download Full-text

Publishing Statistical Data following the Linked Open Data Principles

Cases on Open-Linked Data and Semantic Web Applications ◽

10.4018/978-1-4666-2827-4.ch011 ◽

2013 ◽

pp. 199-226 ◽

Cited By ~ 5

Author(s):

Jose María Alvarez Rodríguez ◽

Jules Clement ◽

José Emilio Labra Gayo ◽

Hania Farhan ◽

Patricia Ordoñez de Pablos

Keyword(s):

Linked Data ◽

Statistical Data ◽

Open Data ◽

Linked Open Data ◽

Dimensional Measure ◽

The Web

This chapter introduces the promotion of statistical data to the Linked Open Data initiative in the context of the Web Index project. A framework for the publication of raw statistics and a method to convert them to Linked Data are also presented following the W3C standards RDF, SKOS, and OWL. This case study is focused on the Web Index project; launched by the Web Foundation, the Index is the first multi-dimensional measure of the growth, utility, and impact of the Web on people and nations. Finally, an evaluation of the advantages of using Linked Data to publish statistics is also presented in conjunction with a discussion and future steps sections.

Download Full-text

Semantic Web Standards for Publishing and Integrating Open Data

Advances in Electronic Government, Digital Divide, and Regional Development - Handbook of Research on Advanced ICT Integration for Governance and Policy Modeling ◽

10.4018/978-1-4666-6236-0.ch003 ◽

2014 ◽

pp. 28-47

Author(s):

Axel Polleres ◽

Simon Steyskal

Keyword(s):

Semantic Web ◽

World Wide ◽

Open Data ◽

Standard Data ◽

Web Standards ◽

Web Of Data ◽

Structured Information ◽

Potential Risks ◽

Rdf Data ◽

The Web

The World Wide Web Consortium (W3C) as the main standardization body for Web standards has set a particular focus on publishing and integrating Open Data. In this chapter, the authors explain various standards from the W3C's Semantic Web activity and the—potential—role they play in the context of Open Data: RDF, as a standard data format for publishing and consuming structured information on the Web; the Linked Data principles for interlinking RDF data published across the Web and leveraging a Web of Data; RDFS and OWL to describe vocabularies used in RDF and for describing mappings between such vocabularies. The authors conclude with a review of current deployments of these standards on the Web, particularly within public Open Data initiatives, and discuss potential risks and challenges.

Download Full-text

Creation and Integration of Reference Ontologies for Efficient LOD Management

Semi-Automatic Ontology Development ◽

10.4018/978-1-4666-0188-8.ch007 ◽

2012 ◽

pp. 162-199 ◽

Cited By ~ 1

Author(s):

Mariana Damova ◽

Atanas Kiryakov ◽

Maurice Grinberg ◽

Michael K. Bergman ◽

Frédérick Giasson ◽

...

Keyword(s):

Real Life ◽

Open Data ◽

Real Data ◽

Heterogeneous Data ◽

Linked Open Data ◽

Data Space ◽

Upper Level ◽

Reference Knowledge ◽

Global Data ◽

The Web

The chapter introduces the process of design of two upper-level ontologies—PROTON and UMBEL—into reference ontologies and their integration in the so-called Reference Knowledge Stack (RKS). It is argued that RKS is an important step in the efforts of the Linked Open Data (LOD) project to transform the Web into a global data space with diverse real data, available for review and analysis. RKS is intended to make the interoperability between published datasets much more efficient than it is now. The approach discussed in the chapter consists of developing reference layers of upper-level ontologies by mapping them to certain LOD schemata and assigning instance data to them so they cover a reasonable portion of the LOD datasets. The chapter presents the methods (manual and semi-automatic) used in the creation of the RKS and gives examples that illustrate its advantages for managing highly heterogeneous data and its usefulness in real life knowledge intense applications.

Download Full-text

Linked Open Statistical Metadata

Data Visualization and Statistical Literacy for Open and Big Data - Advances in Data Mining and Database Management ◽

10.4018/978-1-5225-2512-7.ch012 ◽

2017 ◽

pp. 297-320

Author(s):

Franck Cotton ◽

Daniel Gillman

Keyword(s):

Semantic Web ◽

Resource Description Framework ◽

Statistical Data ◽

Open Data ◽

Statistical Analyses ◽

Statistical Literacy ◽

Statistical Process ◽

Description Framework ◽

Resource Description ◽

The Web

Linked Open Statistical Metadata (LOSM) is Linked Open Data (LOD) applied to statistical metadata. LOD is a model for identifying, structuring, interlinking, and querying data published directly on the web. It builds on the standards of the semantic web defined by the W3C. LOD uses the Resource Description Framework (RDF), a simple data model expressing content as predicates linking resources between them or with literal properties. The simplicity of the model makes it able to represent any data, including metadata. We define statistical data as data produced through some statistical process or intended for statistical analyses, and statistical metadata as metadata describing statistical data. LOSM promotes discovery and the meaning and structure of statistical data in an automated way. Consequently, it helps with understanding and interpreting data and preventing inadequate or flawed visualizations for statistical data. This enhances statistical literacy and efforts at visualizing statistics.

Download Full-text

Towards Increasing Open Data Adoption Through Stream Data Integration and Imputation

Advances and Trends in Artificial Intelligence. Artificial Intelligence Practices - Lecture Notes in Computer Science ◽

10.1007/978-3-030-79457-6_2 ◽

2021 ◽

pp. 15-27

Author(s):

Robert Kunicki ◽

Maciej Grzenda

Keyword(s):

Data Integration ◽

Open Data ◽

Stream Data

Download Full-text

Framework for a Symmetric Integration Approach

Symmetry ◽

10.3390/sym11020224 ◽

2019 ◽

Vol 11 (2) ◽

pp. 224

Author(s):

Mihaela Muntean ◽

Claudiu Brândaş ◽

Tanita Cîrstea

Keyword(s):

Data Integration ◽

Open Data ◽

Application Integration ◽

Cloud Environment ◽

Integration Framework ◽

Symmetry Approach ◽

Integration Approach ◽

Implementation Aspects

An Application-to-Application integration framework in the cloud environment is proposed. The methodological demarche is developed using a data symmetry approach. Implementation aspects of integration considered the Open Data Protocol (OData) service as an integrator. An important issue in the cloud environment is to integrate and ensure the quality of transferred and processed data. An efficient way of ensuring the completeness and integrity of data transferred between different applications and systems is the symmetry of data integration. With these considerations, the integration of SAP Hybris Cloud for Customer with S/4 HANA Cloud was implemented.

Download Full-text