An Open Source System Architecture for Digital Geolinguistic Linked Open Data

Motivation: Scientists increasingly rely on intelligent information systems to help them in their daily tasks, in particular for managing research objects such as publications or datasets. The relatively young research field of Semantic Publishing has been addressing the question of how scientific applications can be improved through semantically rich representations of research objects, in order to facilitate their discovery and re-use. To complement the efforts in this area, we propose an automatic workflow to construct semantic user profiles of scholars, so that scholarly applications, like digital libraries or data repositories, can better understand their users’ interests, tasks, and competences by incorporating these user profiles in their design. To make the user profiles sharable across applications, we propose to build them on standard semantic web technologies, in particular the Resource Description Framework (RDF) for representing user profiles and Linked Open Data (LOD) sources for representing competence topics. To avoid the cold start problem, we suggest automatically populating these profiles by analyzing the publications (co-)authored by users, which we hypothesize reflect their research competences.

Results: We developed a novel approach, ScholarLens, which can automatically generate semantic user profiles for authors of scholarly literature. For modeling the competences of scholarly users and groups, we surveyed a number of existing linked open data vocabularies. In accordance with LOD best practices, we propose an RDF Schema (RDFS) based model for competence records that reuses existing vocabularies where appropriate. To automate the creation of semantic user profiles, we developed a complete workflow that generates them by analyzing full-text research articles through various natural language processing (NLP) techniques. In our method, we start by processing a set of research articles for a given user. Competences are derived by text mining the articles, including syntactic, semantic, and LOD entity linking steps. We then populate a knowledge base in RDF format with user profiles containing the extracted competences. We implemented our approach as an open source library and evaluated our system through two user studies, resulting in a mean average precision (MAP) of up to 95%. As part of the evaluation, we also analyze the impact of semantic zoning of research articles on the accuracy of the resulting profiles. Finally, we demonstrate how these semantic user profiles can be applied in a number of use cases, including article ranking for personalized search and finding scientists competent in a topic, for example to find reviewers for a paper.

Availability: All software and datasets presented in this paper are available under open source licenses in the supplements and documented at http://www.semanticsoftware.info/semantic-user-profiling-peerj-2016-supplements. Additionally, development releases of ScholarLens are available on our GitHub page: https://github.com/SemanticSoftwareLab/ScholarLens.
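The final step of the workflow described in this abstract, populating an RDF knowledge base with user profiles that link to LOD competence topics, can be illustrated with a minimal sketch. This is not the ScholarLens implementation: the `http://example.org/profile#` namespace, the property names `hasCompetenceRecord` and `competenceFor`, and the use of a DBpedia URI as the topic are all assumptions made for illustration, and the triples are serialized as N-Triples using only the Python standard library.

```python
from typing import Iterable, List, Tuple

Triple = Tuple[str, str, str]

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
# Hypothetical namespace and property names, not the vocabulary from the paper.
EX = "http://example.org/profile#"


def literal(text: str) -> str:
    """Return an N-Triples string literal with the basic escapes applied."""
    escaped = (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"'


def competence_triples(user_id: str, name: str, topic_uris: Iterable[str]) -> List[Triple]:
    """Build triples for one user and one competence record per LOD topic URI."""
    user = f"{EX}{user_id}"
    triples: List[Triple] = [
        (user, RDF_TYPE, f"{EX}User"),
        (user, RDFS_LABEL, literal(name)),
    ]
    for index, topic in enumerate(topic_uris):
        record = f"{EX}{user_id}-competence-{index}"
        triples.append((user, f"{EX}hasCompetenceRecord", record))
        triples.append((record, f"{EX}competenceFor", topic))
    return triples


def to_ntriples(triples: Iterable[Triple]) -> str:
    """Serialize triples as N-Triples; objects starting with a quote are literals."""
    lines = []
    for subject, predicate, obj in triples:
        rendered = obj if obj.startswith('"') else f"<{obj}>"
        lines.append(f"<{subject}> <{predicate}> {rendered} .")
    return "\n".join(lines)


if __name__ == "__main__":
    # The topic URI stands in for the output of an entity linking step.
    profile = competence_triples("u1", "Ada", ["http://dbpedia.org/resource/Linked_data"])
    print(to_ntriples(profile))
```

Giving each competence its own record node, rather than linking the user to the topic directly, leaves room to attach further details to a record later, such as the source article.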


A Study on the Procedure for Constructing Linked Open Data of Records Information by Using Open Source Tool

Journal of the Korean Society for Information Management ◽

10.3743/kosim.2017.34.1.341 ◽

2017 ◽

Vol 34 (1) ◽

pp. 341-371 ◽

Cited By ~ 2

Author(s):

Seung Rok Ha ◽

Jin Hee Yim ◽

Hae-young Rieh

Keyword(s):

Open Source ◽

Open Data ◽

Linked Open Data ◽

Open Source Tool


TOWARD A LINKED OPEN DATA REPOSITORY ABOUT VIETNAMESE TOURISM

KỶ YẾU HỘI NGHỊ KHOA HỌC CÔNG NGHỆ QUỐC GIA LẦN THỨ XI NGHIÊN CỨU CƠ BẢN VÀ ỨNG DỤNG CÔNG NGHỆ THÔNG TIN ◽

10.15625/vap.2018.00067 ◽

2018 ◽

Author(s):

Le Anh Tien ◽

Cao Tuan Dung

Keyword(s):

Open Data ◽

Linked Open Data ◽

Data Repository


Opportunités et défis. Linked (Open) Data

Dialogues avec la machine - Arabesques ◽

10.35562/arabesques.1397 ◽

2019 ◽

pp. 4-6

Author(s):

Makx Dekkers

Keyword(s):

Open Data ◽

Linked Open Data


Europeana no Linked Open Data: conceitos de Web Semântica na dimensão aplicada das humanidades digitais

Pesquisa Brasileira em Ciência da Informação e Biblioteconomia ◽

10.22478/ufpb.1981-0695.2017v12n2.36529 ◽

2017 ◽

Vol 12 (2) ◽

Author(s):

Caio Saraiva Coneglian ◽

José Eduardo Santarem Segundo

Keyword(s):

Linked Data ◽

Open Data ◽

Linked Open Data

The emergence of new technologies has introduced more efficient means of disseminating information and making it available. One initiative, called Europeana, has been promoting this adaptation of information objects within the Web, and more specifically within Linked Data. Accordingly, this study aims to present a discussion of the relationship between the Digital Humanities and Linked Open Data, as embodied by Europeana. To that end, we use an exploratory methodology that examines questions related to Europeana's data model, EDM, by means of SPARQL. As results, we come to understand the characteristics of EDM through the use of SPARQL. We also identify the importance of the Digital Humanities concept within the context of Europeana.

Keywords: Semantic Web. Linked open data. Digital humanities. Europeana. EDM.

Link: https://periodicos.ufsc.br/index.php/eb/article/view/1518-2924.2017v22n48p88/33031
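As a rough illustration of exploring EDM through SPARQL, the sketch below builds a query that lists resources typed as `edm:ProvidedCHO` and encodes it as a SPARQL Protocol GET request. The abstract does not give the authors' queries, so the query shape is an assumption, and the endpoint `https://example.org/sparql` is a placeholder rather than a real service. No network request is made.

```python
from urllib.parse import urlencode

# Namespace of the Europeana Data Model (EDM).
EDM_NS = "http://www.europeana.eu/schemas/edm/"


def build_edm_query(limit: int = 10) -> str:
    """Return a SPARQL query listing resources typed as edm:ProvidedCHO."""
    if limit < 1:
        raise ValueError("limit must be a positive integer")
    return (
        f"PREFIX edm: <{EDM_NS}>\n"
        "SELECT ?cho WHERE {\n"
        "  ?cho a edm:ProvidedCHO .\n"
        "}\n"
        f"LIMIT {limit}"
    )


def build_request_url(endpoint: str, query: str) -> str:
    """Encode the query as a SPARQL Protocol GET request (the 'query' parameter)."""
    return f"{endpoint}?{urlencode({'query': query})}"


if __name__ == "__main__":
    query = build_edm_query(limit=5)
    print(query)
    # Placeholder endpoint, used only to show the URL encoding.
    print(build_request_url("https://example.org/sparql", query))
```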


Decentralized Linked Open Data in Constrained Wireless Sensor Networks

2020 7th International Conference on Internet of Things: Systems, Management and Security (IOTSMS) ◽

10.1109/iotsms52051.2020.9340221 ◽

2020 ◽

Author(s):

Bart Moons ◽

Flor Sanders ◽

Thijs Paelman ◽

Jeroen Hoebeke

Keyword(s):

Wireless Sensor Networks ◽

Sensor Networks ◽

Open Data ◽

Linked Open Data ◽

Wireless Sensor


A Hybrid Approach Combining R*-Tree and k-d Trees to Improve Linked Open Data Query Performance

Applied Sciences ◽

10.3390/app11052405 ◽

2021 ◽

Vol 11 (5) ◽

pp. 2405

Author(s):

Yuxiang Sun ◽

Tianyi Zhao ◽

Seulgi Yoon ◽

Yongju Lee

Keyword(s):

Flash Memory ◽

Query Language ◽

Hybrid Approach ◽

Open Data ◽

Main Memory ◽

Linked Open Data ◽

Index Structure ◽

Identification Algorithm ◽

Distributed Computing Systems ◽

Query Performance

The Semantic Web has recently gained traction with the use of Linked Open Data (LOD) on the Web. Although numerous state-of-the-art methodologies, standards, and technologies are applicable to the LOD cloud, many issues persist. Because the LOD cloud is based on graph-based Resource Description Framework (RDF) triples and the SPARQL query language, we cannot directly adopt traditional techniques employed for database management systems or distributed computing systems. This paper addresses how the LOD cloud can be efficiently organized, retrieved, and evaluated. We propose a novel hybrid approach that combines the index and live exploration approaches for improved LOD join query performance. Using a two-step index structure combining a disk-based 3D R*-tree with the extended multidimensional histogram and flash memory-based k-d trees, we can efficiently discover interlinked data distributed across multiple resources. Because this method rapidly prunes numerous false hits, the performance of join query processing is remarkably improved. We also propose a hot-cold segment identification algorithm to identify regions of high interest. The proposed method is compared with existing popular methods on real RDF datasets. Results indicate that our method outperforms the existing methods because it can quickly obtain target results by reducing unnecessary data scanning and reducing the amount of main memory required to load filtering results.
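The pruning idea behind the k-d tree component can be illustrated with a small in-memory sketch. It is a simplification and not the paper's two-step index: it assumes each RDF triple has already been encoded as a 3D point of integer IDs (subject, predicate, object), which the abstract does not specify, and it omits the R*-tree, the histogram, and any disk or flash storage. A range search skips every subtree that cannot intersect the query box.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Assumed encoding: (subject_id, predicate_id, object_id).
Point = Tuple[int, int, int]


@dataclass
class Node:
    point: Point
    axis: int
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build(points: List[Point], depth: int = 0) -> Optional[Node]:
    """Build a k-d tree by splitting on the median, cycling through the 3 axes."""
    if not points:
        return None
    axis = depth % 3
    ordered = sorted(points, key=lambda p: p[axis])
    mid = len(ordered) // 2
    return Node(
        point=ordered[mid],
        axis=axis,
        left=build(ordered[:mid], depth + 1),
        right=build(ordered[mid + 1:], depth + 1),
    )


def range_search(node: Optional[Node], lo: Point, hi: Point,
                 out: Optional[List[Point]] = None) -> List[Point]:
    """Collect all points p with lo[i] <= p[i] <= hi[i] on every axis."""
    if out is None:
        out = []
    if node is None:
        return out
    if all(lo[i] <= node.point[i] <= hi[i] for i in range(3)):
        out.append(node.point)
    split = node.point[node.axis]
    # The left subtree holds values <= split, so it is skipped when lo > split.
    if lo[node.axis] <= split:
        range_search(node.left, lo, hi, out)
    # The right subtree holds values >= split, so it is skipped when hi < split.
    if hi[node.axis] >= split:
        range_search(node.right, lo, hi, out)
    return out


if __name__ == "__main__":
    triples = [(1, 1, 1), (1, 2, 5), (2, 1, 3), (3, 3, 3), (1, 1, 9)]
    tree = build(triples)
    # All triples whose subject ID is 1, with any predicate and object.
    print(sorted(range_search(tree, (1, 0, 0), (1, 9, 9))))
```

Fixing one coordinate and leaving the other two ranges wide corresponds to a triple pattern with one bound term, which is the kind of lookup a join query issues repeatedly.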


Heritage Connector: A Machine Learning Framework for Building Linked Open Data from Museum Collections

Applied AI Letters ◽

10.1002/ail2.23 ◽

2021 ◽

Author(s):

Kalyan Dutia ◽

John Stack

Keyword(s):

Machine Learning ◽

Open Data ◽

Linked Open Data ◽

Museum Collections ◽

Learning Framework


Using Open Source, Open Data, and Civic Technology to Address the COVID-19 Pandemic and Infodemic

Yearbook of Medical Informatics ◽

10.1055/s-0041-1726488 ◽

2021 ◽

Author(s):

Shinji Kobayashi ◽

Luis Falcón ◽

Hamish Fraser ◽

Jørn Braa ◽

Pamod Amarakoon ◽

...

Keyword(s):

Open Source ◽

Medical Informatics ◽

Open Source Software ◽

Collective Intelligence ◽

Collaborative Work ◽

Open Data ◽

Theme Issue ◽

Health Organizations ◽

The World ◽

Civic Technology

Objectives: The emerging COVID-19 pandemic has caused one of the world’s worst health disasters, compounded by social confusion and misinformation, the so-called “Infodemic”. In this paper, we discuss how open technology approaches, including data sharing, visualization, and tooling, can address the COVID-19 pandemic and infodemic. Methods: In response to the call for participation in the 2020 International Medical Informatics Association (IMIA) Yearbook theme issue on Medical Informatics and the Pandemic, the IMIA Open Source Working Group surveyed recent works related to the use of Free/Libre/Open Source Software (FLOSS) for this pandemic. Results: FLOSS health care projects, including GNU Health, OpenMRS, DHIS2, and others, have responded from the early phase of this pandemic. Data related to COVID-19 have been published by health organizations all over the world. Civic Technology and the collaborative work of FLOSS and open data groups were considered to support collective intelligence on approaches to managing the pandemic. Conclusion: FLOSS and open data have been effectively used to contribute to managing the COVID-19 pandemic, and open approaches to collaboration can improve trust in data.


Enrichment of EHR with Linked Open Data for Risk Factors Identification

Proceedings of the 20th International Conference on Computer Systems and Technologies - CompSysTech '19 ◽

10.1145/3345252.3345290 ◽

2019 ◽

Author(s):

Svetla Boytcheva ◽

Galia Angelova ◽

Zhivko Angelov ◽

Dimitar Tcharaktchiev ◽

Vlayko Vodenicharov

Keyword(s):

Risk Factors ◽

Open Data ◽

Linked Open Data
