Extraction, analysis and publication of bibliographical references within an institutional repository

2016 ◽  
Vol 34 (2) ◽  
pp. 259-267 ◽  
Author(s):  
Götz Hatop

Purpose – The academic tradition of appending a reference section that lists cited and otherwise related academic material provides a natural starting point for finding links to other publications. These links can then be published as linked data. Natural language processing technologies are available today that can extract bibliographical references from text. Publishing references by means of semantic web technologies is a prerequisite for a broader study and analysis of citations and thus can help to improve academic communication in a general sense. The paper aims to discuss these issues. Design/methodology/approach – This paper examines the overall workflow required to extract, analyze and semantically publish bibliographical references within an Institutional Repository with the help of open source software components. Findings – A publication infrastructure in which references are available to software agents would enable additional benefits such as citation analysis, e.g. the collection of citations of a known paper and the investigation of citation sentiment. The publication of reference information as demonstrated in this article is possible with existing semantic web technologies based on established ontologies and open source software components. Research limitations/implications – Only a limited number of metadata extraction programs have been considered for performance evaluation, and reference extraction was tested for journal articles only, whereas Institutional Repositories usually contain a large amount of other material, such as monographs. Also, citation analysis is in an experimental state and citation sentiment is currently not published at all. For future work, distributing reference information between repositories is an important problem that needs to be tackled. Originality/value – Publishing reference information as linked data is new within the academic publishing domain.
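
The publishing and analysis steps described in this abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not name the ontologies used, so the CiTO property `cito:cites` is an assumption, as are the `example.org` URIs and the function names `citation_triples` and `cited_by`.

```python
# Sketch: publish extracted citation links as N-Triples and query them.
# Assumptions (not from the paper): the CiTO "cites" property, the
# example.org URIs, and both function names are illustrative only.

CITO_CITES = "http://purl.org/spar/cito/cites"


def citation_triples(citing_uri, cited_uris):
    """Return one N-Triples line per (citing, cites, cited) link."""
    return [f"<{citing_uri}> <{CITO_CITES}> <{cited}> ." for cited in cited_uris]


def cited_by(triples, target_uri):
    """Collect the documents that cite target_uri (a simple citation analysis)."""
    suffix = f"<{CITO_CITES}> <{target_uri}> ."
    # The subject is the first whitespace-separated token of each line.
    return [t.split(" ", 1)[0].strip("<>") for t in triples if t.endswith(suffix)]


if __name__ == "__main__":
    triples = citation_triples(
        "http://example.org/repo/article/1",
        ["http://example.org/repo/article/2", "http://example.org/repo/article/3"],
    )
    triples += citation_triples(
        "http://example.org/repo/article/4",
        ["http://example.org/repo/article/2"],
    )
    for line in triples:
        print(line)
    print(cited_by(triples, "http://example.org/repo/article/2"))
```

Once links are exposed in this form, collecting the citations of a known paper reduces to matching on the object of each triple; a production setup would more likely use a triple store and SPARQL than string matching.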

Author(s):  
Jose María Alvarez Rodríguez ◽  
José Emilio Labra Gayo ◽  
Patricia Ordoñez de Pablos

The aim of this chapter is to present a proposal and a case study for describing information about organizations in a standard way using the Linked Data approach. Several models and ontologies have been provided in order to formalize the data, structure and behaviour of organizations. Nevertheless, these attempts have not been fully accepted due to several factors: (1) missing pieces to define the status of the organization; (2) tangled parts to specify the structure (concepts and relations) between the elements of the organization; (3) lack of text properties, and other factors. These divergences result in a set of incomplete approaches to formalizing data and information about organizations. Taking into account the current trend of applying semantic web technologies and linked data to formalize, aggregate, and share domain-specific information, a new model for organizations that takes advantage of these initiatives is required in order to overcome existing barriers and exploit corporate information in a standard way. This work is especially relevant in that it aims to: (1) unify existing models to provide a common specification; (2) apply semantic web technologies and the Linked Data approach; (3) provide access to the information via standard protocols, and (4) offer new services that can exploit this information to trace the evolution and behaviour of the organization over time. Finally, this work can help to improve clarity and transparency in scenarios in which organizations play a key role, like e-procurement, e-health, or financial transactions.


2010 ◽  
Vol 2 (2) ◽  
pp. 29-40 ◽  
Author(s):  
Olivier Berger ◽  
Valentin Vlasceanu ◽  
Christian Bac ◽  
Quang Vu Dang ◽  
Stéphane Lauriere

Several public repositories and archives of “facts” about libre software projects, maintained either by open source communities or by research communities, have been flourishing on the Web in recent years. These have enabled new analyses and support for new quality assurance tasks. This paper presents some complementary existing tools, projects and models, proposed both by OSS actors and by research initiatives, that are likely to lead to useful future developments for the study of the FLOSS phenomenon and for the practitioners in FLOSS development projects. A goal of the research conducted within the HELIOS project is to address bug traceability issues. In this regard, the authors investigate the potential of using Semantic Web technologies to navigate between the many different bugtracker systems scattered all over the open source ecosystem. By using Semantic Web techniques, it is possible to interconnect the databases containing data about the development of open-source software projects, which enables OSS partakers to identify resources, annotate them, and further interlink them using dedicated properties, collectively designing a distributed semantic graph.


Author(s):  
Amrapali Zaveri ◽  
Andrea Maurino ◽  
Laure Berti-Equille

The standardization and adoption of Semantic Web technologies have resulted in an unprecedented volume of data being published as Linked Data (LD). However, the “publish first, refine later” philosophy leads to various quality problems in the underlying data, such as incompleteness, inconsistency and semantic ambiguities. In this article, we describe the current state of Data Quality in the Web of Data along with details of the three papers accepted for the International Journal on Semantic Web and Information Systems' (IJSWIS) Special Issue on Web Data Quality. Additionally, we identify new challenges that are specific to the Web of Data and provide insights into the current progress and future directions for each of those challenges.


2017 ◽  
Vol 51 (4) ◽  
pp. 387-405 ◽  
Author(s):  
Dunia Llanes-Padrón ◽  
Juan-Antonio Pastor-Sánchez

Purpose – The purpose of this paper is to examine Records in Contexts, the conceptual model (RiC-CM) for archival description proposed by the International Council on Archives (ICA), and to propose an OWL ontology for its implementation in the semantic web. Design/methodology/approach – The various elements of the model are studied and related to earlier norms in order to understand their structure and the modeling of the ontology. Findings – The analysis reveals the integrating nature of RiC-CM and the possibilities it offers for greater interoperability of data from archival descriptions. Two versions of an OWL ontology were developed to represent the conceptual model. The first makes a direct transposition of the conceptual model; the second optimizes the properties and relations in order to simplify the use and maintenance of the ontology. Research limitations/implications – The proposed ontology will follow the considerations of the final version of the ICA’s RiC-CM. Practical implications – The analysis affords an understanding of the role of RiC-CM in publishing online archival data sets, while the ontology is an initial approach to the semantic web technologies involved. Originality/value – This paper offers an overview of Records in Contexts with respect to its advantages in the field of semantic interoperability, and constitutes the first proposal of an ontology based on the conceptual model.


2019 ◽  
Vol 22 (2) ◽  
Author(s):  
Felipe Augusto Arakaki ◽  
Caio Saraiva Coneglian ◽  
Plácida Leopoldina Ventura Amorim da Costa Santos ◽  
José Eduardo Santarem Segundo

Considering the expansion of scientific production in digital information environments and the new ways of making data available following the principles of Linked Data, the objective is to discuss possibilities for relating datasets and semantically enriching metadata in digital repositories, and to present a model for converting records to RDF. This is a theoretical and exploratory study based on a bibliographic review of digital repositories and Linked Data. It demonstrates possibilities of the conversion process, identifying the databases, vocabularies and standards that must be adopted so that the generated data is semantically enriched. The work presents a model that reflects the steps to be taken when making the metadata of a digital repository available as Linked Data. It is concluded that the integration between digital repositories and Semantic Web technologies allows data to be made available as Linked Data, which provides new means for the dissemination and integration of resources on the Web.
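
The conversion of repository records to RDF discussed in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' model: the record layout, the `example.org` URI, and the function names `escape_literal` and `record_to_ntriples` are hypothetical, and the Dublin Core Terms namespace is chosen here only as a commonly used vocabulary.

```python
# Sketch: convert a flat metadata record into N-Triples using Dublin Core Terms.
# Assumptions (not from the paper): the record layout, the example.org URI and
# both function names are illustrative only.

DCTERMS = "http://purl.org/dc/terms/"


def escape_literal(value):
    """Escape backslashes, quotes and line breaks for an N-Triples literal."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def record_to_ntriples(subject_uri, record):
    """Emit one triple per field value; fields map to dcterms properties."""
    lines = []
    for field, values in record.items():
        if isinstance(values, str):
            values = [values]  # treat a single value like a one-item list
        for value in values:
            lines.append(
                f'<{subject_uri}> <{DCTERMS}{field}> "{escape_literal(value)}" .'
            )
    return lines


if __name__ == "__main__":
    record = {
        "title": "Linked Data in digital repositories",
        "creator": ["A. Author", "B. Author"],
    }
    for line in record_to_ntriples("http://example.org/item/1", record):
        print(line)
```

Semantic enrichment in the sense of the abstract would go one step further and replace plain literals (for example creator names) with URIs from external datasets; that step is omitted here.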


Gamification ◽  
2015 ◽  
pp. 273-295 ◽  
Author(s):  
Irene Celino ◽  
Daniele Dell'Aglio

Knowledge-rich learning environments like simulation learning sessions call for the adoption of knowledge technologies to effectively manage information and data related to the learning supply and to the observation analysis. In this chapter, the authors illustrate the benefits and challenges of adopting Linked Data and Semantic Web technologies to model, store, update, collect, and interpret learning data in simulation environments. The experience gained in applying this approach to a Simulation Learning system based on Serious Games demonstrates the feasibility and the advantages of knowledge technologies in addressing and solving the issues faced by trainers and teachers in their daily practice.


2019 ◽  
Vol 15 (2) ◽  
pp. 236-254
Author(s):  
I-Cheng Chen ◽  
I-Ching Hsu

Purpose – In recent years, governments around the world have been actively promoting Open Government Data (OGD) to facilitate the reuse of open data and the development of information applications. Currently, there are more than 35,000 data sets available on the Taiwan OGD website. However, the existing Taiwan OGD website only provides keyword queries and lacks a friendly query interface. This study aims to address these issues by defining a DBpedia cloud computing framework (DCCF) for integrating DBpedia and Semantic Web technologies into a Spark cluster cloud computing environment. Design/methodology/approach – The proposed DCCF is used to develop a Taiwan OGD recommendation platform (TOGDRP) that provides a friendly query interface to automatically filter out the relevant data sets and visualize relationships between these data sets. Findings – To demonstrate the feasibility of TOGDRP, the experimental results illustrate the efficiency of the different cloud computing models, including the Hadoop YARN cluster model, the Spark standalone cluster model and the Spark YARN cluster model. Originality/value – The novel solution proposed in this study is a hybrid approach for integrating Semantic Web technologies into Hadoop and Spark cloud computing environments to provide OGD data set recommendation.


Semantic Web ◽  
2021 ◽  
Vol 12 (2) ◽  
pp. 163-167
Author(s):  
Antonis Bikakis ◽  
Eero Hyvönen ◽  
Stéphane Jean ◽  
Béatrice Markhoff ◽  
Alessandro Mosca

Cultural Heritage and Digital Humanities have become major application fields of Linked Data and Semantic Web technologies. This editorial introduces the special issue of the Semantic Web journal (SWJ) on Semantic Web for Cultural Heritage. In total, 30 submissions were received in response to the call for papers, of which 11 were selected for publication. The papers cover a wide spectrum of modelled topics related to language, reading and writing, narratives, historical events and cultural artefacts, while describing reusable methodologies and tools for cultural data management. This issue indicates and demonstrates the high potential of Semantic Web technologies for applications in the Cultural Heritage domain.


2021 ◽  
Author(s):  
Gillian Byrne ◽  
Lisa Goddard

Semantic Web technologies have immense potential to transform the Internet into a distributed reasoning machine that will not only execute extremely precise searches, but will also have the ability to analyze the data it finds to create new knowledge. This paper examines the state of Semantic Web (also known as Linked Data) tools and infrastructure to determine whether semantic technologies are sufficiently mature for non-expert use, and to identify some of the obstacles to global Linked Data implementation.


2018 ◽  
Vol 36 (5) ◽  
pp. 826-841 ◽  
Author(s):  
Shakeel Ahmad Khan ◽  
Rubina Bhatti

Purpose – The purpose of this paper is to explore useful Semantic Web technologies and ontology-based applications for digital libraries. It also investigates the perceptions of university librarians and academicians in Pakistan about Semantic Web technologies and their use in digital libraries. Design/methodology/approach – An exploratory research design based on the Delphi research strategy was used to answer the research questions. Interviews were conducted with a purposive sample of 50 key informants, including university librarians and academicians, to explore their perceptions about Semantic Web technologies and their use in digital libraries. Thematic analysis of the interview data was conducted to obtain results. Findings – The results showed that DuraCloud, Semantic information mashup, OntoEdit and the resource description framework (RDF) are among the Semantic Web applications that are useful for digital libraries to develop semantic relationships among digital content and increase its accessibility in the web environment. Findings revealed that the Semantic Web provides precise results and meets user information needs in an effective way. Results also showed that next-generation digital libraries use context-awareness technology, intelligent agent software and detecting sensors to analyze user information needs and provide dynamic information services. This paper recommends that librarians embrace the use of emerging web technologies in libraries and offer library services through the medium of the web. Practical implications – This paper envisages the future of digital library services and the Semantic Web applications that can be used to restructure the metadata of digital libraries. It has practical implications for librarians, who can consider the useful applications of the Semantic Web for digital libraries and enhance the interoperability of metadata among heterogeneous information systems. In practice, the results are useful for library schools and LIS teachers in updating their curricula by incorporating new content related to web languages and Semantic Web applications for digital libraries. Originality/value – This paper identifies various Semantic Web applications that are useful for developing Semantic Digital Libraries.

