NLP and the Representation of Data on the Semantic Web

One of the goals of data scientists and curators is to get information (contained in text) organized and integrated in a way that can be easily consumed by people and machines. A starting point for such a goal is to get a model to represent the information. This model should ease to obtain knowledge semantically (e.g., using reasoners and inferencing rules). In this sense, the Semantic Web is focused on representing the information through the Resource Description Framework (RDF) model, in which the triple (subject, predicate, object) is the basic unit of information. In this context, the natural language processing (NLP) field has been a cornerstone in the identification of elements that can be represented by triples of the Semantic Web. However, existing approaches for the representation of RDF triples from texts use diverse techniques and tasks for such purpose, which complicate the understanding of the process by non-expert users. This chapter aims to discuss the main concepts involved in the representation of the information through the Semantic Web and the NLP fields.

Download Full-text

Ethical Problems and Solutions in Natural Language Processing: Collection of Dialect Data as a Starting Point for Overcoming Age and Regional Biases

Journal of Research Methodology ◽

10.21487/jrm.2021.3.6.1.157 ◽

2021 ◽

Vol 6 (1) ◽

pp. 157-180

Author(s):

Jinung Kim

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Ethical Problems ◽

Starting Point ◽

Problems And Solutions

Download Full-text

Linked Open Statistical Metadata

Data Visualization and Statistical Literacy for Open and Big Data - Advances in Data Mining and Database Management ◽

10.4018/978-1-5225-2512-7.ch012 ◽

2017 ◽

pp. 297-320

Author(s):

Franck Cotton ◽

Daniel Gillman

Keyword(s):

Semantic Web ◽

Resource Description Framework ◽

Statistical Data ◽

Open Data ◽

Statistical Analyses ◽

Statistical Literacy ◽

Statistical Process ◽

Description Framework ◽

Resource Description ◽

The Web

Linked Open Statistical Metadata (LOSM) is Linked Open Data (LOD) applied to statistical metadata. LOD is a model for identifying, structuring, interlinking, and querying data published directly on the web. It builds on the standards of the semantic web defined by the W3C. LOD uses the Resource Description Framework (RDF), a simple data model expressing content as predicates linking resources between them or with literal properties. The simplicity of the model makes it able to represent any data, including metadata. We define statistical data as data produced through some statistical process or intended for statistical analyses, and statistical metadata as metadata describing statistical data. LOSM promotes discovery and the meaning and structure of statistical data in an automated way. Consequently, it helps with understanding and interpreting data and preventing inadequate or flawed visualizations for statistical data. This enhances statistical literacy and efforts at visualizing statistics.

Download Full-text

Technique for Transformation of Data From RDB to XML Then to RDF

Web Semantics for Textual and Visual Information Retrieval - Advances in Data Mining and Database Management ◽

10.4018/978-1-5225-2483-0.ch004 ◽

2017 ◽

pp. 70-91 ◽

Cited By ~ 1

Author(s):

Kaleem Razzaq Malik ◽

Tauqir Ahmad

Keyword(s):

Semantic Web ◽

Relational Database ◽

Resource Description Framework ◽

Data Science ◽

True Nature ◽

Extensible Markup ◽

Description Framework ◽

Mapping Techniques ◽

Resource Description ◽

The Web

This chapter will clearly show the need for better mapping techniques for Relational Database (RDB) all the way to Resource Description Framework (RDF). This includes coverage of each data model limitations and benefits for getting better results. Here, each form of data being transform has its own importance in the field of data science. As RDB is well known back end storage for information used to many kinds of applications; especially the web, desktop, remote, embedded, and network-based applications. Whereas, EXtensible Markup Language (XML) in the well-known standard for data for transferring among all computer related resources regardless of their type, shape, place, capability and capacity due to its form is in application understandable form. Finally, semantically enriched and simple of available in Semantic Web is RDF. This comes handy when with the use of linked data to get intelligent inference better and efficient. Multiple Algorithms are built to support this system experiments and proving its true nature of the study.

Download Full-text

Ontology evolution: a process-centric survey

The Knowledge Engineering Review ◽

10.1017/s0269888913000349 ◽

2013 ◽

Vol 30 (1) ◽

pp. 45-75 ◽

Cited By ~ 63

Author(s):

Fouad Zablith ◽

Grigoris Antoniou ◽

Mathieu d'Aquin ◽

Giorgos Flouris ◽

Haridimos Kondylakis ◽

...

Keyword(s):

Natural Language Processing ◽

Information Systems ◽

Semantic Web ◽

Natural Language ◽

Language Processing ◽

Process Model ◽

Ontology Evolution ◽

Depth Analysis ◽

Future Challenges ◽

Industrial Adoption

AbstractOntology evolution aims at maintaining an ontology up to date with respect to changes in the domain that it models or novel requirements of information systems that it enables. The recent industrial adoption of Semantic Web techniques, which rely on ontologies, has led to the increased importance of the ontology evolution research. Typical approaches to ontology evolution are designed as multiple-stage processes combining techniques from a variety of fields (e.g., natural language processing and reasoning). However, the few existing surveys on this topic lack an in-depth analysis of the various stages of the ontology evolution process. This survey extends the literature by adopting a process-centric view of ontology evolution. Accordingly, we first provide an overall process model synthesized from an overview of the existing models in the literature. Then we survey the major approaches to each of the steps in this process and conclude on future challenges for techniques aiming to solve that particular stage.

Download Full-text

A Natural Language Processing for Semantic Web Services

EUROCON 2005 - The International Conference on "Computer as a Tool" ◽

10.1109/eurcon.2005.1629902 ◽

2005 ◽

Cited By ~ 1

Author(s):

M. Stanojevic ◽

S. Vranes

Keyword(s):

Natural Language Processing ◽

Web Services ◽

Semantic Web ◽

Natural Language ◽

Language Processing ◽

Semantic Web Services

Download Full-text

Leveraging the semantic web and natural language processing to enhance drug-mechanism knowledge in drug product labels

Proceedings of the ACM international conference on Health informatics - IHI '10 ◽

10.1145/1882992.1883070 ◽

2010 ◽

Author(s):

Richard Boyce ◽

Henk Harkema ◽

Mike Conway

Keyword(s):

Natural Language Processing ◽

Semantic Web ◽

Natural Language ◽

Language Processing ◽

Drug Product ◽

Drug Mechanism

Download Full-text

Database Tuning using Natural Language Processing

ACM SIGMOD Record ◽

10.1145/3503780.3503788 ◽

2021 ◽

Vol 50 (3) ◽

pp. 27-28

Author(s):

Immanuel Trummer

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Training Data ◽

Language Models ◽

Learning Approaches ◽

Training Samples ◽

Starting Point ◽

Training Cost ◽

Transformer Model

Introduction. We have seen significant advances in the state of the art in natural language processing (NLP) over the past few years [20]. These advances have been driven by new neural network architectures, in particular the Transformer model [19], as well as the successful application of transfer learning approaches to NLP [13]. Typically, training for specific NLP tasks starts from large language models that have been pre-trained on generic tasks (e.g., predicting obfuscated words in text [5]) for which large amounts of training data are available. Using such models as a starting point reduces task-specific training cost as well as the number of required training samples by orders of magnitude [7]. These advances motivate new use cases for NLP methods in the context of databases.

Download Full-text

Analysis of RDF Syntaxes for Semantic Web Development

Applied Computer Systems ◽

10.1515/acss-2015-0017 ◽

2015 ◽

Vol 18 (1) ◽

pp. 33-42 ◽

Cited By ~ 1

Author(s):

Yevgeny Gryaznov ◽

Pavel Rusakov

Keyword(s):

Semantic Web ◽

Directed Graph ◽

Resource Description Framework ◽

Formal Model ◽

Information Representation ◽

Web Development ◽

Rdf Data ◽

Description Framework ◽

Rdf Graphs ◽

Resource Description

Abstract In this paper authors perform a research on possibilities of RDF (Resource Description Framework) syntaxes usage for information representation in Semantic Web. It is described why pure XML cannot be effectively used for this purpose, and how RDF framework solves this problem. Information is being represented in a form of a directed graph. RDF is only an abstract formal model for information representation and side tools are required in order to write down that information. Such tools are RDF syntaxes – concrete text or binary formats, which prescribe rules for RDF data serialization. Text-based RDF syntaxes can be developed on the existing format basis (XML, JSON) or can be an RDF-specific – designed from scratch to serve the only purpose – to serialize RDF graphs. Authors briefly describe some of the RDF syntaxes (both XML and non-XML) and compare them in order to identify strengths and weaknesses of each version. Serialization and deserialization speed tests using Jena library are made. The results from both analytical and experimental parts of this research are used to develop the recommendations for RDF syntaxes usage and to design a RDF/XML syntax subset, which is intended to simplify the development and raise compatibility of information serialized with this RDF syntax.

Download Full-text

Survey of directly mapping SQL databases to the Semantic Web

The Knowledge Engineering Review ◽

10.1017/s0269888911000208 ◽

2011 ◽

Vol 26 (4) ◽

pp. 445-486 ◽

Cited By ~ 35

Author(s):

Juan F. Sequeda ◽

Syed Hamid Tirmizi ◽

Oscar Corcho ◽

Daniel P. Miranker

Keyword(s):

Semantic Web ◽

Resource Description Framework ◽

Information Sources ◽

The Internet ◽

Web Ontology Language ◽

Ontology Language ◽

Description Framework ◽

Resource Description ◽

Semantic Properties ◽

Rdf Schema

AbstractThe Semantic Web anticipates integrated access to a large number of information sources on the Internet represented as Resource Description Framework (RDF). Given the large number of websites that are backed by SQL databases, methods that automate the translation of those databases to RDF are crucial. One approach, taken by a number of researchers, is to directly map the SQL schema to an equivalent Web Ontology Language (OWL) or RDF Schema representation, which in turn, implies an RDF representation for the relational data. This paper reviews this research, and derives a consolidated, overarching set of translation rules expressible as a stratified Datalog program. We present all the possible key combinations in an SQL schema and consider their implied semantic properties. We review the approaches and characterize them with respect to the scope of their coverage of SQL constructs.

Download Full-text

SWedt: A Semantic Web Editor Integrating Ontologies and Semantic Annotations with Resource Description Framework

Advanced Int'l Conference on Telecommunications and Int'l Conference on Internet and Web Applications and Services (AICT-ICIW'06) ◽

10.1109/aict-iciw.2006.184 ◽

2006 ◽

Cited By ~ 4

Author(s):

R.G. Pereira ◽

M.M. Freire

Keyword(s):

Semantic Web ◽

Resource Description Framework ◽

Semantic Annotations ◽

Description Framework ◽

Resource Description

Download Full-text