Querying Knowledge Graphs in Natural Language

Abstract Knowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understanding of the structure of the underlying data models often based on the Resource Description Framework (RDF). This drawback has led to the development of Question-Answering (QA) systems that enable end-users to express their information needs in natural language. While existing systems simplify user access, there is still room for improvement in the accuracy of these systems. In this paper we propose a new QA system for translating natural language questions into SPARQL queries. The key idea is to break up the translation process into 5 smaller, more manageable sub-tasks and use ensemble machine learning methods as well as Tree-LSTM-based neural network models to automatically learn and translate a natural language question into a SPARQL query. The performance of our proposed QA system is empirically evaluated using the two renowned benchmarks - the 7th Question Answering over Linked Data Challenge (QALD-7) and the Large-Scale Complex Question Answering Dataset (LC-QuAD). Experimental results show that our QA system outperforms the state-of-art systems by 15% on the QALD-7 dataset and by 48% on the LC-QuAD dataset, respectively. In addition, we make our source code available.

Download Full-text

Querying knowledge graphs in natural language

Journal Of Big Data ◽

10.1186/s40537-020-00383-w ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Shiqi Liang ◽

Kurt Stockinger ◽

Tarcisio Mendes de Farias ◽

Maria Anisimova ◽

Manuel Gil

Keyword(s):

Natural Language ◽

Information Needs ◽

Large Scale ◽

Question Answering ◽

Deep Understanding ◽

Network Models ◽

Query Languages ◽

End Users ◽

Neural Network Models ◽

Knowledge Graphs

AbstractKnowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understanding of the structure of the underlying data models often based on the Resource Description Framework (RDF). This drawback has led to the development of Question-Answering (QA) systems that enable end-users to express their information needs in natural language. While existing systems simplify user access, there is still room for improvement in the accuracy of these systems. In this paper we propose a new QA system for translating natural language questions into SPARQL queries. The key idea is to break up the translation process into 5 smaller, more manageable sub-tasks and use ensemble machine learning methods as well as Tree-LSTM-based neural network models to automatically learn and translate a natural language question into a SPARQL query. The performance of our proposed QA system is empirically evaluated using the two renowned benchmarks-the 7th Question Answering over Linked Data Challenge (QALD-7) and the Large-Scale Complex Question Answering Dataset (LC-QuAD). Experimental results show that our QA system outperforms the state-of-art systems by 15% on the QALD-7 dataset and by 48% on the LC-QuAD dataset, respectively. In addition, we make our source code available.

Download Full-text

Detailed Simulation of Large Scale Neural Network Models

Computational Neuroscience ◽

10.1007/978-1-4757-9800-5_144 ◽

1997 ◽

pp. 931-935 ◽

Cited By ~ 1

Author(s):

Anders Lansner ◽

Örjan Ekeberg ◽

Erik Fransén ◽

Per Hammarlund ◽

Tomas Wilhelmsson

Keyword(s):

Neural Network ◽

Large Scale ◽

Network Models ◽

Neural Network Models ◽

Detailed Simulation

Download Full-text

Importance-Aware Learning for Neural Headline Editing

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6467 ◽

2020 ◽

Vol 34 (05) ◽

pp. 9282-9289

Author(s):

Qingyang Wu ◽

Lei Li ◽

Hao Zhou ◽

Ying Zeng ◽

Zhou Yu

Keyword(s):

Social Media ◽

Large Scale ◽

Network Models ◽

Language Models ◽

Neural Network Models ◽

Generation Task ◽

Social Media Platforms ◽

Editing Process ◽

Different Levels

Many social media news writers are not professionally trained. Therefore, social media platforms have to hire professional editors to adjust amateur headlines to attract more readers. We propose to automate this headline editing process through neural network models to provide more immediate writing support for these social media news writers. To train such a neural headline editing model, we collected a dataset which contains articles with original headlines and professionally edited headlines. However, it is expensive to collect a large number of professionally edited headlines. To solve this low-resource problem, we design an encoder-decoder model which leverages large scale pre-trained language models. We further improve the pre-trained model's quality by introducing a headline generation task as an intermediate task before the headline editing task. Also, we propose Self Importance-Aware (SIA) loss to address the different levels of editing in the dataset by down-weighting the importance of easily classified tokens and sentences. With the help of Pre-training, Adaptation, and SIA, the model learns to generate headlines in the professional editor's style. Experimental results show that our method significantly improves the quality of headline editing comparing against previous methods.

Download Full-text

Usage and Scaling of an Open-Source Spiking Multi-Area Model of Monkey Cortex

Lecture Notes in Computer Science - Brain-Inspired Computing ◽

10.1007/978-3-030-82427-3_4 ◽

2021 ◽

pp. 47-59

Author(s):

Sacha J. van Albada ◽

Jari Pronold ◽

Alexander van Meegen ◽

Markus Diesmann

Keyword(s):

Open Source ◽

Large Scale ◽

Network Models ◽

Macaque Monkey ◽

Source Model ◽

Model Specification ◽

Data Sets ◽

Neural Network Models ◽

Wide Range ◽

Ict Infrastructure

AbstractWe are entering an age of ‘big’ computational neuroscience, in which neural network models are increasing in size and in numbers of underlying data sets. Consolidating the zoo of models into large-scale models simultaneously consistent with a wide range of data is only possible through the effort of large teams, which can be spread across multiple research institutions. To ensure that computational neuroscientists can build on each other’s work, it is important to make models publicly available as well-documented code. This chapter describes such an open-source model, which relates the connectivity structure of all vision-related cortical areas of the macaque monkey with their resting-state dynamics. We give a brief overview of how to use the executable model specification, which employs NEST as simulation engine, and show its runtime scaling. The solutions found serve as an example for organizing the workflow of future models from the raw experimental data to the visualization of the results, expose the challenges, and give guidance for the construction of an ICT infrastructure for neuroscience.

Download Full-text

BUILD KNOWLEDGE GRAPH FROM HETEROGENEOUS DOCUMENTS

Journal of Science and Technology - IUH ◽

10.46242/jst-iuh.v47i05.761 ◽

2021 ◽

Vol 47 (05) ◽

Author(s):

NGUYỄN CHÍ HIẾU

Keyword(s):

Information Retrieval ◽

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Question Answering ◽

Semantic Analysis ◽

Knowledge Graph ◽

Question Answering Systems ◽

Knowledge Graphs

Knowledge Graphs are applied in many fields such as search engines, semantic analysis, and question answering in recent years. However, there are many obstacles for building knowledge graphs as methodologies, data and tools. This paper introduces a novel methodology to build knowledge graph from heterogeneous documents. We use the methodologies of Natural Language Processing and deep learning to build this graph. The knowledge graph can use in Question answering systems and Information retrieval especially in Computing domain

Download Full-text

The predictive skill of neural network models for the large-scale dynamics of the multi-level Lorenz '96 systems

10.1002/essoar.10508229.1 ◽

2021 ◽

Author(s):

Seoleun Shin

Keyword(s):

Neural Network ◽

Large Scale ◽

Network Models ◽

Neural Network Models ◽

Predictive Skill ◽

Multi Level ◽

Lorenz 96

Download Full-text

Artificial Neural Network Models for Large-Scale Data

Advances in Data Mining and Database Management - Handbook of Research on Big Data and the IoT ◽

10.4018/978-1-5225-7432-3.ch022 ◽

2019 ◽

pp. 406-439

Author(s):

Vo Ngoc Phu ◽

Vo Thi Ngoc Tran

Keyword(s):

Large Scale ◽

Network Models ◽

Data Sets ◽

Neural Network Models ◽

Large Scale Data ◽

The World ◽

Commercial Applications ◽

Artificial Neural Network Models ◽

Scale Data ◽

Large Scale Data Sets

Artificial intelligence (ARTINT) and information have been famous fields for many years. A reason has been that many different areas have been promoted quickly based on the ARTINT and information, and they have created many significant values for many years. These crucial values have certainly been used more and more for many economies of the countries in the world, other sciences, companies, organizations, etc. Many massive corporations, big organizations, etc. have been established rapidly because these economies have been developed in the strongest way. Unsurprisingly, lots of information and large-scale data sets have been created clearly from these corporations, organizations, etc. This has been the major challenges for many commercial applications, studies, etc. to process and store them successfully. To handle this problem, many algorithms have been proposed for processing these big data sets.

Download Full-text

Question Answering and Generation

Applied Natural Language Processing ◽

10.4018/978-1-60960-741-8.ch001 ◽

2012 ◽

pp. 1-16 ◽

Cited By ~ 3

Author(s):

Arthur C. Graesser ◽

Vasile Rus ◽

Zhiqiang Cai ◽

Xiangen Hu

Keyword(s):

Natural Language ◽

Language Processing ◽

Information Needs ◽

Question Answering ◽

Learning Technologies ◽

Question Asking ◽

Tutoring Systems ◽

Computer Based Training ◽

Recent Developments ◽

Text Meaning Representation

Automated Question Answering and Asking are two active areas of Natural Language Processing with the former dominating the past decade and the latter most likely to dominate the next one. Due to the vast amounts of information available electronically in the Internet era, automated Question Answering is needed to fulfill information needs in an efficient and effective manner. Automated Question Answering is the task of providing answers automatically to questions asked in natural language. Typically, the answers are retrieved from large collections of documents. While answering any question is difficult, successful automated solutions to answer some type of questions, so-called factoid questions, have been developed recently, culminating with the just announced Watson Question Answering system developed by I.B.M. to compete in Jeopardy-like games. The flip process, automated Question Asking or Generation, is about generating questions from some form of input such as a text, meaning representation, or database. Question Asking/Generation is an important component in the full gamut of learning technologies, from conventional computer-based training to tutoring systems. Advances in Question Asking/Generation are projected to revolutionize learning and dialogue systems. This chapter presents an overview of recent developments in Question Answering and Generation starting with the landscape of questions that people ask.

Download Full-text

Evaluation of Pre-Trained Convolutional Neural Network Models for Object Recognition

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i3.15.17509 ◽

2018 ◽

Vol 7 (3.15) ◽

pp. 95 ◽

Cited By ~ 1

Author(s):

M Zabir ◽

N Fazira ◽

Zaidah Ibrahim ◽

Nurbaity Sabri

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Large Scale ◽

Visual Recognition ◽

Error Function ◽

Network Models ◽

Neural Network Models ◽

California Institute Of Technology ◽

Institute Of Technology ◽

Similar Accuracy

This paper aims to evaluate the accuracy performance of pre-trained Convolutional Neural Network (CNN) models, namely AlexNet and GoogLeNet accompanied by one custom CNN. AlexNet and GoogLeNet have been proven for their good capabilities as these network models had entered ImageNet Large Scale Visual Recognition Challenge (ILSVRC) and produce relatively good results. The evaluation results in this research are based on the accuracy, loss and time taken of the training and validation processes. The dataset used is Caltech101 by California Institute of Technology (Caltech) that contains 101 object categories. The result reveals that custom CNN architecture produces 91.05% accuracy whereas AlexNet and GoogLeNet achieve similar accuracy which is 99.65%. GoogLeNet consistency arrives at an early training stage and provides minimum error function compared to the other two models.

Download Full-text

Leveraging Domain Context for Question Answering Over Knowledge Graph

Data Science and Engineering ◽

10.1007/s41019-019-00109-w ◽

2019 ◽

Vol 4 (4) ◽

pp. 323-335 ◽

Cited By ~ 3

Author(s):

Peihao Tong ◽

Qifan Zhang ◽

Junjie Yao

Keyword(s):

Information Needs ◽

Question Answering ◽

Knowledge Graph ◽

New Approach ◽

Rich Domain ◽

Complex Information ◽

Question And Answer ◽

Knowledge Graphs ◽

The Rich ◽

Matching Models

Abstract With the growing availability of different knowledge graphs in a variety of domains, question answering over knowledge graph (KG-QA) becomes a prevalent information retrieval approach. Current KG-QA methods usually resort to semantic parsing, search or neural matching models. However, they cannot well tackle increasingly long input questions and complex information needs. In this work, we propose a new KG-QA approach, leveraging the rich domain context in the knowledge graph. We incorporate the new approach with question and answer domain context descriptions. Specifically, for questions, we enrich them with users’ subsequent input questions within a session and expand the input question representation. For the candidate answers, we equip them with surrounding context structures, i.e., meta-paths within the targeting knowledge graph. On top of these, we design a cross-attention mechanism to improve the question and answer matching performance. An experimental study on real datasets verifies these improvements. The new approach is especially beneficial for specific knowledge graphs with complex questions.

Download Full-text