Unseen Entity Handling in Complex Question Answering over Knowledge Base via Language Generation

Author(s):  
Xin Huang ◽  
Jung-Jae Kim ◽  
Bowei Zou
Author(s):  
Yongrui Chen ◽  
Huiying Li ◽  
Yuncheng Hua ◽  
Guilin Qi

Formal query building is an important part of complex question answering over knowledge bases. It aims to build correct executable queries for questions. Recent methods try to rank candidate queries generated by a state-transition strategy. However, this candidate generation strategy ignores the structure of queries, resulting in a considerable number of noisy queries. In this paper, we propose a new formal query building approach that consists of two stages. In the first stage, we predict the query structure of the question and leverage the structure to constrain the generation of the candidate queries. We propose a novel graph generation framework to handle the structure prediction task and design an encoder-decoder model to predict the argument of the predetermined operation in each generative step. In the second stage, we follow the previous methods to rank the candidate queries. The experimental results show that our formal query building approach outperforms existing methods on complex questions while staying competitive on simple questions.
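The two-stage pipeline described above can be sketched roughly as follows. All class and function names here (QueryGraph, predict_structure, generate_candidates, rank) are hypothetical illustrations of the described workflow, not the authors' actual code or API, and the toy heuristics stand in for the learned encoder-decoder and ranking models.

```python
# Minimal sketch, assuming hypothetical helpers: stage 1 predicts a query
# structure and uses it to constrain candidate generation; stage 2 ranks.
from dataclasses import dataclass, field


@dataclass
class QueryGraph:
    """A predicted query structure: one predetermined operation per step."""
    operations: list = field(default_factory=list)


def predict_structure(question: str) -> QueryGraph:
    """Stage 1a (stand-in for the graph generation framework): an
    encoder-decoder would predict the argument of each operation."""
    graph = QueryGraph()
    if "both" in question or " and " in question:
        graph.operations.append(("add_edge",))  # conjunctive / multi-hop
    graph.operations.append(("add_edge",))
    return graph


def generate_candidates(question: str, structure: QueryGraph) -> list:
    """Stage 1b: only instantiate queries matching the predicted structure,
    pruning the noisy candidates an unconstrained search would emit."""
    return [f"SELECT ?x WHERE {{ ... }}  # {len(structure.operations)} edge(s)"]


def rank(question: str, candidates: list) -> str:
    """Stage 2: rank candidates (previous ranking methods reused here)."""
    return max(candidates, key=lambda q: 0)  # placeholder scorer


question = "Who directed the film that won both awards?"
best = rank(question, generate_candidates(question, predict_structure(question)))
```

The point of the sketch is the data flow: the structure prediction output gates candidate generation, so the ranker in stage 2 sees far fewer noisy queries.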


Author(s):  
D. A. Evseev ◽  
M. Yu. Arkhipov
In this paper we describe a question answering system for answering complex questions over the Wikidata knowledge base. Unlike simple questions, which require the extraction of a single fact from the knowledge base, complex questions are based on more than one triplet and require logical or comparative reasoning. The proposed question answering system translates a natural language question into a query in the SPARQL language, the execution of which gives the answer. The system includes models that determine the SPARQL query template corresponding to the question and then fill the slots in the template with entities, relations, and numerical values. For entity detection, we use a BERT-based sequence labelling model. Ranking of candidate relations is performed in two steps with BiLSTM and BERT-based models. The proposed models are the first solution for the LC-QuAD 2.0 dataset. The system is capable of answering complex questions that involve comparative or Boolean reasoning.
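The template-then-slot-filling pipeline above can be illustrated with a minimal sketch. The template set, the keyword-based classifier, and the slot dictionary are all invented for illustration; in the described system these decisions are made by BERT and BiLSTM models.

```python
# Hedged sketch: pick a SPARQL template for the question, then fill its
# entity/relation slots. Wikidata-style prefixes (wd:, wdt:) are used.
TEMPLATES = {
    "simple": "SELECT ?a WHERE {{ wd:{e1} wdt:{r1} ?a . }}",
    "boolean": "ASK {{ wd:{e1} wdt:{r1} wd:{e2} . }}",
    "comparative": "SELECT ?a WHERE {{ ?a wdt:{r1} ?v . }} ORDER BY DESC(?v) LIMIT 1",
}


def classify_template(question: str) -> str:
    """Stand-in for the template-classification model."""
    q = question.lower()
    if q.startswith(("is ", "are ", "does ")):
        return "boolean"  # Boolean reasoning -> ASK query
    if any(w in q for w in ("most", "highest", "largest")):
        return "comparative"  # comparative reasoning -> ORDER BY
    return "simple"


def fill_slots(template_key: str, slots: dict) -> str:
    """Stand-in for the entity-detection and relation-ranking models."""
    return TEMPLATES[template_key].format(**slots)


# e1=Q64 (Berlin), r1=P1376 (capital of), e2=Q183 (Germany)
query = fill_slots(classify_template("Is Berlin the capital of Germany?"),
                   {"e1": "Q64", "r1": "P1376", "e2": "Q183"})
```

Separating template choice from slot filling mirrors the abstract's two-step design: the template fixes the query's logical shape, and entity/relation models only need to fill named holes.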


Author(s):  
Yuncheng Hua ◽  
Yuan-Fang Li ◽  
Gholamreza Haffari ◽  
Guilin Qi ◽  
Wei Wu

A compelling approach to complex question answering is to convert the question to a sequence of actions, which can then be executed on the knowledge base to yield the answer, aka the programmer-interpreter approach. By using training questions similar to the test question, meta-learning enables the programmer to quickly adapt to unseen questions and tackle potential distributional biases. However, this comes at the cost of manually labeling similar questions to learn a retrieval model, which is tedious and expensive. In this paper, we present a novel method that automatically learns a retrieval model alternately with the programmer from weak supervision, i.e., the system’s performance with respect to the produced answers. To the best of our knowledge, this is the first attempt to train the retrieval model jointly with the programmer. Our system achieves state-of-the-art performance on a large-scale task for complex question answering over knowledge bases. We have released our code at https://github.com/DevinJake/MARL.
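The alternating training loop described above can be sketched as follows. The classes, the score-update rule, and the reward function are simplified placeholders invented for illustration, not the released MARL implementation.

```python
# Hedged sketch: a retriever picks support questions for meta-learning,
# the programmer adapts and answers, and the answer reward (the only,
# weak, supervision signal) updates both models in turn.
class Retriever:
    def __init__(self, pool):
        self.pool = pool
        self.scores = {q: 0.0 for q in pool}  # learned similarity scores

    def top_k(self, question, k=2):
        return sorted(self.pool, key=lambda q: -self.scores[q])[:k]

    def update(self, support, reward):
        for q in support:  # reward-weighted update (REINFORCE-like stand-in)
            self.scores[q] += 0.1 * reward


class Programmer:
    def adapt(self, support):  # meta-learning step on retrieved questions
        pass

    def run(self, question):  # emit an action sequence, execute on the KB
        return "answer"


def reward_fn(predicted, gold):  # weak supervision: answer match only
    return 1.0 if predicted == gold else -1.0


pool = ["q1", "q2", "q3"]
retriever, programmer = Retriever(pool), Programmer()
for question, gold in [("who directed X and Y?", "answer")]:
    support = retriever.top_k(question)
    programmer.adapt(support)
    r = reward_fn(programmer.run(question), gold)
    retriever.update(support, r)  # retriever learns from the same reward
```

The key idea the sketch captures is that no similarity labels appear anywhere: the retriever's only training signal is the reward the programmer earns on the questions it retrieved.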


2021 ◽  
Author(s):  
Jorão Gomes Jr. ◽  
Rômulo Chrispim de Mello ◽  
Ana Beatriz Kapps dos Reis ◽  
Victor Ströele ◽  
Jairo Francisco de Souza

Advances in Question Answering systems have achieved important results, and new related problems, such as Complex Question Answering and Knowledge Base Question Answering, have emerged. However, there is a lack of studies analysing the problem of, and the approaches to, Complex Knowledge Base Question Answering (C-KBQA). This work fills that gap by presenting an overview of C-KBQA. A collection of 54 articles was selected, and a map of the methods, approaches, trends, and gaps in C-KBQA was produced. It is shown that multi-hop and constrained questions are the two types of questions addressed in the literature. Three steps were identified for building a C-KBQA system, and two approaches are generally used.


Author(s):  
Yu Feng ◽  
Jing Zhang ◽  
Gaole He ◽  
Wayne Xin Zhao ◽  
Lemao Liu ◽  
...  

2016 ◽  
Vol 31 (2) ◽  
pp. 97-123 ◽  
Author(s):  
Alfred Krzywicki ◽  
Wayne Wobcke ◽  
Michael Bain ◽  
John Calvo Martinez ◽  
Paul Compton

Data mining techniques for extracting knowledge from text have been applied extensively to applications including question answering, document summarisation, event extraction and trend monitoring. However, current methods have mainly been tested on small-scale customised data sets for specific purposes. The availability of large volumes of data and high-velocity data streams (such as social media feeds) motivates the need to automatically extract knowledge from such data sources and to generalise existing approaches to more practical applications. Recently, several architectures have been proposed for what we call knowledge mining: integrating data mining for knowledge extraction from unstructured text (possibly making use of a knowledge base), and at the same time, consistently incorporating this new information into the knowledge base. After describing a number of existing knowledge mining systems, we review the state-of-the-art literature on both current text mining methods (emphasising stream mining) and techniques for the construction and maintenance of knowledge bases. In particular, we focus on mining entities and relations from unstructured text data sources, entity disambiguation, entity linking and question answering. We conclude by highlighting general trends in knowledge mining research and identifying problems that require further research to enable more extensive use of knowledge bases.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Shaofei Wang ◽  
Depeng Dang

Purpose
Previous knowledge base question answering (KBQA) models consider only the monolingual scenario and cannot be directly extended to the cross-lingual scenario, in which the language of the questions differs from that of the knowledge base (KB). Although a machine translation (MT) model can bridge the gap by translating questions into the language of the KB, the noise in translated questions can accumulate and sharply impair the final performance. Therefore, the authors propose a method to improve the robustness of KBQA models in the cross-lingual scenario.

Design/methodology/approach
The authors propose a knowledge distillation-based robustness enhancement (KDRE) method. Specifically, a monolingual model (teacher) is first trained on ground truth (GT) data. Then, to imitate practical noise, a noise-generating model is designed to inject two types of noise into questions: general noise and translation-aware noise. Finally, the noisy questions are fed into the student model. Meanwhile, the student model is jointly trained on GT data and distilled data, which are derived from the teacher when it is fed GT questions.

Findings
The experimental results demonstrate that KDRE can improve the performance of models in the cross-lingual scenario. The performance of each module in the KBQA model is improved by KDRE. The knowledge distillation (KD) and the noise-generating model in the method complementarily boost the robustness of the models.

Originality/value
The authors are the first to extend KBQA models from the monolingual to the cross-lingual scenario, and the first to implement KD for KBQA to develop robust cross-lingual models.
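The teacher-student-with-noise scheme described above can be sketched minimally. The models are stubbed, the noise generators only simulate "general" (word drop) and "translation-aware" (reordering) noise, and the loss terms are placeholders for real cross-entropy/KL losses; all names here are illustrative, not the KDRE implementation.

```python
# Hedged sketch: the student trains on noisy questions against a joint
# loss combining ground-truth (GT) labels and the teacher's soft
# predictions on the clean questions (distillation).
def teacher_predict(clean_question):
    """Teacher trained on GT monolingual data; returns a soft target."""
    return {"relation": "capital_of", "confidence": 0.9}


def inject_noise(question, kind):
    """Stand-in for the noise-generating model."""
    words = question.split()
    if kind == "general":  # simulate generic corruption: drop every 3rd word
        words = [w for i, w in enumerate(words) if i % 3 != 2]
    elif kind == "translation":  # simulate MT-style reordering
        words = list(reversed(words))
    return " ".join(words)


def student_loss(noisy_q, gold_label, soft_target, alpha=0.5):
    """Joint loss: alpha * GT term + (1 - alpha) * distillation term."""
    gt_term = 0.0 if gold_label == "capital_of" else 1.0
    kd_term = 1.0 - soft_target["confidence"]
    return alpha * gt_term + (1 - alpha) * kd_term


clean = "what is the capital of Germany"
noisy = inject_noise(clean, "translation")
loss = student_loss(noisy, "capital_of", teacher_predict(clean))
```

The sketch shows why the student becomes robust: it only ever sees noisy inputs, yet its targets come from clean-question supervision (GT labels plus the teacher's soft outputs).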

