Developing MCQA Framework for Basic Science Subjects using Distributed Similarity Model and Classification Based Approaches

Author(s):  
Sandip Sarkar ◽  
Dipankar Das ◽  
Partha Pakray ◽  
David Eduardo Pinto Avendano

In this paper, we propose a novel approach to improving the performance of a multiple choice question answering (MCQA) system using distributed semantic similarity and a classification approach. We mainly focus on science-based MCQs, which are particularly difficult to handle. Our proposed method is based on the hypothesis that, in a distributional semantic model, the relatedness between a question and its correct answer will be higher than the relatedness between the question and the other options. We use the IJCNLP Shared Task 5 and SciQ datasets for our experiments. We build three models (Model 1, Model 2, and Model 3) based on the dataset formats. The basic difference between the IJCNLP Task 5 and SciQ datasets is that the SciQ dataset provides supporting text with each question, whereas the IJCNLP Task 5 dataset does not. Model 1 and Model 2 are mainly built for the IJCNLP Task 5 dataset, whereas Model 3 is mainly built for the SciQ dataset. Model 2 is designed to handle dependencies between options (e.g., "all of these", "two of them", "none of them"), whereas Model 1 is the basic MCQA model and cannot capture such dependencies. We also compare results on the SciQ dataset with supporting text (Model 3) and without it (Model 1), and we compare our system with other existing methods. Although the performance of our proposed method is not satisfactory in some cases, the system is simple and robust, which allows it to be integrated more easily into complex applications. This work investigates different techniques for choosing the correct answer to a given question in an MCQA system, and these experiments may therefore be useful for improving the performance of current science-based question answering (QA) systems. On the IJCNLP Task 5 dataset we achieve 44.5% using Model 2 and the PubMed dataset; similarly, on the SciQ dataset we achieve 82.25% using Model 3 and the PubMed dataset.
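As an illustration of the distributional-similarity hypothesis above, the sketch below scores each option by the cosine similarity between averaged word embeddings of the question and of the option and picks the highest-scoring one. The `word_vectors` lookup, `embed`, and `answer_mcq` names are illustrative placeholders rather than the paper's implementation; in the described setup the embeddings would come from a model trained on a corpus such as PubMed.

```python
import numpy as np

# Placeholder lookup: word -> vector, e.g. loaded from word2vec trained on PubMed.
word_vectors = {}

def embed(text, dim=300):
    """Average the word vectors of a text; unknown words are skipped."""
    vecs = [word_vectors[w] for w in text.lower().split() if w in word_vectors]
    return np.mean(vecs, axis=0) if vecs else np.zeros(dim)

def cosine(a, b):
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b) / denom if denom else 0.0

def answer_mcq(question, options):
    """Return the option whose embedding is closest to the question's (Model 1 style)."""
    q = embed(question)
    scores = [cosine(q, embed(opt)) for opt in options]
    return options[int(np.argmax(scores))]
```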

Author(s):  
Cao Liu ◽  
Shizhu He ◽  
Kang Liu ◽  
Jun Zhao

Because they return responses in natural language, natural answers are favored in real-world Question Answering (QA) systems. Generative models learn to automatically generate natural answers from large-scale question-answer pairs (QA-pairs). However, they suffer from the uncontrollable and uneven quality of QA-pairs crawled from the Internet. To address this problem, we propose a curriculum learning based framework for natural answer generation (CL-NAG), which is able to take full advantage of the valuable learning data in a noisy and uneven-quality corpus. Specifically, we employ two practical measures to automatically assess the quality (complexity) of QA-pairs. Based on these measurements, CL-NAG first uses simple and low-quality QA-pairs to learn a basic model, and then gradually learns to produce better answers with richer content and more complete syntax from more complex and higher-quality QA-pairs. In this way, all valuable information in the noisy and uneven-quality corpus can be fully exploited. Experiments demonstrate that CL-NAG outperforms the state of the art, improving accuracy by 6.8% and 8.7% for simple and complex questions, respectively.
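A minimal sketch of the curriculum schedule described above, assuming a placeholder `quality_score` heuristic and a generic `train_one_epoch` training step in place of the paper's two quality measures and its answer generator: QA-pairs are ranked by score and the training set grows stage by stage from simple/low-quality to complex/high-quality pairs.

```python
def quality_score(qa_pair):
    # Placeholder heuristic: answer length stands in for the paper's quality/complexity measures.
    return len(qa_pair["answer"].split())

def curriculum_train(model, qa_pairs, train_one_epoch, n_stages=3, epochs_per_stage=1):
    """Train in stages, growing the corpus from easy/low-quality to hard/high-quality pairs."""
    ranked = sorted(qa_pairs, key=quality_score)
    for stage in range(1, n_stages + 1):
        cutoff = int(len(ranked) * stage / n_stages)  # each stage adds the next, harder slice
        subset = ranked[:cutoff]
        for _ in range(epochs_per_stage):
            train_one_epoch(model, subset)  # placeholder for one pass of generator training
    return model
```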


2019 ◽  
Vol 5 (1) ◽  
Author(s):  
Jens Nevens ◽  
Paul Van Eecke ◽  
Katrien Beuls

In order to be able to answer a natural language question, a computational system needs three main capabilities. First, the system needs to be able to analyze the question into a structured query, revealing its component parts and how these are combined. Second, it needs to have access to relevant knowledge sources, such as databases, texts or images. Third, it needs to be able to execute the query on these knowledge sources. This paper focuses on the first capability, presenting a novel approach to semantically parsing questions expressed in natural language. The method makes use of a computational construction grammar model for mapping questions onto their executable semantic representations. We demonstrate and evaluate the methodology on the CLEVR visual question answering benchmark task. Our system achieves 100% accuracy, effectively solving the language understanding part of the benchmark task. Additionally, we demonstrate how this solution can be embedded in a full visual question answering system, in which a question is answered by executing its semantic representation on an image. The main advantages of the approach include (i) its transparent and interpretable properties, (ii) its extensibility, and (iii) the fact that the method does not rely on any annotated training data.
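The construction-grammar parsing step itself is not reproduced here; the toy sketch below only illustrates the third capability, executing a CLEVR-style functional program (the question's semantic representation) against a symbolic scene. The scene and program formats are simplified assumptions, not the system's actual representations.

```python
# Toy symbolic scene: each object is a dict of attributes.
scene = [
    {"color": "red", "shape": "cube", "size": "large"},
    {"color": "blue", "shape": "sphere", "size": "small"},
    {"color": "red", "shape": "sphere", "size": "small"},
]

def execute(program, scene):
    """Run a list of (operation, argument) steps over the scene."""
    state = list(scene)
    for op, arg in program:
        if op == "filter_color":
            state = [o for o in state if o["color"] == arg]
        elif op == "filter_shape":
            state = [o for o in state if o["shape"] == arg]
        elif op == "count":
            state = len(state)
    return state

# "How many red spheres are there?" expressed as a functional program
program = [("filter_color", "red"), ("filter_shape", "sphere"), ("count", None)]
print(execute(program, scene))  # -> 1
```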


2020 ◽  
Vol 34 (04) ◽  
pp. 5182-5190
Author(s):  
Pasquale Minervini ◽  
Matko Bošnjak ◽  
Tim Rocktäschel ◽  
Sebastian Riedel ◽  
Edward Grefenstette

Reasoning with knowledge expressed in natural language and Knowledge Bases (KBs) is a major challenge for Artificial Intelligence, with applications in machine reading, dialogue, and question answering. General neural architectures that jointly learn representations and transformations of text are very data-inefficient, and it is hard to analyse their reasoning process. These issues are addressed by end-to-end differentiable reasoning systems such as Neural Theorem Provers (NTPs), although they can only be used with small-scale symbolic KBs. In this paper, we first propose Greedy NTPs (GNTPs), an extension to NTPs that addresses their complexity and scalability limitations, thus making them applicable to real-world datasets. This result is achieved by dynamically constructing the computation graph of NTPs and including only the most promising proof paths during inference, thus obtaining models that are orders of magnitude more efficient. Then, we propose a novel approach for jointly reasoning over KBs and textual mentions by embedding logic facts and natural language sentences in a shared embedding space. We show that GNTPs perform on par with NTPs at a fraction of their cost while achieving competitive link prediction results on large datasets, providing explanations for predictions, and inducing interpretable models.
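A hedged sketch of the core efficiency idea in GNTPs: rather than attempting soft unification of a goal with every fact in the KB, only the top-k facts whose embeddings are nearest to the goal embedding are expanded during proof search. The embeddings and the helper below are illustrative placeholders, not the authors' implementation.

```python
import numpy as np

def top_k_facts(goal_embedding, fact_embeddings, k=5):
    """Indices of the k facts most similar to the goal embedding (cosine similarity)."""
    goal = goal_embedding / (np.linalg.norm(goal_embedding) + 1e-9)
    facts = fact_embeddings / (np.linalg.norm(fact_embeddings, axis=1, keepdims=True) + 1e-9)
    scores = facts @ goal
    return np.argsort(-scores)[:k]

# Only these k candidates are expanded in the differentiable proof tree,
# keeping the dynamically constructed computation graph small.
```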


2018 ◽  
Vol 25 (1) ◽  
pp. 5-41
Author(s):  
PRESLAV NAKOV ◽  
LLUÍS MÀRQUEZ ◽  
ALESSANDRO MOSCHITTI ◽  
HAMDY MUBARAK

We analyze resources and models for Arabic community Question Answering (cQA). In particular, we focus on CQA-MD, our cQA corpus for Arabic in the domain of medical forums. We describe the corpus and the main challenges it poses due to its mix of informal and formal language and of different Arabic dialects, as well as due to its medical nature. We further present a shared task on cQA at SemEval, the International Workshop on Semantic Evaluation, based on this corpus. We discuss the features and the machine learning approaches used by the teams who participated in the task, with a focus on the models that exploit syntactic information using convolutional tree kernels and neural word embeddings. We further analyze and extend the outcome of the SemEval challenge by training a meta-classifier that combines the output of several systems, which allows us to compare different features and different learning algorithms in an indirect way. Finally, we analyze the most frequent errors common to all approaches, categorizing them into prototypical cases, and zooming into the way syntactic information in tree kernel approaches can help solve some of the most difficult cases. We believe that our analysis and the lessons learned from the process of corpus creation, as well as from the shared task analysis, will be helpful for future research on Arabic cQA.
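A minimal sketch of the meta-classifier idea, assuming a stacked feature representation in which each candidate answer is described by the confidence scores of the participating systems; the data and classifier choice are illustrative, not the paper's exact setup.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Rows: candidate answers; columns: confidence score assigned by each participating system.
system_scores = np.array([
    [0.9, 0.7, 0.8],
    [0.2, 0.4, 0.1],
    [0.6, 0.8, 0.7],
    [0.1, 0.3, 0.2],
])
labels = np.array([1, 0, 1, 0])  # gold relevance of each candidate answer

meta = LogisticRegression().fit(system_scores, labels)
print(meta.predict_proba(system_scores)[:, 1])  # combined relevance scores
```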


2015 ◽  
Author(s):  
Yongshuai Hou ◽  
Cong Tan ◽  
Xiaolong Wang ◽  
Yaoyun Zhang ◽  
Jun Xu ◽  
...  

Author(s):  
Thanh Thi Ha ◽  
Atsuhiro Takasu ◽  
Thanh Chinh Nguyen ◽  
Kiem Hieu Nguyen ◽  
Van Nha Nguyen ◽  
...  

Answer selection is an important task in Community Question Answering (CQA). In recent years, attention-based neural networks have been extensively studied in various natural language processing problems, including question answering. This paper explores matchLSTM for answer selection in CQA. The lexical gap in CQA is more challenging because questions and answers typically contain multiple sentences, irrelevant information, and noisy expressions. In our investigation, the word-by-word attention of the original model does not work well on social question-answer pairs. We propose integrating supervised attention into matchLSTM. Specifically, we leverage lexical-semantic information from external resources to guide the learning of attention weights for question-answer pairs. The proposed model learns more meaningful attention weights, which allows it to perform better than the basic model. Our performance is among the top results on the SemEval datasets.
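A hedged sketch of the supervised-attention idea, assuming a PyTorch setting: an auxiliary loss pushes the model's word-by-word attention weights toward alignment targets derived from an external lexical-semantic resource. The matchLSTM encoder itself is omitted, and the tensor names are placeholders.

```python
import torch
import torch.nn.functional as F

def supervised_attention_loss(attn_weights, target_alignment, eps=1e-9):
    """KL divergence between predicted attention and externally derived alignment targets.

    attn_weights:     (batch, q_len, a_len), softmax-normalised over the answer dimension
    target_alignment: (batch, q_len, a_len), normalised alignment distribution from a
                      lexical-semantic resource (placeholder)
    """
    return F.kl_div((attn_weights + eps).log(), target_alignment, reduction="batchmean")

# The auxiliary term is added to the usual answer-selection loss:
# total_loss = selection_loss + lambda_attn * supervised_attention_loss(attn, target)
```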


Author(s):  
Manvi Breja

User profiling is one of the main issues faced while implementing an efficient question answering system: a user profile is built from the questions posed by the user, capturing their domain of interest. This paper presents a method for predicting the next questions related to the initial question that a user submits to a question answering search engine. A novel approach based on association rule mining is highlighted, in which information is extracted from the log of questions previously submitted to the question answering search engine, and association rule mining algorithms predict the set of questions that the user will pose to the system in the next session. Using this approach, the question answering system keeps the answers to the predicted questions in its repository, providing a speedy response to the user and thus increasing the efficiency of the system.
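A minimal sketch of the log-mining step, assuming sessions from the question log are treated as transactions and simple pairwise rules (question A implies question B) are kept when they meet support and confidence thresholds; the session format and thresholds are illustrative assumptions.

```python
from itertools import permutations
from collections import Counter

# Toy question log: each session is the set of questions a user asked.
sessions = [
    {"what is AI", "what is machine learning"},
    {"what is AI", "what is machine learning", "what is deep learning"},
    {"what is AI", "what is deep learning"},
]

def mine_rules(sessions, min_support=0.5, min_confidence=0.6):
    """Return pairwise rules (A -> B, support, confidence) that pass both thresholds."""
    n = len(sessions)
    item_counts = Counter(q for s in sessions for q in s)
    pair_counts = Counter(p for s in sessions for p in permutations(s, 2))
    rules = []
    for (a, b), cnt in pair_counts.items():
        support, confidence = cnt / n, cnt / item_counts[a]
        if support >= min_support and confidence >= min_confidence:
            rules.append((a, b, support, confidence))
    return rules

# Rules like ("what is AI" -> "what is machine learning") let the system
# pre-fetch answers to the questions a user is likely to ask next.
print(mine_rules(sessions))
```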


2017 ◽  
Vol 11 (03) ◽  
pp. 345-371
Author(s):  
Avani Chandurkar ◽  
Ajay Bansal

With the inception of the World Wide Web, the amount of data present on the Internet has become tremendous, which makes navigating through this enormous amount of data quite difficult for the user. As users struggle to navigate this wealth of information, the need for an automated system that can extract the required information becomes urgent. This paper presents a Question Answering system to ease the process of information retrieval. Question Answering systems have been around for quite some time and are a sub-field of information retrieval and natural language processing. The task of any Question Answering system is to seek an answer to a free-form factual question. The difficulty of pinpointing and verifying the precise answer makes question answering more challenging than the simple information retrieval done by search engines. The research objective of this paper is to develop a novel approach to Question Answering based on a composition of conventional Information Retrieval (IR) and Natural Language Processing (NLP) approaches. The focus is on using a structured and annotated knowledge base instead of an unstructured one. The knowledge base used here is DBpedia, and the final system is evaluated on the Text REtrieval Conference (TREC) 2004 questions dataset.
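A hedged sketch of retrieving a fact from the structured DBpedia knowledge base via SPARQL, which is the kind of lookup such a system's answer-extraction step relies on; the query and entity are illustrative, and the paper's IR/NLP pipeline for mapping a TREC question onto such a query is not reproduced here.

```python
from SPARQLWrapper import SPARQLWrapper, JSON

# Query the public DBpedia endpoint for the birthplace of an example entity.
sparql = SPARQLWrapper("https://dbpedia.org/sparql")
sparql.setQuery("""
    SELECT ?birthPlace WHERE {
        dbr:Alan_Turing dbo:birthPlace ?birthPlace .
    }
""")
sparql.setReturnFormat(JSON)
results = sparql.query().convert()

for binding in results["results"]["bindings"]:
    print(binding["birthPlace"]["value"])
```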

