Context Based Quantum Language Model with Application to Question Answering

In this paper, we propose a novel data augmentation method with respect to the target context of the data via self-supervised learning. Instead of looking for the exact synonyms of masked words, the proposed method finds words that can replace the original words considering the context. For self-supervised learning, we can employ the masked language model (MLM), which masks a specific word within a sentence and obtains the original word. The MLM learns the context of a sentence through asymmetrical inputs and outputs. However, without using the existing MLM, we propose a label-masked language model (LMLM) that can include label information for the mask tokens used in the MLM to effectively use the MLM in data with label information. The augmentation method performs self-supervised learning using LMLM and then implements data augmentation through the trained model. We demonstrate that our proposed method improves the classification accuracy of recurrent neural networks and convolutional neural network-based classifiers through several experiments for text classification benchmark datasets, including the Stanford Sentiment Treebank-5 (SST5), the Stanford Sentiment Treebank-2 (SST2), the subjectivity (Subj), the Multi-Perspective Question Answering (MPQA), the Movie Reviews (MR), and the Text Retrieval Conference (TREC) datasets. In addition, since the proposed method does not use external data, it can eliminate the time spent collecting external data, or pre-training using external data.

Download Full-text

Building interactive sentence-aware representation based on generative language model for community question answering

Neurocomputing ◽

10.1016/j.neucom.2019.12.107 ◽

2020 ◽

Vol 389 ◽

pp. 93-107

Author(s):

Jinmeng Wu ◽

Tingting Mu ◽

Jeyarajan Thiyagalingam ◽

John Y. Goulermas

Keyword(s):

Question Answering ◽

Language Model ◽

Community Question Answering ◽

Generative Language

Download Full-text

A Hybrid Neural Network BERT-Cap Based on Pre-Trained Language Model and Capsule Network for User Intent Classification

Complexity ◽

10.1155/2020/8858852 ◽

2020 ◽

Vol 2020 ◽

pp. 1-11

Author(s):

Hai Liu ◽

Yuanxia Liu ◽

Leung-Pun Wong ◽

Lap-Kei Lee ◽

Tianyong Hao

Keyword(s):

Neural Network ◽

Neural Network Model ◽

Question Answering ◽

Semantic Information ◽

Language Model ◽

Dialogue System ◽

Hybrid Neural Network ◽

Question Answering System ◽

User Intent ◽

Vital Component

User intent classification is a vital component of a question-answering system or a task-based dialogue system. In order to understand the goals of users’ questions or discourses, the system categorizes user text into a set of pre-defined user intent categories. User questions or discourses are usually short in length and lack sufficient context; thus, it is difficult to extract deep semantic information from these types of text and the accuracy of user intent classification may be affected. To better identify user intents, this paper proposes a BERT-Cap hybrid neural network model with focal loss for user intent classification to capture user intents in dialogue. The model uses multiple transformer encoder blocks to encode user utterances and initializes encoder parameters with a pre-trained BERT. Then, it extracts essential features using a capsule network with dynamic routing after utterances encoding. Experiment results on four publicly available datasets show that our model BERT-Cap achieves a F1 score of 0.967 and an accuracy of 0.967, outperforming a number of baseline methods, indicating its effectiveness in user intent classification.

Download Full-text

A Quantum Expectation Value Based Language Model with Application to Question Answering

Entropy ◽

10.3390/e22050533 ◽

2020 ◽

Vol 22 (5) ◽

pp. 533

Author(s):

Qin Zhao ◽

Chenguang Hou ◽

Changjian Liu ◽

Peng Zhang ◽

Ruifeng Xu

Keyword(s):

Hilbert Space ◽

Density Matrix ◽

Question Answering ◽

Language Model ◽

Language Models ◽

Quantum Model ◽

Expectation Value ◽

Proposed Model ◽

Matching Score ◽

The Relationship

Quantum-inspired language models have been introduced to Information Retrieval due to their transparency and interpretability. While exciting progresses have been made, current studies mainly investigate the relationship between density matrices of difference sentence subspaces of a semantic Hilbert space. The Hilbert space as a whole which has a unique density matrix is lack of exploration. In this paper, we propose a novel Quantum Expectation Value based Language Model (QEV-LM). A unique shared density matrix is constructed for the Semantic Hilbert Space. Words and sentences are viewed as different observables in this quantum model. Under this background, a matching score describing the similarity between a question-answer pair is naturally explained as the quantum expectation value of a joint question-answer observable. In addition to the theoretical soundness, experiment results on the TREC-QA and WIKIQA datasets demonstrate the computational efficiency of our proposed model with excellent performance and low time consumption.

Download Full-text

Question Answering over Knowledge Base using Language Model Embeddings

2020 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn48605.2020.9206698 ◽

2020 ◽

Author(s):

Japa Sai Sharath ◽

Rekabdar Banafsheh

Keyword(s):

Knowledge Base ◽

Question Answering ◽

Language Model

Download Full-text

Pre-trained Language Model for Biomedical Question Answering

Machine Learning and Knowledge Discovery in Databases - Communications in Computer and Information Science ◽

10.1007/978-3-030-43887-6_64 ◽

2020 ◽

pp. 727-740 ◽

Cited By ~ 1

Author(s):

Wonjin Yoon ◽

Jinhyuk Lee ◽

Donghyeon Kim ◽

Minbyul Jeong ◽

Jaewoo Kang

Keyword(s):

Question Answering ◽

Language Model

Download Full-text

Question Difficulty Estimation Based on Attention Model for Question Answering

Applied Sciences ◽

10.3390/app112412023 ◽

2021 ◽

Vol 11 (24) ◽

pp. 12023

Author(s):

Hyun-Je Song ◽

Su-Hwan Yoon ◽

Seong-Bae Park

Keyword(s):

Question Answering ◽

Language Model ◽

Difficulty Level ◽

Data Sets ◽

Simple Relationship ◽

Attention Model ◽

Proposed Model ◽

Key Factor ◽

Question Difficulty ◽

Information Components

This paper addresses a question difficulty estimation of which goal is to estimate the difficulty level of a given question in question-answering (QA) tasks. Since a question in the tasks is composed of a questionary sentence and a set of information components such as a description and candidate answers, it is important to model the relationship among the information components to estimate the difficulty level of the question. However, existing approaches to this task modeled a simple relationship such as a relationship between a questionary sentence and a description, but such simple relationships are insufficient to predict the difficulty level accurately. Therefore, this paper proposes an attention-based model to consider the complicated relationship among the information components. The proposed model first represents bi-directional relationships between a questionary sentence and each information component using a dual multi-head co-attention, since the questionary sentence is a key factor in the QA questions and it affects and is affected by information components. Then, the proposed model considers inter-information relationship over the bi-directional representations through a self-attention model. The inter-information relationship helps predict the difficulty of the questions accurately which require reasoning over multiple kinds of information components. The experimental results from three well-known and real-world QA data sets prove that the proposed model outperforms the previous state-of-the-art and pre-trained language model baselines. It is also shown that the proposed model is robust against the increase of the number of information components.

Download Full-text