MKPM: Multi keyword-pair matching for natural language sentences

Author(s):  
Xin Lu ◽  
Yao Deng ◽  
Ting Sun ◽  
Yi Gao ◽  
Jun Feng ◽  
...  

Sentence matching is widely used in various natural language tasks, such as natural language inference, paraphrase identification and question answering. For these tasks, we need to understand the logical and semantic relationship between two sentences. Most current methods use all information within a sentence to build a model and hence determine its relationship to another sentence. However, the information contained in some sentences may cause redundancy or introduce noise, impeding the performance of the model. Therefore, we propose a sentence matching method based on multi keyword-pair matching (MKPM), which uses keyword pairs in the two sentences to represent the semantic relationship between them, avoiding the interference of redundancy and noise. Specifically, we first propose a sentence-pair-based attention mechanism, sp-attention, to select the most important word pairs from the two sentences as keyword pairs, and then propose a Bi-task architecture to model the semantic information of these keyword pairs. The Bi-task architecture is as follows: 1. To understand the word-level semantic relationship between the two sentences, we design a word-pair task (WP-Task), which uses these keyword pairs to complete sentence matching independently. 2. We design a sentence-pair task (SP-Task) to understand the sentence-level semantic relationship between the two sentences by sentence denoising. Through the integration of the two tasks, our model can understand sentences more accurately from the two granularities of word and sentence. Experimental results show that our model can achieve state-of-the-art performance on several tasks. Our source code is publicly available.
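
As a rough illustration of the idea (not the authors' released implementation), the sketch below scores every cross-sentence word pair with a learned bilinear, sp-attention-style scorer, keeps the top-scoring pairs as keyword pairs for a WP-Task head, and feeds attention-pooled "denoised" sentence vectors to an SP-Task head. The bilinear scorer, the pooling choices, and all module names are assumptions made for illustration only.

```python
import torch
import torch.nn as nn

class MKPMSketch(nn.Module):
    def __init__(self, dim=128, n_pairs=4, n_classes=2):
        super().__init__()
        self.score = nn.Bilinear(dim, dim, 1)          # assumed sp-attention-style pair scorer
        self.n_pairs = n_pairs
        self.wp_head = nn.Linear(2 * dim, n_classes)   # WP-Task: classify from keyword pairs
        self.sp_head = nn.Linear(2 * dim, n_classes)   # SP-Task: classify from denoised sentences

    def forward(self, a, b):
        # a: (La, dim), b: (Lb, dim) contextual word embeddings of the two sentences
        La, Lb, d = a.size(0), b.size(0), a.size(-1)
        # Score every cross-sentence word pair (i, j).
        pair_scores = self.score(a.unsqueeze(1).expand(La, Lb, d).reshape(-1, d),
                                 b.unsqueeze(0).expand(La, Lb, d).reshape(-1, d)).view(La, Lb)
        # Keep the highest-scoring pairs as keyword pairs.
        flat = pair_scores.flatten()
        top = torch.topk(flat, k=min(self.n_pairs, flat.numel())).indices
        i = torch.div(top, Lb, rounding_mode='floor')
        j = top % Lb
        pair_repr = torch.cat([a[i], b[j]], dim=-1).mean(dim=0)   # pooled keyword-pair feature
        # "Denoised" sentence vectors: attention-weighted pooling (one possible reading).
        wa = torch.softmax(pair_scores.max(dim=1).values, dim=0)
        wb = torch.softmax(pair_scores.max(dim=0).values, dim=0)
        sent_repr = torch.cat([wa @ a, wb @ b], dim=-1)
        return self.wp_head(pair_repr), self.sp_head(sent_repr)   # WP-Task and SP-Task logits

# Example: two sentences of 7 and 9 tokens with 128-dim embeddings.
wp_logits, sp_logits = MKPMSketch()(torch.randn(7, 128), torch.randn(9, 128))
```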

Author(s):  
Zhongbin Xie ◽  
Shuai Ma

Semantically matching two text sequences (usually two sentences) is a fundamental problem in NLP. Most previous methods either encode each of the two sentences into a vector representation (sentence-level embedding) or leverage word-level interaction features between the two sentences. In this study, we propose to take the sentence-level embedding features and the word-level interaction features as two distinct views of a sentence pair, and unify them with a framework of Variational Autoencoders such that the sentence pair is matched in a semi-supervised manner. The proposed model is referred to as Dual-View Variational AutoEncoder (DV-VAE), where the optimization of the variational lower bound can be interpreted as an implicit Co-Training mechanism for two matching models over distinct views. Experiments on SNLI, Quora and a Community Question Answering dataset demonstrate the superiority of our DV-VAE over several strong semi-supervised and supervised text matching models.
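
The following is a loose sketch of how two views of a sentence pair might share a latent code under a variational objective, in the spirit of the DV-VAE description above. The encoders, the symmetric consistency term standing in for the implicit co-training effect, and all names are assumptions, not the paper's implementation; the decoder/reconstruction term of the ELBO is omitted to keep the sketch short.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DualViewSketch(nn.Module):
    def __init__(self, view_dim=256, z_dim=64, n_classes=2):
        super().__init__()
        self.q1 = nn.Linear(view_dim, 2 * z_dim)  # sentence-level embedding view -> (mu, logvar)
        self.q2 = nn.Linear(view_dim, 2 * z_dim)  # word-level interaction view  -> (mu, logvar)
        self.cls = nn.Linear(z_dim, n_classes)    # matching head on the latent code

    @staticmethod
    def kl(mu, logvar):
        # KL( N(mu, sigma^2) || N(0, I) ) for a diagonal Gaussian posterior
        return 0.5 * torch.sum(mu.pow(2) + logvar.exp() - logvar - 1, dim=-1)

    def forward(self, v1, v2, label=None):
        mu1, lv1 = self.q1(v1).chunk(2, dim=-1)
        mu2, lv2 = self.q2(v2).chunk(2, dim=-1)
        z1 = mu1 + torch.randn_like(mu1) * (0.5 * lv1).exp()   # reparameterization
        z2 = mu2 + torch.randn_like(mu2) * (0.5 * lv2).exp()
        loss = (self.kl(mu1, lv1) + self.kl(mu2, lv2)).mean()
        if label is not None:
            # Labeled pairs: both views must predict the same match label.
            loss = loss + F.cross_entropy(self.cls(z1), label) + F.cross_entropy(self.cls(z2), label)
        else:
            # Unlabeled pairs: each view is pulled toward the other's soft prediction.
            p1, p2 = self.cls(z1).softmax(-1), self.cls(z2).softmax(-1)
            loss = loss + 0.5 * (F.kl_div(p1.log(), p2.detach(), reduction='batchmean')
                                 + F.kl_div(p2.log(), p1.detach(), reduction='batchmean'))
        return loss
```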


Author(s):  
Seonhoon Kim ◽  
Inho Kang ◽  
Nojun Kwak

Sentence matching is widely used in various natural language tasks such as natural language inference, paraphrase identification, and question answering. These tasks require understanding the logical and semantic relationship between two sentences, which remains challenging. Although attention mechanisms are useful for capturing the semantic relationship and properly aligning the elements of two sentences, previous attention-based methods simply use a summation operation, which does not sufficiently retain the original features. Inspired by DenseNet, a densely connected convolutional network, we propose a densely-connected co-attentive recurrent neural network, each layer of which uses the concatenated information of attentive features as well as the hidden features of all preceding recurrent layers. This preserves the original and co-attentive feature information from the bottommost word embedding layer to the uppermost recurrent layer. To alleviate the problem of ever-increasing feature vector size due to dense concatenation operations, we also propose to use an autoencoder after dense concatenation. We evaluate our proposed architecture on highly competitive benchmark datasets for sentence matching. Experimental results show that our architecture, which retains recurrent and attentive features, achieves state-of-the-art performance on most of the tasks.
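
A minimal sketch of one densely-connected co-attentive layer, under assumed shapes: the layer's input is carried forward alongside new recurrent and co-attentive features, and a linear bottleneck (standing in for the paper's autoencoder) keeps the feature width from growing without bound. This is illustrative only, not the authors' model.

```python
import torch
import torch.nn as nn

class DenseCoAttentiveLayer(nn.Module):
    def __init__(self, in_dim, hidden=128):
        super().__init__()
        self.rnn = nn.LSTM(in_dim, hidden, batch_first=True, bidirectional=True)
        self.compress = nn.Linear(in_dim + 4 * hidden, in_dim)  # autoencoder-style bottleneck

    @staticmethod
    def co_attention(h_a, h_b):
        # Soft-align b's states to each position of a (plain dot-product attention).
        attn = torch.softmax(h_a @ h_b.transpose(1, 2), dim=-1)
        return attn @ h_b

    def forward(self, x_a, x_b):
        # x_a: (B, La, in_dim), x_b: (B, Lb, in_dim) -- outputs of all previous layers, concatenated
        h_a, _ = self.rnn(x_a)
        h_b, _ = self.rnn(x_b)
        att_a = self.co_attention(h_a, h_b)
        att_b = self.co_attention(h_b, h_a)
        # Dense connection: keep the layer input alongside the new recurrent and attentive
        # features, then compress so the width stays fixed for the next layer.
        out_a = self.compress(torch.cat([x_a, h_a, att_a], dim=-1))
        out_b = self.compress(torch.cat([x_b, h_b, att_b], dim=-1))
        return out_a, out_b
```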


Terminology ◽  
1998 ◽  
Vol 5 (2) ◽  
pp. 203-228 ◽  
Author(s):  
Bernardo Magnini

The role of generic lexical resources as well as specialized terminology is crucial in the design of complex dialogue systems, where a human interacts with the computer using Natural Language. Lexicon and terminology are supposed to store information for several purposes, including the discrimination of semantically inconsistent interpretations, the use of lexical variations, the compositional construction of a semantic representation for a complex sentence and the ability to access equivalencies across different languages. For these purposes it is necessary to rely on representational tools that are both theoretically motivated and operationally well defined. In this paper we propose a solution to lexical and terminology representation which is based on the combination of a linguistically motivated upper model and a multilingual WordNet. The upper model accounts for the linguistic analysis at the sentence level, while the multilingual WordNet accounts for lexical and conceptual relations at the word level.
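
For the word-level, cross-lingual part of such a design, off-the-shelf multilingual WordNet access can be illustrated with NLTK and the Open Multilingual Wordnet, as below (requires the 'wordnet' and 'omw-1.4' NLTK data packages). This is a generic illustration, not the system described in the paper.

```python
# Generic sketch: map a word to candidate equivalents in another language
# via the synsets the two languages share in the (Open Multilingual) WordNet.
from nltk.corpus import wordnet as wn

def cross_lingual_equivalents(word, src='eng', dst='ita'):
    """Return {synset name: lemmas in the target language} for each sense of `word`."""
    equivalents = {}
    for synset in wn.synsets(word, lang=src):
        equivalents[synset.name()] = synset.lemma_names(lang=dst)
    return equivalents

# Example: candidate Italian lemmas for each sense of an English term.
print(cross_lingual_equivalents('treatment'))
```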


2020 ◽  
Vol 2020 ◽  
pp. 1-10 ◽  
Author(s):  
Hanqian Wu ◽  
Mumu Liu ◽  
Shangbin Zhang ◽  
Zhike Wang ◽  
Siliang Cheng

Online product reviews are proliferating on e-commerce platforms, and mining the aspect-level product information contained in those reviews has great economic benefit. Aspect category classification is a basic task in aspect-level sentiment analysis, which has become a hot research topic in the natural language processing (NLP) field over the last decades. On various e-commerce platforms, user-generated question-answering (QA) reviews have emerged that generally contain much aspect-related product information. Although some researchers have devoted their efforts to aspect category classification for traditional product reviews, existing deep learning-based approaches cannot be well applied to represent QA-style reviews. Thus, we propose a 4-dimension (4D) textual representation model that models the text at different levels of representation: word level, sentence level, QA interaction level, and hyperinteraction level. In our experiments, empirical studies on datasets from three domains demonstrate that our proposals perform better than traditional sentence-level representation approaches, especially in the Digit domain.
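
A rough sketch of a four-level representation (word, sentence, QA interaction, hyperinteraction) of a QA-style review is given below; the specific encoders and operators are assumptions chosen for illustration, not the paper's exact model.

```python
import torch
import torch.nn as nn

class FourLevelSketch(nn.Module):
    def __init__(self, word_dim=100, hidden=64, n_aspects=10):
        super().__init__()
        self.sent_enc = nn.GRU(word_dim, hidden, batch_first=True, bidirectional=True)  # word -> sentence
        self.inter = nn.Bilinear(2 * hidden, 2 * hidden, 2 * hidden)                    # QA interaction
        self.hyper = nn.GRU(2 * hidden, hidden, batch_first=True, bidirectional=True)   # hyperinteraction
        self.cls = nn.Linear(2 * hidden, n_aspects)

    def encode_sentences(self, sents):
        # sents: (n_sents, n_words, word_dim) -> one vector per sentence (final hidden states).
        _, h = self.sent_enc(sents)
        return torch.cat([h[0], h[1]], dim=-1)                 # (n_sents, 2*hidden)

    def forward(self, q_sents, a_sents):
        q = self.encode_sentences(q_sents)                     # question-side sentence vectors
        a = self.encode_sentences(a_sents)                     # answer-side sentence vectors
        # QA interaction level: one feature per (question sentence, answer sentence) pair.
        pairs = self.inter(q.unsqueeze(1).expand(-1, a.size(0), -1).reshape(-1, q.size(-1)),
                           a.unsqueeze(0).expand(q.size(0), -1, -1).reshape(-1, a.size(-1)))
        # Hyperinteraction level: aggregate all pair features into one review representation.
        _, h = self.hyper(pairs.unsqueeze(0))
        review = torch.cat([h[0, 0], h[1, 0]], dim=-1)
        return self.cls(review)                                # aspect category scores

# Example: a QA review with 2 question sentences and 3 answer sentences, 20 words each.
scores = FourLevelSketch()(torch.randn(2, 20, 100), torch.randn(3, 20, 100))
```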


2021 ◽  
Author(s):  
Anshuman Mishra ◽  
Dhruvesh Patel ◽  
Aparna Vijayakumar ◽  
Xiang Lorraine Li ◽  
Pavan Kapanipathi ◽  
...  

2021 ◽  
Vol 2021 ◽  
pp. 1-16
Author(s):  
Chinh Trong Nguyen ◽  
Dang Tuan Nguyen

Recently, many deep learning models have achieved high results on the question answering task, with overall F1 scores above 0.88 on SQuAD datasets. However, many of these models have quite low F1 scores on why-questions, ranging from 0.57 to 0.7 on the SQuAD v1.1 development set. This means these models are better suited to extracting answers for factoid questions than for why-questions. Why-questions are asked when explanations are needed, and these explanations are possibly arguments or simply subjective opinions. Therefore, we propose an approach to finding answers for why-questions using discourse analysis and natural language inference. In our approach, natural language inference is applied to identify implicit arguments at the sentence level; it is also applied in sentence similarity calculation. Discourse analysis is applied to identify the explicit arguments and the opinions at the sentence level in documents. The results from these two methods are the answer candidates from which the final answer for each why-question is selected. We also implement a system based on our approach, which provides an answer given a why-question and a document, as in a reading comprehension test. We test our system with a Vietnamese-translated test set containing all why-questions of the SQuAD v1.1 development set. The test results show that our system cannot beat a deep learning model in F1 score; however, our system can answer more questions (answer rate of 77.0%) than the deep learning model (answer rate of 61.0%).
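
A much-simplified sketch of the candidate-selection idea: sentences carrying explicit causal discourse markers become explanation candidates, and a generic entailment scorer (any NLI model exposing an entailment probability; here an assumed placeholder function) ranks them against the question. This is illustrative only and omits the paper's discourse analysis and sentence similarity machinery.

```python
import re

CAUSAL_MARKERS = ("because", "since", "due to", "as a result", "therefore", "so that")

def explicit_candidates(sentences):
    """Keep sentences that carry an explicit causal discourse cue."""
    return [s for s in sentences if any(m in s.lower() for m in CAUSAL_MARKERS)]

def rank_candidates(question, sentences, entail_prob):
    """Rank explanation candidates for a why-question.

    entail_prob(premise, hypothesis) -> float in [0, 1] is a placeholder for any
    NLI scorer; the question is crudely turned into a statement by dropping "why".
    """
    statement = re.sub(r"^why\s+", "", question.strip().rstrip("?"), flags=re.I)
    scored = [(s, entail_prob(s, statement)) for s in explicit_candidates(sentences)]
    return sorted(scored, key=lambda item: item[1], reverse=True)
```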


2021 ◽  
Vol 14 (4) ◽  
pp. 1-24
Author(s):  
Sushant Kafle ◽  
Becca Dingman ◽  
Matt Huenerfauth

There are style guidelines for authors who highlight important words in static text, e.g., bolded words in student textbooks, yet little research has investigated highlighting in dynamic texts, e.g., captions during educational videos for Deaf or Hard of Hearing (DHH) users. In our experimental study, DHH participants subjectively compared design parameters for caption highlighting, including: decoration (underlining vs. italicizing vs. boldfacing), granularity (sentence level vs. word level), and whether to highlight only the first occurrence of a repeating keyword. In partial contrast to recommendations in prior research, which had not been based on experimental studies with DHH users, we found that DHH participants preferred boldface, word-level highlighting in captions. Our empirical results provide guidance for the design of keyword highlighting during captioned videos for DHH users, especially in educational video genres.


2007 ◽  
Vol 33 (1) ◽  
pp. 105-133 ◽  
Author(s):  
Catalina Hallett ◽  
Donia Scott ◽  
Richard Power

This article describes a method for composing fluent and complex natural language questions, while avoiding the standard pitfalls of free text queries. The method, based on Conceptual Authoring, is targeted at question-answering systems where reliability and transparency are critical, and where users cannot be expected to undergo extensive training in question composition. This scenario is found in most corporate domains, especially in applications that are risk-averse. We present a proof-of-concept system we have developed: a question-answering interface to a large repository of medical histories in the area of cancer. We show that the method allows users to successfully and reliably compose complex queries with minimal training.

