Joint Learning of Answer Selection and Answer Summary Generation in Community Question Answering

Yang Deng; Wai Lam; Yuexiang Xie; Daoyuan Chen; Yaliang Li; Min Yang; Ying Shen

doi:10.1609/aaai.v34i05.6266

Joint Learning of Answer Selection and Answer Summary Generation in Community Question Answering

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6266 ◽

2020 ◽

Vol 34 (05) ◽

pp. 7651-7658 ◽

Cited By ~ 2

Author(s):

Yang Deng ◽

Wai Lam ◽

Yuexiang Xie ◽

Daoyuan Chen ◽

Yaliang Li ◽

...

Keyword(s):

Large Scale ◽

Question Answering ◽

State Of The Art ◽

Reading Difficulties ◽

Text Summarization ◽

Essential Information ◽

Joint Learning ◽

Community Question Answering ◽

Proposed Model ◽

Correlation Information

Community question answering (CQA) gains increasing popularity in both academy and industry recently. However, the redundancy and lengthiness issues of crowdsourced answers limit the performance of answer selection and lead to reading difficulties and misunderstandings for community users. To solve these problems, we tackle the tasks of answer selection and answer summary generation in CQA with a novel joint learning model. Specifically, we design a question-driven pointer-generator network, which exploits the correlation information between question-answer pairs to aid in attending the essential information when generating answer summaries. Meanwhile, we leverage the answer summaries to alleviate noise in original lengthy answers when ranking the relevancy degrees of question-answer pairs. In addition, we construct a new large-scale CQA corpus, WikiHowQA, which contains long answers for answer selection as well as reference summaries for answer summarization. The experimental results show that the joint learning method can effectively address the answer redundancy issue in CQA and achieves state-of-the-art results on both answer selection and text summarization tasks. Furthermore, the proposed model is shown to be of great transferring ability and applicability for resource-poor CQA tasks, which lack of reference answer summaries.

Download Full-text

Large-Scale Goodness Polarity Lexicons for Community Question Answering

Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '17 ◽

10.1145/3077136.3080757 ◽

2017 ◽

Cited By ~ 1

Author(s):

Todor Mihaylov ◽

Daniel Balchev ◽

Yasen Kiprov ◽

Ivan Koychev ◽

Preslav Nakov

Keyword(s):

Large Scale ◽

Question Answering ◽

Community Question Answering

Download Full-text

Multistep Flow Prediction on Car-Sharing Systems: A Multi-Graph Convolutional Neural Network with Attention Mechanism

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s0218194019400187 ◽

2019 ◽

Vol 29 (11n12) ◽

pp. 1727-1740 ◽

Cited By ~ 3

Author(s):

Hongming Zhu ◽

Yi Luo ◽

Qin Liu ◽

Hongfei Fan ◽

Tianyou Song ◽

...

Keyword(s):

Large Scale ◽

State Of The Art ◽

Spatial Relations ◽

Attention Mechanism ◽

Temporal Relations ◽

Car Sharing ◽

Flow Prediction ◽

Proposed Model ◽

Decoder Architecture ◽

Deep Learning Model

Multistep flow prediction is an essential task for the car-sharing systems. An accurate flow prediction model can help system operators to pre-allocate the cars to meet the demand of users. However, this task is challenging due to the complex spatial and temporal relations among stations. Existing works only considered temporal relations (e.g. using LSTM) or spatial relations (e.g. using CNN) independently. In this paper, we propose an attention to multi-graph convolutional sequence-to-sequence model (AMGC-Seq2Seq), which is a novel deep learning model for multistep flow prediction. The proposed model uses the encoder–decoder architecture, wherein the encoder part, spatial and temporal relations are encoded simultaneously. Then the encoded information is passed to the decoder to generate multistep outputs. In this work, specific multiple graphs are constructed to reflect spatial relations from different aspects, and we model them by using the proposed multi-graph convolution. Attention mechanism is also used to capture the important relations from previous information. Experiments on a large-scale real-world car-sharing dataset demonstrate the effectiveness of our approach over state-of-the-art methods.

Download Full-text

Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33018658 ◽

2019 ◽

Vol 33 ◽

pp. 8658-8665 ◽

Cited By ~ 10

Author(s):

Xiangpeng Li ◽

Jingkuan Song ◽

Lianli Gao ◽

Xianglong Liu ◽

Wenbing Huang ◽

...

Keyword(s):

Question Answering ◽

State Of The Art ◽

Computation Time ◽

Comparable Result ◽

Video Encoding ◽

Visual Question Answering ◽

Proposed Model ◽

Ablation Study ◽

The Given ◽

Video Question Answering

Most of the recent progresses on visual question answering are based on recurrent neural networks (RNNs) with attention. Despite the success, these models are often timeconsuming and having difficulties in modeling long range dependencies due to the sequential nature of RNNs. We propose a new architecture, Positional Self-Attention with Coattention (PSAC), which does not require RNNs for video question answering. Specifically, inspired by the success of self-attention in machine translation task, we propose a Positional Self-Attention to calculate the response at each position by attending to all positions within the same sequence, and then add representations of absolute positions. Therefore, PSAC can exploit the global dependencies of question and temporal information in the video, and make the process of question and video encoding executed in parallel. Furthermore, in addition to attending to the video features relevant to the given questions (i.e., video attention), we utilize the co-attention mechanism by simultaneously modeling “what words to listen to” (question attention). To the best of our knowledge, this is the first work of replacing RNNs with selfattention for the task of visual question answering. Experimental results of four tasks on the benchmark dataset show that our model significantly outperforms the state-of-the-art on three tasks and attains comparable result on the Count task. Our model requires less computation time and achieves better performance compared with the RNNs-based methods. Additional ablation study demonstrates the effect of each component of our proposed model.

Download Full-text

Supervised attention for answer selection in community question answering

IAES International Journal of Artificial Intelligence (IJ-AI) ◽

10.11591/ijai.v9.i2.pp203-211 ◽

2020 ◽

Vol 9 (2) ◽

pp. 203

Author(s):

Thanh Thi Ha ◽

Atsuhiro Takasu ◽

Thanh Chinh Nguyen ◽

Kiem Hieu Nguyen ◽

Van Nha Nguyen ◽

...

Keyword(s):

Language Processing ◽

Question Answering ◽

Irrelevant Information ◽

Social Question ◽

Community Question Answering ◽

Basic Model ◽

Proposed Model ◽

Questions And Answers ◽

Word Attention ◽

Better Than

Answer selection is an important task in Community Question Answering (CQA). In recent years, attention-based neural networks have been extensively studied in various natural language processing problems, including question answering. This paper explores matchLSTM for answer selection in CQA. A lexical gap in CQA is more challenging as questions and answers typical contain multiple sentences, irrelevant information, and noisy expressions. In our investigation, word-by-word attention in the original model does not work well on social question-answer pairs. We propose integrating supervised attention into matchLSTM. Specifically, we leverage lexical-semantic from external to guide the learning of attention weights for question-answer pairs. The proposed model learns more meaningful attention that allows performing better than the basic model. Our performance is among the top on SemEval datasets.

Download Full-text

Dynamic user modeling for expert recommendation in community question answering

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-200729 ◽

2020 ◽

Vol 39 (5) ◽

pp. 7281-7292

Author(s):

Tongze He ◽

Caili Guo ◽

Yunfei Chu ◽

Yang Yang ◽

Yanjun Wang

Keyword(s):

Real World ◽

Large Scale ◽

Question Answering ◽

Dynamic Change ◽

Superior Performance ◽

User Interest ◽

Short Term ◽

Expert Recommendation ◽

Community Question Answering ◽

User Expertise

Community Question Answering (CQA) websites has become an important channel for people to acquire knowledge. In CQA, one key issue is to recommend users with high expertise and willingness to answer the given questions, i.e., expert recommendation. However, a lot of existing methods consider the expert recommendation problem in a static context, ignoring that the real-world CQA websites are dynamic, with users’ interest and expertise changing over time. Although some methods that utilize time information have been proposed, their performance improvement can be limited due to fact that they fail they fail to consider the dynamic change of both user interests and expertise. To solve these problems, we propose a deep learning based framework for expert recommendation to exploit user interest and expertise in a dynamic environment. For user interest, we leverage Long Short-Term Memory (LSTM) to model user’s short-term interest so as to capture the dynamic change of users’ interests. For user expertise, we design user expertise network, which leverages feedback on users’ historical behavior to estimate their expertise on new question. We propose two methods in user expertise network according to whether the dynamic property of expertise is considered. The experimental results on a large-scale dataset from a real-world CQA site demonstrate the superior performance of our method.

Download Full-text

An effective approach for automatic interpretation of Chinese nominal compounds

Information Discovery and Delivery ◽

10.1108/idd-01-2017-0007 ◽

2017 ◽

Vol 45 (2) ◽

pp. 101-106

Author(s):

Weiguang Qu ◽

Rubing Dai ◽

Taizhong Wu ◽

Jian Liu ◽

Junsheng Zhou ◽

...

Keyword(s):

Design Methodology ◽

Large Scale ◽

Question Answering ◽

Crucial Issue ◽

Content Type ◽

Automatic Interpretation ◽

Proposed Model ◽

Hidden Knowledge ◽

Nominal Compounds ◽

Interpretation Model

Purpose Automatic interpretation of Nominal Compounds is a crucial issue for many applications, for example, sentence understanding, machine translation, question-answering system and so forth. Many automatic interpretation models of Nominal Compounds use the strategies based on verbs or rules to obtain the interpretation of compounds. However, the performances of these models are still limited. The purpose of this paper is to propose an effective approach for automatic interpretation of Chinese nominal compounds. Design/methodology/approach The authors propose a top-down and bottom-up model based on rules and large-scale corpus for automatic interpretation of Nominal Compounds. Findings Experimental results demonstrate that the proposed model outperforms the state-of-the-art automatic interpretation model. Originality/value The paper is an up-to-date study of automatic interpretation for Nominal Compounds. It can help people understand the meaning of Nominal Compounds in reading. With a better understanding of Nominal Compounds, we can discover more hidden knowledge in them.

Download Full-text

Facet Annotation by Extending CNN with a Matching Strategy

Neural Computation ◽

10.1162/neco_a_01077 ◽

2018 ◽

Vol 30 (6) ◽

pp. 1647-1672 ◽

Cited By ~ 2

Author(s):

Bei Wu ◽

Bifan Wei ◽

Jun Liu ◽

Zhaotong Guo ◽

Yuanhao Zheng ◽

...

Keyword(s):

Neural Networks ◽

Question Answering ◽

State Of The Art ◽

Data Sets ◽

Text Representation ◽

Real World Data ◽

Fine Grained ◽

Community Question Answering ◽

Matching Strategy ◽

Similarity Matrices

Most community question answering (CQA) websites manage plenty of question-answer pairs (QAPs) through topic-based organizations, which may not satisfy users' fine-grained search demands. Facets of topics serve as a powerful tool to navigate, refine, and group the QAPs. In this work, we propose FACM, a model to annotate QAPs with facets by extending convolution neural networks (CNNs) with a matching strategy. First, phrase information is incorporated into text representation by CNNs with different kernel sizes. Then, through a matching strategy among QAPs and facet label texts (FaLTs) acquired from Wikipedia, we generate similarity matrices to deal with the facet heterogeneity. Finally, a three-channel CNN is trained for facet label assignment of QAPs. Experiments on three real-world data sets show that FACM outperforms the state-of-the-art methods.

Download Full-text

Neural Abstractive Summarization with Structural Attention

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/514 ◽

2020 ◽

Author(s):

Tanya Chowdhury ◽

Sachin Kumar ◽

Tanmoy Chakraborty

Keyword(s):

Question Answering ◽

Popular Opinion ◽

Document Summarization ◽

Community Question Answering ◽

Proposed Model ◽

Abstractive Summarization

Attentional, RNN-based encoder-decoder architectures have obtained impressive performance on abstractive summarization of news articles. However, these methods fail to account for long term dependencies within the sentences of a document. This problem is exacerbated in multi-document summarization tasks such as summarizing the popular opinion in threads present in community question answering (CQA) websites such as Yahoo! Answers and Quora. These threads contain answers which often overlap or contradict each other. In this work, we present a hierarchical encoder based on structural attention to model such inter-sentence and inter-document dependencies. We set the popular pointer-generator architecture and some of the architectures derived from it as our baselines and show that they fail to generate good summaries in a multi-document setting. We further illustrate that our proposed model achieves significant improvement over the baseline in both single and multi-document summarization settings -- in the former setting, it beats the baseline by 1.31 and 7.8 ROUGE-1 points on CNN and CQA datasets, respectively; in the latter setting, the performance is further improved by 1.6 ROUGE-1 points on the CQA dataset.

Download Full-text

Dual-View Variational Autoencoders for Semi-Supervised Text Matching

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/737 ◽

2019 ◽

Author(s):

Zhongbin Xie ◽

Shuai Ma

Keyword(s):

Question Answering ◽

Fundamental Problem ◽

Sentence Pair ◽

Community Question Answering ◽

Word Level ◽

Sentence Level ◽

Proposed Model ◽

Variational Autoencoder ◽

Matching Models ◽

Text Matching

Semantically matching two text sequences (usually two sentences) is a fundamental problem in NLP. Most previous methods either encode each of the two sentences into a vector representation (sentence-level embedding) or leverage word-level interaction features between the two sentences. In this study, we propose to take the sentence-level embedding features and the word-level interaction features as two distinct views of a sentence pair, and unify them with a framework of Variational Autoencoders such that the sentence pair is matched in a semi-supervised manner. The proposed model is referred to as Dual-View Variational AutoEncoder (DV-VAE), where the optimization of the variational lower bound can be interpreted as an implicit Co-Training mechanism for two matching models over distinct views. Experiments on SNLI, Quora and a Community Question Answering dataset demonstrate the superiority of our DV-VAE over several strong semi-supervised and supervised text matching models.

Download Full-text

Distant Supervision for Relation Extraction with Sentence Selection and Interaction Representation

Wireless Communications and Mobile Computing ◽

10.1155/2021/8889075 ◽

2021 ◽

Vol 2021 ◽

pp. 1-16

Author(s):

Tiantian Chen ◽

Nianbin Wang ◽

Hongbin Wang ◽

Haomin Zhan

Keyword(s):

Large Scale ◽

Semantic Information ◽

State Of The Art ◽

Relation Extraction ◽

Semantic Features ◽

Distant Supervision ◽

Word Level ◽

Proposed Model ◽

Relation Prediction ◽

Better Than

Distant supervision (DS) has been widely used for relation extraction (RE), which automatically generates large-scale labeled data. However, there is a wrong labeling problem, which affects the performance of RE. Besides, the existing method suffers from the lack of useful semantic features for some positive training instances. To address the above problems, we propose a novel RE model with sentence selection and interaction representation for distantly supervised RE. First, we propose a pattern method based on the relation trigger words as a sentence selector to filter out noisy sentences to alleviate the wrong labeling problem. After clean instances are obtained, we propose the interaction representation using the word-level attention mechanism-based entity pairs to dynamically increase the weights of the words related to entity pairs, which can provide more useful semantic information for relation prediction. The proposed model outperforms the strongest baseline by 2.61 in F1-score on a widely used dataset, which proves that our model performs significantly better than the state-of-the-art RE systems.

Download Full-text