Hybrid query expansion using lexical resources and word embeddings for sentence retrieval in question answering

2020 · Vol 514 · pp. 88-105
Author(s): Massimo Esposito, Emanuele Damiano, Aniello Minutolo, Giuseppe De Pietro, Hamido Fujita

2020 · Vol 10 (12) · pp. 4316
Author(s): Ivan Boban, Alen Doko, Sven Gotovac

Sentence retrieval is an information retrieval technique that aims to find sentences matching an information need. It is used for tasks such as question answering (QA) and novelty detection. Because sentence retrieval resembles document retrieval with a smaller unit of retrieval, document retrieval methods such as term frequency-inverse document frequency (TF-IDF), BM25, and language-modeling-based methods are also applied to it. The effect of partial matching of words on sentence retrieval is an issue that has not been analyzed. We believe there is substantial potential to improve sentence retrieval methods by considering this approach. We adapted TF-ISF (term frequency-inverse sentence frequency), BM25, and language-modeling-based methods to test the partial matching of terms by combining sentence retrieval with sequence similarity, which allows matching of words that are similar but not identical. All tests were conducted using data from the novelty tracks of the Text REtrieval Conference (TREC). The scope of this paper was to find out whether such an approach is generally beneficial to sentence retrieval; we did not examine in depth how partial matching helps or hinders the finding of relevant sentences.
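As a rough illustration of the idea (not the authors' exact formulation), partial term matching can be implemented with character-level sequence similarity, for example Python's difflib.SequenceMatcher; the 0.8 cutoff and the soft term-frequency helper below are illustrative choices, not values from the paper:

```python
from difflib import SequenceMatcher

def partial_match_weight(query_term, sentence_term, threshold=0.8):
    """Return a similarity weight in [0, 1] for two terms.

    Exact matches score 1.0; near matches (e.g. morphological
    variants) score their character-level similarity ratio if it
    reaches the threshold, and 0.0 otherwise.
    """
    if query_term == sentence_term:
        return 1.0
    ratio = SequenceMatcher(None, query_term, sentence_term).ratio()
    return ratio if ratio >= threshold else 0.0

def partial_tf(query_term, sentence_tokens, threshold=0.8):
    """Soft term frequency: sum of partial-match weights over a sentence."""
    return sum(partial_match_weight(query_term, t, threshold)
               for t in sentence_tokens)

# morphological variants match partially; unrelated terms do not
print(partial_match_weight("retrieval", "retrievals"))  # close to 1
print(partial_match_weight("apple", "retrieval"))       # 0.0
```

Such a soft term frequency can then stand in for the exact-match counts inside TF-ISF, BM25, or language-modeling scoring functions.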


Author(s): Miguel A. Silva-Fuentes, Hugo D. Calderon-Vilca, Edwin F. Calderon-Vilca, Flor C. Cardenas-Marino

Author(s): Kamal Al-Sabahi, Zhang Zuping

In the era of information overload, text summarization has become a focus of attention in a number of diverse fields, such as question answering systems, intelligence analysis, news recommendation systems, and search results in web search engines. A good document representation is the key to any successful summarizer, and learning this representation has become a very active research topic in natural language processing (NLP). Traditional approaches mostly fail to deliver a good representation, whereas word embeddings have shown excellent performance in learning one. In this paper, a modified BM25 combined with word embeddings is used to build sentence vectors from word vectors. The entire document is represented as a set of sentence vectors, the similarity between every pair of sentence vectors is computed, and TextRank, a graph-based model, is then used to rank the sentences. The summary is generated by picking the top-ranked sentences according to the compression rate. Two well-known datasets, DUC2002 and DUC2004, are used to evaluate the models. The experimental results show that the proposed models perform comprehensively better than state-of-the-art methods.
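A minimal sketch of the described pipeline, assuming toy word vectors, the standard BM25 weighting formula, cosine similarity, and a plain power-iteration TextRank (the paper's exact BM25 modification and parameters are not reproduced here):

```python
import math
import numpy as np

def bm25_idf(term, sentences):
    """Standard BM25 IDF, computed over sentences instead of documents."""
    n = sum(1 for s in sentences if term in s)
    return math.log((len(sentences) - n + 0.5) / (n + 0.5) + 1.0)

def sentence_vector(tokens, sentences, embeddings, avg_len, k1=1.2, b=0.75):
    """Sentence vector = BM25-weighted average of its word vectors."""
    dim = len(next(iter(embeddings.values())))
    vec = np.zeros(dim)
    for term in set(tokens):
        if term not in embeddings:
            continue
        tf = tokens.count(term)
        weight = bm25_idf(term, sentences) * tf * (k1 + 1) / (
            tf + k1 * (1 - b + b * len(tokens) / avg_len))
        vec += weight * np.asarray(embeddings[term])
    return vec

def textrank(sim, d=0.85, iters=50):
    """Power-iteration TextRank over a row-normalized similarity matrix."""
    n = sim.shape[0]
    np.fill_diagonal(sim, 0.0)
    row_sums = sim.sum(axis=1, keepdims=True)
    row_sums[row_sums == 0] = 1.0
    trans = sim / row_sums
    scores = np.full(n, 1.0 / n)
    for _ in range(iters):
        scores = (1 - d) / n + d * trans.T @ scores
    return scores

def summarize(sentences, embeddings, top_k=1):
    """Return the top_k sentences ranked by TextRank over cosine similarity."""
    avg_len = sum(map(len, sentences)) / len(sentences)
    vecs = [sentence_vector(s, sentences, embeddings, avg_len)
            for s in sentences]
    n = len(vecs)
    sim = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            denom = np.linalg.norm(vecs[i]) * np.linalg.norm(vecs[j])
            sim[i, j] = vecs[i] @ vecs[j] / denom if denom else 0.0
    scores = textrank(sim)
    ranked = sorted(range(n), key=lambda i: scores[i], reverse=True)
    return [sentences[i] for i in ranked[:top_k]]
```

In practice the toy embedding dictionary would be replaced by pretrained word vectors, and top_k would follow from the desired compression rate.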


2008 · Vol 13 (4) · pp. 505-508
Author(s): Keliang Jia, Xiuling Pang, Zhinuo Li, Xiaozhong Fan
