Exploiting Various Word Embedding Models for Query Expansion in Microblog

Enhanced word embedding similarity measures using fuzzy rules for query expansion

2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) ◽

10.1109/fuzz-ieee.2017.8015482 ◽

2017 ◽

Cited By ~ 2

Author(s):

Qian Liu ◽

Heyan Huang ◽

Jie Lut ◽

Yang Gao ◽

Guangquan Zhang

Keyword(s):

Query Expansion ◽

Similarity Measures ◽

Word Embedding ◽

Fuzzy Rules

Download Full-text

Merchandise Recommendation for Retail Events with Word Embedding Weighted Tf-idf and Dynamic Query Expansion

The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval - SIGIR '18 ◽

10.1145/3209978.3210202 ◽

2018 ◽

Cited By ~ 1

Author(s):

Ted Tao Yuan ◽

Zezhong Zhang

Keyword(s):

Query Expansion ◽

Word Embedding ◽

Dynamic Query

Download Full-text

Query Expansion for Sentence Retrieval Using Pseudo Relevance Feedback and Word Embedding

Lecture Notes in Computer Science - Experimental IR Meets Multilinguality, Multimodality, and Interaction ◽

10.1007/978-3-319-65813-1_8 ◽

2017 ◽

pp. 97-103 ◽

Cited By ~ 1

Author(s):

Piyush Arora ◽

Jennifer Foster ◽

Gareth J. F. Jones

Keyword(s):

Relevance Feedback ◽

Query Expansion ◽

Word Embedding ◽

Sentence Retrieval ◽

Pseudo Relevance Feedback

Download Full-text

Enhancing Query Expansion Method Using Word Embedding

2019 IEEE 9th International Conference on System Engineering and Technology (ICSET) ◽

10.1109/icsengt.2019.8906317 ◽

2019 ◽

Author(s):

Nuhu Yusuf ◽

Mohd Amin Mohd Yunus ◽

Norfaradilla Wahid ◽

Noorhaniza Wahid ◽

Nazri Mohd Nawi ◽

...

Keyword(s):

Query Expansion ◽

Expansion Method ◽

Word Embedding

Download Full-text

Query Expansion menggunakan Word Embedding dan Pseudo Relevance Feedback

Register Jurnal Ilmiah Teknologi Sistem Informasi ◽

10.26594/register.v5i1.1385 ◽

2019 ◽

Vol 5 (1) ◽

pp. 47 ◽

Cited By ~ 1

Author(s):

Evan Tanuwijaya ◽

Safri Adam ◽

Mohammad Fatoni Anggris ◽

Agus Zainal Arifin

Keyword(s):

Relevance Feedback ◽

Query Expansion ◽

Relevant Information ◽

Word Embedding ◽

Natural Languages ◽

F Measure ◽

Pseudo Relevance Feedback ◽

Better Than

Kata kunci merupakan hal terpenting dalam mencari sebuah informasi. Penggunaan kata kunci yang tepat menghasilkan informasi yang relevan. Saat penggunaannya sebagai query, pengguna menggunakan bahasa yang alami, sehingga terdapat kata di luar dokumen jawaban yang telah disiapkan oleh sistem. Sistem tidak dapat memproses bahasa alami secara langsung yang dimasukkan oleh pengguna, sehingga diperlukan proses untuk mengolah kata-kata tersebut dengan mengekspansi setiap kata yang dimasukkan pengguna yang dikenal dengan Query Expansion (QE). Metode QE pada penelitian ini menggunakan Word Embedding karena hasil dari Word Embedding dapat memberikan kata-kata yang sering muncul bersama dengan kata-kata dalam query. Hasil dari word embedding dipakai sebagai masukan pada pseudo relevance feedback untuk diperkaya berdasarkan dokumen jawaban yang telah ada. Metode QE diterapkan dan diuji coba pada aplikasi chatbot. Hasil dari uji coba metode QE yang diterapkan pada chatbot didapatkan nilai recall, precision, dan F-measure masing-masing 100%; 70% dan 82,35 %. Hasil tersebut meningkat 1,49% daripada chatbot tanpa menggunakan QE yang pernah dilakukan sebelumnya yang hanya meraih akurasi sebesar 68,51%. Berdasarkan hasil pengukuran tersebut, QE menggunakan word embedding dan pseudo relevance feedback pada chatbot dapat mengatasi query masukan dari pengguna yang ambigu dan alami, sehingga dapat memberikan jawaban yang relevan kepada pengguna. Keywords are the most important words and phrases used to obtain relevant information on content. Although users make use of natural languages, keywords are processed as queries by the system due to its inability to process. The language directly entered by the user is known as query expansion (QE). The proposed QE in this research uses word embedding owing to its ability to provide words that often appear along with those in the query. The results are used as inputs to the pseudo relevance feedback to be enriched based on the existing documents. This method is also applied to the chatbot application and precision, and F-measure values of the results obtained were 100%, 70%, 82.35% respectively. The results are 1.49% better than chatbot without using QE with 68.51% accuracy. Based on the results of these measurements, QE using word embedding and pseudo which gave relevance feedback in chatbots can resolve ambiguous and natural user’s input queries thereby enabling the system retrieve relevant answers.

Download Full-text

Query Expansion Based on Wikipedia Word Embedding and BabelNet Method for Searching Arabic Documents

International Journal of Intelligent Engineering and Systems ◽

10.22266/ijies2019.1031.20 ◽

2019 ◽

Vol 12 (5) ◽

pp. 202-213

Author(s):

Maryamah Maryamah ◽

◽

Agus Arifin ◽

Riyanarto Sarno ◽

Yasuhiko Morimoto ◽

...

Keyword(s):

Query Expansion ◽

Word Embedding

Download Full-text

Word-embedding-based pseudo-relevance feedback for Arabic information retrieval

Journal of Information Science ◽

10.1177/0165551518792210 ◽

2018 ◽

Vol 45 (4) ◽

pp. 429-442 ◽

Cited By ~ 5

Author(s):

Abdelkader El Mahdaouy ◽

Saïd Ouatik El Alaoui ◽

Eric Gaussier

Keyword(s):

Information Retrieval ◽

Relevance Feedback ◽

Query Expansion ◽

Main Idea ◽

Word Embedding ◽

Average Precision ◽

Standard Arabic ◽

Arabic Information Retrieval ◽

The Mean ◽

Pseudo Relevance Feedback

Pseudo-relevance feedback (PRF) is a very effective query expansion approach, which reformulates queries by selecting expansion terms from top k pseudo-relevant documents. Although standard PRF models have been proven effective to deal with vocabulary mismatch between users’ queries and relevant documents, expansion terms are selected without considering their similarity to the original query terms. In this article, we propose a method to incorporate word embedding (WE) similarity into PRF models for Arabic information retrieval (IR). The main idea is to select expansion terms using their distribution in the set of top pseudo-relevant documents along with their similarity to the original query terms. Experiments are conducted on the standard Arabic TREC 2001/2002 collection using three neural WE models. The obtained results show that our PRF extensions significantly outperform their baseline PRF models. Moreover, they enhanced the baseline IR model by 22% and 68% for the mean average precision (MAP) and the robustness index (RI), respectively.

Download Full-text

Word-embedding-based query expansion: Incorporating Deep Averaging Networks in Arabic document retrieval

Journal of Information Science ◽

10.1177/01655515211040659 ◽

2021 ◽

pp. 016555152110406

Author(s):

Yasir Hadi Farhan ◽

Shahrul Azman Mohd Noah ◽

Masnizah Mohd ◽

Jaffar Atwan

Keyword(s):

Query Expansion ◽

Document Retrieval ◽

Word Embedding ◽

Expansion Strategy ◽

Network Layers ◽

Search Tasks ◽

User Query ◽

Linear Neural Network ◽

The Individual ◽

Expansion Strategies

One of the main issues associated with search engines is the query–document vocabulary mismatch problem, a long-standing problem in Information Retrieval (IR). This problem occurs when a user query does not match the content of stored documents, and it affects most search tasks. Automatic query expansion (AQE) is one of the most common approaches used to address this problem. Various AQE techniques have been proposed; these mainly involve finding synonyms or related words for the query terms. Word embedding (WE) is one of the methods that are currently receiving significant attention. Most of the existing AQE techniques focus on expanding the individual query terms rather the entire query during the expansion process, and this can lead to query drift if poor expansion terms are selected. In this article, we introduce Deep Averaging Networks (DANs), an architecture that feeds the average of the WE vectors produced by the Word2Vec toolkit for the terms in a query through several linear neural network layers. This average vector is assumed to represent the meaning of the query as a whole and can be used to find expansion terms that are relevant to the complete query. We explore the potential of DANs for AQE in Arabic document retrieval. We experiment with using DANs for AQE in the classic probabilistic BM25 model as well as for two recent expansion strategies: Embedding-Based Query Expansion approach (EQE1) and Prospect-Guided Query Expansion Strategy (V2Q). Although DANs did not improve all outcomes when used in the BM25 model, it outperformed all baselines when incorporated into the EQE1 and V2Q expansion strategies.

Download Full-text