query log Latest Research Papers

I Know What You Need: Investigating Document Retrieval Effectiveness with Partial Session Contexts

ACM Transactions on Information Systems ◽

10.1145/3488667 ◽

2022 ◽

Vol 40 (3) ◽

pp. 1-30

Author(s):

Procheta Sen ◽

Debasis Ganguly ◽

Gareth J. F. Jones

Keyword(s):

Relevant Information ◽

Document Retrieval ◽

Context Information ◽

Information Need ◽

Search System ◽

Query Log ◽

Sequence Modeling ◽

Joint Embedding ◽

One Step ◽

A Current

Reducing user effort in finding relevant information is one of the key objectives of search systems. Existing approaches have been shown to effectively exploit the context of a user's current search session to automatically suggest queries and thereby reduce search effort. However, these approaches do not accomplish the end goal of a search system, namely retrieving a set of potentially relevant documents for the evolving information need during a search session. This article takes the problem of query prediction one step further by investigating the problem of contextual recommendation within a search session. More specifically, given the partial context information of a session in the form of a small number of queries, we investigate how a search system can effectively predict the documents that a user would have been presented with had they continued the search session by submitting subsequent queries. To address the problem, we propose a model of contextual recommendation that seeks to capture the underlying semantics of information need transitions in the current user's search context. This model leverages information from past interactions of other users, drawn from an existing search log, that are similar to the current context. To identify similar interactions, as a novel contribution, we propose an embedding approach that jointly learns representations of individual query terms and of queries in their entirety from search log data by leveraging session-level containment relationships. Our experiments, conducted on a large query log (the AOL log), demonstrate that using a joint embedding of queries and their terms within our proposed document retrieval framework outperforms a number of text-only and sequence-modeling-based baselines.

Mining Domain Terminologies Using Search Engine's Query Log

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3462327 ◽

2021 ◽

Vol 20 (6) ◽

pp. 1-32

Author(s):

Weijian Ni ◽

Tong Liu ◽

Qingtian Zeng ◽

Nengfu Xie

Keyword(s):

Language Processing ◽

Heterogeneous Network ◽

Transductive Learning ◽

Query Log ◽

Domain Specific ◽

Novel Approach ◽

Commercial Search Engine ◽

Domain Expertise ◽

Traditional Approaches ◽

Domain Independent

Domain terminologies are a basic resource for various natural language processing tasks. To automatically discover terminologies for a domain of interest, most traditional approaches rely on a domain-specific corpus given in advance; thus, their performance can only be guaranteed when a high-quality domain-specific corpus has been collected, which requires extensive human involvement and domain expertise. In this article, we propose a novel approach that is capable of automatically mining domain terminologies using a search engine's query log, a type of domain-independent corpus with higher availability, coverage, and timeliness than a manually collected domain-specific corpus. In particular, we represent the query log as a heterogeneous network and formulate the task of mining domain terminology as transductive learning on that network. In the proposed approach, the manifold structure of domain-specificity inherent in the query log is captured by a novel network embedding algorithm and further exploited to reduce the manual annotation effort needed for domain terminology classification. We select Agriculture and Healthcare as the target domains and experiment using a real query log from a commercial search engine. Experimental results show that the proposed approach outperforms several state-of-the-art approaches.

Random Forest Bagging and X-Means Clustered Antipattern Detection from SQL Query Log for Accessing Secure Mobile Data

Wireless Communications and Mobile Computing ◽

10.1155/2021/2730246 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Rajesh Kumar Dhanaraj ◽

Vinothsaravanan Ramakrishnan ◽

M. Poongodi ◽

Lalitha Krishnasamy ◽

Mounir Hamdi ◽

...

Keyword(s):

False Positive ◽

Time Complexity ◽

False Positive Rate ◽

Detection Accuracy ◽

Mobile Data ◽

Query Log ◽

Weak Learner ◽

Computational Overhead ◽

Positive Rate ◽

Sql Query

In the current ongoing crisis, people mostly rely on mobile phones for all their activities, but query analysis and mobile data security are major issues. Several research works have addressed the efficient detection of antipatterns to minimize the complexity of query analysis. However, more focus needs to be given to accuracy. In addition, a clustering process has been used to group similar antipatterns and eliminate design errors. To address these issues and further enhance antipattern detection accuracy with minimum time and false-positive rate, this work proposes the Random Forest Bagging X-means SQL Query Clustering (RFBXSQLQC) technique. Different patterns or queries are first gathered from the input SQL query log, and bootstrap samples are created. Then, for each pattern, various weak clusters are constructed via X-means clustering and used as the weak learners. During this process, the input patterns are categorized into different clusters. Using the Bayesian information criterion, a similarity measure is employed to evaluate the similarity between the patterns and the cluster weight. Based on the similarity value, patterns are assigned to either relevant or irrelevant groups. The weak learner results are then aggregated by majority vote to form strong clusters in minimum time. Experiments are conducted to evaluate the performance of the RFBXSQLQC technique on the IIT Bombay dataset, using metrics such as antipattern detection accuracy, time complexity, false-positive rate, and computational overhead with respect to varying numbers of queries. The results show that, relative to existing algorithms, the RFBXSQLQC technique improves antipattern detection accuracy by 19% and reduces time complexity by 34%, false-positive rate by 64%, and computational overhead by 31%.

What Were People Searching For? A Query Log Analysis of An Academic Search Engine

10.1109/jcdl52503.2021.00062 ◽

2021 ◽

Author(s):

Shaurya Rohatgi ◽

C. Lee Giles ◽

Jian Wu

Keyword(s):

Search Engine ◽

Log Analysis ◽

Query Log ◽

Query Log Analysis ◽

Academic Search

Relevant Feedback-Based User-Query Log Recommender System from Public Repository

Information and Communication Technology for Intelligent Systems - Smart Innovation, Systems and Technologies ◽

10.1007/978-981-15-7078-0_54 ◽

2020 ◽

pp. 555-568

Author(s):

V. Kakulapati ◽

D. Vasumathi ◽

G. Suryanarayana

Keyword(s):

Recommender System ◽

Public Repository ◽

Query Log ◽

Relevant Feedback ◽

User Query

Time-aware query suggestion diversification for temporally ambiguous queries

The Electronic Library ◽

10.1108/el-12-2019-0296 ◽

2020 ◽

Vol 38 (4) ◽

pp. 725-744

Author(s):

Xiaojuan Zhang ◽

Xixi Jiang ◽

Jiewen Qin

Keyword(s):

Digital Libraries ◽

Web Search ◽

State Of The Art ◽

Experimental Information ◽

Query Suggestion ◽

High Coverage ◽

Query Log ◽

Content Type ◽

Search Tasks ◽

Time Aware

Purpose: The purpose of this study is to generate diversified results for temporally ambiguous queries, ensuring that the candidate queries have a high coverage of subtopics derived from different temporal periods. Design/methodology/approach: Two novel time-aware query suggestion diversification models are developed by integrating the semantic and temporal information involved in queries into two state-of-the-art explicit diversification algorithms (i.e. IA-select and xQuaD), respectively, and then specifying the components on which these two models rely. Most importantly, the study first explores how to explicitly determine query subtopics for each unique query from the query log or clicked documents, and then how to model those subtopics in query suggestion diversification. A discussion of how to mine the temporal intent behind a query from the query log follows. Finally, to verify the effectiveness of the proposal, experiments on a real-world query log are conducted. Findings: Preliminary experiments demonstrate that the proposed method can significantly outperform the existing state-of-the-art methods in terms of producing candidate query suggestions for temporally ambiguous queries. Originality/value: This study reports the first attempt to generate query suggestions that indicate diverse time points of interest for temporally ambiguous (input) queries. The research will be useful in enhancing users' search experience by helping them formulate accurate queries for their search tasks. In addition, the approaches investigated in the paper are general enough to be used in many domains; that is, experimental information retrieval systems, Web search engines, document archives and digital libraries.
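As a rough illustration of the explicit diversification idea this abstract builds on, the following is a minimal sketch of plain xQuaD-style greedy selection applied to query suggestions. It is not the authors' time-aware model: the function name `xquad_select`, the example query, and all probabilities are hypothetical, and the subtopics are simply assumed to be temporal periods. At each step the candidate maximizing (1 - λ)·P(c|q) + λ·Σ_s P(s|q)·P(c|s)·Π_{c' selected}(1 - P(c'|s)) is picked.

```python
from typing import Dict, List


def xquad_select(
    candidates: List[str],
    relevance: Dict[str, float],            # assumed P(candidate | query)
    subtopic_weight: Dict[str, float],      # assumed P(subtopic | query)
    coverage: Dict[str, Dict[str, float]],  # assumed P(candidate | subtopic)
    k: int,
    lam: float = 0.5,
) -> List[str]:
    """Greedy xQuaD-style selection of k diversified suggestions."""
    selected: List[str] = []
    remaining = list(candidates)
    # For each subtopic, the probability that no selected item covers it yet.
    not_covered = {s: 1.0 for s in subtopic_weight}

    while remaining and len(selected) < k:
        def score(c: str) -> float:
            diversity = sum(
                subtopic_weight[s] * coverage[s].get(c, 0.0) * not_covered[s]
                for s in subtopic_weight
            )
            return (1.0 - lam) * relevance[c] + lam * diversity

        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
        # Subtopics covered by the chosen item now contribute less novelty.
        for s in subtopic_weight:
            not_covered[s] *= 1.0 - coverage[s].get(best, 0.0)
    return selected


# Hypothetical data for the temporally ambiguous query "olympics".
CANDIDATES = [
    "olympics london 2012",
    "olympics 2012 medal table",
    "olympics rio 2016",
    "olympics tokyo 2020",
]
RELEVANCE = {
    "olympics london 2012": 0.40,
    "olympics 2012 medal table": 0.35,
    "olympics rio 2016": 0.15,
    "olympics tokyo 2020": 0.10,
}
PERIOD_WEIGHT = {"2012": 0.5, "2016": 0.3, "2020": 0.2}
COVERAGE = {
    "2012": {"olympics london 2012": 0.9, "olympics 2012 medal table": 0.8},
    "2016": {"olympics rio 2016": 0.9},
    "2020": {"olympics tokyo 2020": 0.9},
}

top2 = xquad_select(CANDIDATES, RELEVANCE, PERIOD_WEIGHT, COVERAGE, k=2)
```

With these made-up numbers, the second pick favors a suggestion from a different period over the second most relevant 2012 suggestion, which is the behavior diversification is meant to produce. Setting `lam=0.0` reduces the procedure to ranking by relevance alone.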

BIBSQLQC: Brown infomax boosted SQL query clustering algorithm to detect anti-patterns in the query log

TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES ◽

10.3906/elk-1912-108 ◽

2020 ◽

Vol 28 (4) ◽

pp. 2200-2212

Author(s):

Vinothsaravanan RAMAKRISHNAN ◽

Chenniappan PALANISAMY

Keyword(s):

Clustering Algorithm ◽

Query Log ◽

Sql Query

Incremental Refinement of Page Ranking of Web Pages

International Journal of Information Retrieval Research ◽

10.4018/ijirr.2020070104 ◽

2020 ◽

Vol 10 (3) ◽

pp. 57-73

Author(s):

Prem Sagar Sharma ◽

Divakar Yadav

Keyword(s):

Information Retrieval ◽

Web Pages ◽

Web Based ◽

Query Log ◽

Large Size ◽

Retrieval Systems ◽

Page Ranking ◽

User Query ◽

Information Retrieval Systems ◽

Ranking Mechanism

Web-based information retrieval systems, known as search engines, have made things easy for information seekers, but they still do not guarantee the relevance of the information provided to users. Information retrieval systems provide information to the user based on certain retrieval criteria. Due to the large size of the WWW, it is very common for a large number of documents to be identified as related to a particular domain. Therefore, to help users find the best-matching documents, the search engine employs a ranking mechanism. In this article, an improved architecture for an information retrieval system is proposed. The proposed system creates a query log entry for each user query and stores the results returned to the user for that query. The system also provides relevant results by analyzing the content of the pages retrieved for the user query.

The Slow Query Log

MySQL 8 Query Performance Tuning ◽

10.1007/978-1-4842-5584-1_9 ◽

2020 ◽

pp. 153-164

Author(s):

Jesper Wisborg Krogh

Keyword(s):

Query Log

Selection of the optimal type of thermal insulation structure based on the neural network modelling

E3S Web of Conferences ◽

10.1051/e3sconf/202021601037 ◽

2020 ◽

Vol 216 ◽

pp. 01037

Author(s):

Irina Akhmetova ◽

Elena Balzamova ◽

Veronika Bronskaya ◽

Denis Balzamov ◽

Konstantin Lapin ◽

...

Keyword(s):

Neural Network ◽

Web Application ◽

District Heating ◽

Heat Network ◽

Network Modelling ◽

Query Log ◽

Neural Network Modelling ◽

The Neural Network ◽

Selection Of ◽

Optimal Type

A software package with a user interface for calculating, analyzing and predicting the parameters of cogeneration-based district heating, based on neural network modelling, is presented in order to optimize heat networks and ensure their reliability. The package is the basis for a web application that calculates the characteristics of the heat network in accordance with the model, keeps a query log and supports administration.
