Enhancing Web Search through Query Log Mining

Encyclopedia of Data Warehousing and Mining, Second Edition ◽

10.4018/978-1-60566-010-3.ch117 ◽

2011 ◽

pp. 758-763 ◽

Cited By ~ 2

Author(s):

Ji-Rong Wen

Keyword(s):

Information Retrieval ◽

Search Engine ◽

Web Mining ◽

Web Search ◽

Information Source ◽

Query Log ◽

Additional Information ◽

Query Logs ◽

Query Log Mining ◽

The Web

Web query log is a type of file keeping track of the activities of the users who are utilizing a search engine. Compared to traditional information retrieval setting in which documents are the only information source available, query logs are an additional information source in the Web search setting. Based on query logs, a set of Web mining techniques, such as log-based query clustering, log-based query expansion, collaborative filtering and personalized search, could be employed to improve the performance of Web search.

Download Full-text

Multi-Agent-Based Information Retrieval System Using Information Scent in Query Log Mining for Effective Web Search

Information Retrieval and Management ◽

10.4018/978-1-5225-5191-1.ch012 ◽

2018 ◽

pp. 266-291

Author(s):

Suruchi Chawla

Keyword(s):

Information Retrieval ◽

Web Search ◽

Multi Agent System ◽

Information Need ◽

Agent System ◽

Query Log ◽

Information Scent ◽

Multi Agent ◽

Log Mining ◽

Query Log Mining

This chapter explains the multi-agent system for effective information retrieval using information scent in query log mining. The precision of search results is low due to difficult to infer the information need of the small size search query and therefore information need of the user is not satisfied effectively. Information Scent is used for modeling the information need of user web search session and clustering is performed to identify the similar information need sessions. Hyper Link-Induced Topic Search (HITS) is executed on clusters to generate the Hubs and authorities for web page recommendations to users who search with similar intents. This multi-agent system based on clustered query sessions uses query operations like expansion and recommendation to infer the information need of user search queries and recommends Hubs and authorities for effective web search.

Download Full-text

Human-Centred Web Search

Next Generation Search Engines ◽

10.4018/978-1-4666-0330-1.ch010 ◽

2012 ◽

pp. 217-238 ◽

Cited By ~ 4

Author(s):

Orland Hoeber

Keyword(s):

Information Retrieval ◽

Decision Support ◽

Search Engine ◽

Information Needs ◽

Web Search ◽

Active Role ◽

Simple Fact ◽

Search Results ◽

Retrieval Engine ◽

The Web

People commonly experience difficulties when searching the Web, arising from an incomplete knowledge regarding their information needs, an inability to formulate accurate queries, and a low tolerance for considering the relevance of the search results. While simple and easy to use interfaces have made Web search universally accessible, they provide little assistance for people to overcome the difficulties they experience when their information needs are more complex than simple fact-verification. In human-centred Web search, the purpose of the search engine expands from a simple information retrieval engine to a decision support system. People are empowered to take an active role in the search process, with the search engine supporting them in developing a deeper understanding of their information needs, assisting them in crafting and refining their queries, and aiding them in evaluating and exploring the search results. In this chapter, recent research in this domain is outlined and discussed.

Download Full-text

Users’ Click and Bookmark Based Personalization Using Modified Agglomerative Clustering for Web Search Engine

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213017300022 ◽

2017 ◽

Vol 26 (06) ◽

pp. 1730002 ◽

Cited By ~ 3

Author(s):

T. Dhiliphan Rajkumar ◽

S. P. Raja ◽

A. Suruliandi

Keyword(s):

Information Retrieval ◽

Search Engine ◽

Search Engines ◽

Web Search ◽

Irrelevant Information ◽

Experimental Results ◽

Agglomerative Clustering ◽

Semantic Level ◽

Web Search Engine ◽

The Web

Short and ambiguous queries are the major problems in search engines which lead to irrelevant information retrieval for the users’ input. The increasing nature of the information on the web also makes various difficulties for the search engine to provide the users needed results. The web search engine experience the ill effects of ambiguity, since the queries are looked at on a rational level rather than the semantic level. In this paper, for improving the performance of search engine as of the users’ interest, personalization is based on the users’ clicks and bookmarking is proposed. Modified agglomerative clustering is used in this work for clustering the results. The experimental results prove that the proposed work scores better precision, recall and F-score.

Download Full-text

Research and Improvement on Content-Based Web Search Engine

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.532-533.1282 ◽

2012 ◽

Vol 532-533 ◽

pp. 1282-1286

Author(s):

Zhi Chao Lin ◽

Lei Sun ◽

Xiao Liu

Keyword(s):

Information Retrieval ◽

Search Engine ◽

Full Text ◽

Search Engines ◽

Query Expansion ◽

Web Search ◽

Text Search ◽

Web Search Engine ◽

Query Word ◽

The Web

There is a lot of information contained in the World Wide Web. It has become a research focus to obtain the required related resources quickly and accurately from the web through the content-based search engines. Most current tools of full text web search engine, such as Lucene which is a widely used open source retrieval library in information retrieval field, are purely keyword based. This may not sufficient for users to retrieve in the web. In this paper, we employ a method to overcome the limitations of current full text search engines in represent of Lucene. We propose a Query Expansion and Information Retrieval approach which can help users to acquire more accurate contents from the web. The Query Expansion component finds expanded candidate words of the query word through WordNet which contains synonyms in several different senses; In the Information Retrieval component, the query word and its candidate words are used together as the input of the search module to get the result items. Furthermore, we can put the result items into different classes based on the expansion. Some experiments and the results are described in the late part of this paper.

Download Full-text

Web Information Retrieval

Social Information Technology ◽

10.4018/978-1-59904-774-4.ch016 ◽

2011 ◽

pp. 218-252 ◽

Cited By ~ 1

Author(s):

Guillaume Cabanac ◽

Max Chevalier ◽

Claude Chrisment ◽

Christine Julien ◽

Chantal Soulé-Dupuy ◽

...

Keyword(s):

Social Networks ◽

Information Retrieval ◽

Search Engine ◽

Information Search ◽

Information Needs ◽

Information Source ◽

Web Information Retrieval ◽

Web Information ◽

The Social ◽

The Web

Nowadays, the Web has become the most queried information source. To solve their information needs, individuals can use different types of tools or services like a search engine, for instance. Due to the high amount of information and the diversity of human factors, searching for information requires patience, perseverance, and sometimes luck. To help individuals during this task, search assistants feature adaptive techniques aiming at personalizing retrieved information. Moreover, thanks to the “new Web” (the Web 2.0), personal search assistants are evolving, using social techniques (social networks, sharing-based methods). Let us enter into the Social Web, where everyone collaborates with others in providing their experience, their expertise. This chapter introduces search assistants and underlines their evolution toward Social Information Search Assistants.

Download Full-text

Multi-Agent-Based Information Retrieval System Using Information Scent in Query Log Mining for Effective Web Search

Advances in Data Mining and Database Management - Web Data Mining and the Development of Knowledge-Based Decision Support Systems ◽

10.4018/978-1-5225-1877-8.ch008 ◽

2017 ◽

pp. 131-156

Author(s):

Suruchi Chawla

Keyword(s):

Information Retrieval ◽

Web Search ◽

Multi Agent System ◽

Information Need ◽

Agent System ◽

Query Log ◽

Information Scent ◽

Multi Agent ◽

Log Mining ◽

Query Log Mining

This chapter explains the multi-agent system for effective information retrieval using information scent in query log mining. The precision of search results is low due to difficult to infer the information need of the small size search query and therefore information need of the user is not satisfied effectively. Information Scent is used for modeling the information need of user web search session and clustering is performed to identify the similar information need sessions. Hyper Link-Induced Topic Search (HITS) is executed on clusters to generate the Hubs and authorities for web page recommendations to users who search with similar intents. This multi-agent system based on clustered query sessions uses query operations like expansion and recommendation to infer the information need of user search queries and recommends Hubs and authorities for effective web search.

Download Full-text

Human-Centred Web Search

Data Mining ◽

10.4018/978-1-4666-2455-9.ch096 ◽

2013 ◽

pp. 1852-1872

Author(s):

Orland Hoeber

Keyword(s):

Information Retrieval ◽

Decision Support ◽

Search Engine ◽

Information Needs ◽

Web Search ◽

Active Role ◽

Simple Fact ◽

Search Results ◽

Retrieval Engine ◽

The Web

People commonly experience difficulties when searching the Web, arising from an incomplete knowledge regarding their information needs, an inability to formulate accurate queries, and a low tolerance for considering the relevance of the search results. While simple and easy to use interfaces have made Web search universally accessible, they provide little assistance for people to overcome the difficulties they experience when their information needs are more complex than simple fact-verification. In human-centred Web search, the purpose of the search engine expands from a simple information retrieval engine to a decision support system. People are empowered to take an active role in the search process, with the search engine supporting them in developing a deeper understanding of their information needs, assisting them in crafting and refining their queries, and aiding them in evaluating and exploring the search results. In this chapter, recent research in this domain is outlined and discussed.

Download Full-text

A Roadmap to Integrate Document Clustering in Information Retrieval

Information Retrieval Methods for Multidisciplinary Applications ◽

10.4018/978-1-4666-3898-3.ch003 ◽

2013 ◽

pp. 31-45

Author(s):

R. Subhashini ◽

V.Jawahar Senthil Kumar

Keyword(s):

Information Retrieval ◽

Search Engines ◽

World Wide ◽

Clustering Algorithm ◽

Web Search ◽

Full Potential ◽

Digital Information ◽

Search Results ◽

The World ◽

The Web

The World Wide Web is a large distributed digital information space. The ability to search and retrieve information from the Web efficiently and effectively is an enabling technology for realizing its full potential. Information Retrieval (IR) plays an important role in search engines. Today’s most advanced engines use the keyword-based (“bag of words”) paradigm, which has inherent disadvantages. Organizing web search results into clusters facilitates the user’s quick browsing of search results. Traditional clustering techniques are inadequate because they do not generate clusters with highly readable names. This paper proposes an approach for web search results in clustering based on a phrase based clustering algorithm. It is an alternative to a single ordered result of search engines. This approach presents a list of clusters to the user. Experimental results verify the method’s feasibility and effectiveness.

Download Full-text

AntWeb—Web Search Based on Ant Behavior

Emerging Technologies of Text Mining ◽

10.4018/978-1-59904-373-9.ch010 ◽

2008 ◽

pp. 208-222

Author(s):

Li Weigang ◽

Wu Man Qi

Keyword(s):

Web Mining ◽

Web Search ◽

Theory Model ◽

Web Pages ◽

Web Portal ◽

Knowledge Based ◽

Log Files ◽

Ant Behavior ◽

Shortest Route ◽

The Web

This chapter presents a study of Ant Colony Optimization (ACO) to Interlegis Web portal, Brazilian legislation Website. The approach of AntWeb is inspired by ant colonies foraging behavior to adaptively mark the most significant link by means of the shortest route to arrive the target pages. The system considers the users in the Web portal as artificial ants and the links among the pages of the Web pages as the researching network. To identify the group of the visitors, Web mining is applied to extract knowledge based on preprocessing Web log files. The chapter describes the theory, model, main utilities and implementation of AntWeb prototype in Interlegis Web portal. The case study shows Off-line Web mining; simulations without and with the use of AntWeb; testing by modification of the parameters. The result demonstrates the sensibility and accessibility of AntWeb and the benefits for the Interlegis Web users.

Download Full-text