Visualizing the web search results with web search visualization using scatter plot

2013 ◽

pp. 31-45

Author(s):

R. Subhashini ◽

V.Jawahar Senthil Kumar

Keyword(s):

Information Retrieval ◽

Search Engines ◽

World Wide ◽

Clustering Algorithm ◽

Web Search ◽

Full Potential ◽

Digital Information ◽

Search Results ◽

The World ◽

The Web

The World Wide Web is a large distributed digital information space. The ability to search and retrieve information from the Web efficiently and effectively is an enabling technology for realizing its full potential. Information Retrieval (IR) plays an important role in search engines. Today’s most advanced engines use the keyword-based (“bag of words”) paradigm, which has inherent disadvantages. Organizing web search results into clusters facilitates the user’s quick browsing of search results. Traditional clustering techniques are inadequate because they do not generate clusters with highly readable names. This paper proposes an approach for web search results in clustering based on a phrase based clustering algorithm. It is an alternative to a single ordered result of search engines. This approach presents a list of clusters to the user. Experimental results verify the method’s feasibility and effectiveness.

Download Full-text

What is popular on Wikipedia and why?

First Monday ◽

10.5210/fm.v12i4.1765 ◽

2007 ◽

Cited By ~ 29

Author(s):

Anselm Spoerri

Keyword(s):

Search Engines ◽

Web Search ◽

Search Behavior ◽

Search Queries ◽

Search Results ◽

The Web

This paper analyzes which pages and topics are the most popular on Wikipedia and why. For the period of September 2006 to January 2007, the 100 most visited Wikipedia pages in a month are identified and categorized in terms of the major topics of interest. The observed topics are compared with search behavior on the Web. Search queries, which are identical to the titles of the most popular Wikipedia pages, are submitted to major search engines and the positions of popular Wikipedia pages in the top 10 search results are determined. The presented data helps to explain how search engines, and Google in particular, fuel the growth and shape what is popular on Wikipedia.

Download Full-text

BEYOND RANKED LISTS IN WEB SEARCH: AGGREGATING WEB CONTENT INTO TOPIC PAGES

International Journal of Semantic Computing ◽

10.1142/s1793351x10001103 ◽

2010 ◽

Vol 04 (04) ◽

pp. 509-534 ◽

Cited By ~ 3

Author(s):

NIRANJAN BALASUBRAMANIAN ◽

SILVIU CUCERZAN

Keyword(s):

Web Search ◽

Automatic Generation ◽

Selection Method ◽

Web Content ◽

Search Results ◽

Aggregate Information ◽

Search Logs ◽

The Web

We investigate the automatic generation of topic pages as an alternative to the current Web search paradigm. Topic pages explicitly aggregate information across documents, filter redundancy, and promote diversity of topical aspects. We propose a novel framework for building rich topical aspect models and selecting diverse information from the Web. In particular, we use Web search logs to build aspect models with various degrees of specificity, and then employ these aspect models as input to a sentence selection method that identifies relevant and non-redundant sentences from the Web. Automatic and manual evaluations on biographical topics show that topic pages built by our system compare favorably to regular Web search results and to MDS-style summaries of the Web results on all metrics employed.

Download Full-text

WEBYACHT: A CONCEPT-BASED SEARCH TOOL FOR WWW

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213099000105 ◽

1999 ◽

Vol 08 (02) ◽

pp. 137-156 ◽

Cited By ~ 1

Author(s):

CHING-CHI HSU ◽

CHIA-HUI CHANG

Keyword(s):

Information Search ◽

Relevance Feedback ◽

Web Search ◽

Automatic Assessment ◽

Feedback Mechanisms ◽

Document Ranking ◽

Search Results ◽

Web Information ◽

Search Tool ◽

The Web

This paper describes a Web information search tool called WebYacht. The goal of WebYacht is to solve the problem of imprecise search results in current Web search engines. Due to incomplete information given by users and the diversified information published on the Web, conventional document ranking based on an automatic assessment of document relevance to the query may not be the best approach when little information is given as in most cases. In order to clarify the ambiguity of the short queries given by users, WebYacht adopts cluster-based browsing model as well as relevance feedback to facilitate Web information search. The idea is to have users give two to three times more feedback in the same amount of time that would be required to give feedback for conventional feedback mechanisms. With the assistance of cluster-based representation provided by WebYacht, a lot of browsing labor can be reduced. In this paper, we explain the techniques used in the design of WebYacht and compare the performances of feedback interface designs and to conventional similarity ranking search results.

Download Full-text

A Study on Web Searching

Intelligent Agents for Data Mining and Information Retrieval ◽

10.4018/978-1-59140-194-0.ch014 ◽

2004 ◽

pp. 208-225

Author(s):

Shanfeng Zhu ◽

Xiaotie Deng ◽

Qizhi Fang ◽

Weimin Zhang

Keyword(s):

Search Engine ◽

Search Engines ◽

Web Search ◽

Experimental Results ◽

Web Searching ◽

Search Results ◽

Total Index ◽

Depth Study ◽

Web Search Engines ◽

The Web

Web search engines are one of the most popular services to help users find useful information on the Web. Although many studies have been carried out to estimate the size and overlap of the general web search engines, it may not benefit the ordinary web searching users, since they care more about the overlap of the top N (N=10, 20 or 50) search results on concrete queries, but not the overlap of the total index database. In this study, we present experimental results on the comparison of the overlap of the top N (N=10, 20 or 50) search results from AlltheWeb, Google, AltaVista and WiseNut for the 58 most popular queries, as well as for the distance of the overlapped results. These 58 queries are chosen from WordTracker service, which records the most popular queries submitted to some famous metasearch engines, such as MetaCrawler and Dogpile. We divide these 58 queries into three categories for further investigation. Through in-depth study, we observe a number of interesting results: the overlap of the top N results retrieved by different search engines is very small; the search results of the queries in different categories behave in dramatically different ways; Google, on average, has the highest overlap among these four search engines; each search engine tends to adopt a different rank algorithm independently.

Download Full-text

Clustering of the Web Search Results in Educational Recommender Systems

Educational Recommender Systems and Technologies ◽

10.4018/978-1-61350-489-5.ch007 ◽

2012 ◽

pp. 154-181 ◽

Cited By ~ 12

Author(s):

Constanta-Nicoleta Bodea ◽

Maria-Iuliana Dascalu ◽

Adina Lipai

Keyword(s):

Recommender Systems ◽

Clustering Algorithm ◽

Web Search ◽

Web Pages ◽

Lexical Database ◽

Assessment Task ◽

Search Results ◽

Meta Search ◽

Search Approach ◽

The Web

This chapter presents a meta-search approach, meant to deliver bibliography from the internet, according to trainees’ results obtained at an e-assessment task. The bibliography consists of web pages related to the knowledge gaps of the trainees. The meta-search engine is part of an education recommender system, attached to an e-assessment application for project management knowledge. Meta-search means that, for a specific query (or mistake made by the trainee), several search mechanisms for suitable bibliography (further reading) could be applied. The lists of results delivered by the standard search mechanisms are used to build thematically homogenous groups using an ontology-based clustering algorithm. The clustering process uses an educational ontology and WordNet lexical database to create its categories. The research is presented in the context of recommender systems and their various applications to the education domain.

Download Full-text

Human-Centred Web Search

Next Generation Search Engines ◽

10.4018/978-1-4666-0330-1.ch010 ◽

2012 ◽

pp. 217-238 ◽

Cited By ~ 4

Author(s):

Orland Hoeber

Keyword(s):

Information Retrieval ◽

Decision Support ◽

Search Engine ◽

Information Needs ◽

Web Search ◽

Active Role ◽

Simple Fact ◽

Search Results ◽

Retrieval Engine ◽

The Web

People commonly experience difficulties when searching the Web, arising from an incomplete knowledge regarding their information needs, an inability to formulate accurate queries, and a low tolerance for considering the relevance of the search results. While simple and easy to use interfaces have made Web search universally accessible, they provide little assistance for people to overcome the difficulties they experience when their information needs are more complex than simple fact-verification. In human-centred Web search, the purpose of the search engine expands from a simple information retrieval engine to a decision support system. People are empowered to take an active role in the search process, with the search engine supporting them in developing a deeper understanding of their information needs, assisting them in crafting and refining their queries, and aiding them in evaluating and exploring the search results. In this chapter, recent research in this domain is outlined and discussed.

Download Full-text

A Systematic Literature Review Of Web Search Personalization

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813666200224105551 ◽

2020 ◽

Vol 13 ◽

Author(s):

Sunny Sharma ◽

Vijay Rana

Keyword(s):

Literature Review ◽

Search Engines ◽

Web Search ◽

Geographical Location ◽

Future Research ◽

Research Directions ◽

Search Results ◽

Future Research Directions ◽

The Web

: The Existing studies have already revealed that the information on the web is increasing rapidly. Ambiguous queries and user’s ability to express their intention through queries have been one of the key challenges in retrieving the accurate search results from the search engine. This paper in response explored different methodologies proposed during 2005-2019 by the eminent researchers for recommending better search results. Some of these methodologies are based on the users’ geographical location while others rely on re- rank the web results and refinement of user’s query. Fellow researchers can use this literature, to define the fundamental literature for their own work. Further a brief case study of major search engines like Google, Yahoo, Bing etc. along with the techniques used by these search engines for personalization are also discussed. Finally, the paper discusses some current issues and challenges related to the personalization which further lays the future research directions.

Download Full-text

Overlap in the Web Search Results of Google and Bing

The Journal of Web Science ◽

10.1561/106.00000005 ◽

2016 ◽

Vol 2 (1) ◽

pp. 17-30 ◽

Cited By ~ 1

Author(s):

Rakesh Agrawal

Keyword(s):

Web Search ◽

Search Results ◽

The Web

Download Full-text

A Study on Web Searching

Data Warehousing and Mining ◽

10.4018/978-1-59904-951-9.ch115 ◽

2008 ◽

pp. 1926-1937

Author(s):

Shanfeng Chu ◽

Xiaotie Deng ◽

Qizhi Fang ◽

Weimin Zhang

Keyword(s):

Search Engine ◽

Search Engines ◽

Web Search ◽

Experimental Results ◽

Web Searching ◽

Search Results ◽

Total Index ◽

Depth Study ◽

Web Search Engines ◽

The Web

Web search engines are one of the most popular services to help users find useful information on the Web. Although many studies have been carried out to estimate the size and overlap of the general web search engines, it may not benefit the ordinary web searching users, since they care more about the overlap of the top N (N=10, 20 or 50) search results on concrete queries, but not the overlap of the total index database. In this study, we present experimental results on the comparison of the overlap of the top N (N=10, 20 or 50) search results from AlltheWeb, Google, AltaVista and WiseNut for the 58 most popular queries, as well as for the distance of the overlapped results. These 58 queries are chosen from WordTracker service, which records the most popular queries submitted to some famous metasearch engines, such as MetaCrawler and Dogpile. We divide these 58 queries into three categories for further investigation. Through in-depth study, we observe a number of interesting results: the overlap of the top N results retrieved by different search engines is very small; the search results of the queries in different categories behave in dramatically different ways; Google, on average, has the highest overlap among these four search engines; each search engine tends to adopt a different rank algorithm independently.

Download Full-text

Visualizing the web search results with web search visualization using scatter plot

A Roadmap to Integrate Document Clustering in Information Retrieval

What is popular on Wikipedia and why?

BEYOND RANKED LISTS IN WEB SEARCH: AGGREGATING WEB CONTENT INTO TOPIC PAGES

WEBYACHT: A CONCEPT-BASED SEARCH TOOL FOR WWW

A Study on Web Searching

Clustering of the Web Search Results in Educational Recommender Systems

Human-Centred Web Search

A Systematic Literature Review Of Web Search Personalization

Overlap in the Web Search Results of Google and Bing

A Study on Web Searching

Export Citation Format