Design of a Least Cost (LC) Vertical Search Engine based on Domain Specific Hidden Web Crawler

2017 ◽  
Vol 7 (2) ◽  
pp. 19-33
Author(s):  
Sudhakar Ranjan ◽  
Komal Kumar Bhatia

Now days with the advent of internet technologies and ecommerce the need for smart search engine for human life is rising. The traditional search engines are not intelligent as well as smart and thus lead to the rise in searching costs. In this paper, architecture of a vertical search engine based on the domain specific hidden web crawler is proposed. To make a least cost vertical search engine improvement in the following techniques like: searching, indexing, ranking, transaction and query interface are suggested. The domain term analyzer filters the useless information to the maximum extent and finally provides the users with high precision information. Through the experimental result it is shown that the system works on accelerating the access, computation, storage, communication time, increased efficiency and work professionally.

The Dark Web ◽  
2018 ◽  
pp. 319-333
Author(s):  
Sudhakar Ranjan ◽  
Komal Kumar Bhatia

Now days with the advent of internet technologies and ecommerce the need for smart search engine for human life is rising. The traditional search engines are not intelligent as well as smart and thus lead to the rise in searching costs. In this paper, architecture of a vertical search engine based on the domain specific hidden web crawler is proposed. To make a least cost vertical search engine improvement in the following techniques like: searching, indexing, ranking, transaction and query interface are suggested. The domain term analyzer filters the useless information to the maximum extent and finally provides the users with high precision information. Through the experimental result it is shown that the system works on accelerating the access, computation, storage, communication time, increased efficiency and work professionally.


2010 ◽  
Vol 143-144 ◽  
pp. 1270-1274 ◽  
Author(s):  
Fan Zhang ◽  
Xiu Lan Feng ◽  
Jin Sheng Yuan

With the rapid growth of forestry information,the amount of information on forestry is increasing rapidly. Comprehensive search engine is powerful,but its speed and accuracy of industry search is limited, owing to a host of information. According to the definition of vertical search engines,Heritrix Web Crawler and full text search framework of Lucene,this paper is mainly concerned the information of capture,indexing and search strategies in order to achieve an ideal forestry vertical search engine design. Experiments compared with comprehensive have proved the effectiveness of the proposed method.


2018 ◽  
Vol 176 ◽  
pp. 03014
Author(s):  
Yaru Cao ◽  
Ning Ma ◽  
Fucheng Wan ◽  
Xiangzhen He

Based on the research of vertical search engine and cross-language information retrieval, a crosslanguage vertical search engine design for e-commerce platform is proposed. It aims to solve the problem that it is difficult for Internet users to quickly, efficiently, and comprehensively search for valuable products, especially ethnic minority netizens. Cross-language in this article mainly refers to the conversion of Chinese, English, and Tibetan. Using dictionary-based query translation method to translate query words to achieve cross-language function. Improved Heritrix designed a web crawler information collection method. Using HtmlParser to achieve structured information extraction, and using Lucene to build an index and achieve retrieval.


Author(s):  
Richard Berendsen ◽  
Bogomil Kovachev ◽  
Edgar Meij ◽  
Maarten de Rijke ◽  
Wouter Weerkamp

Sign in / Sign up

Export Citation Format

Share Document