SIMHAR - Smart Distributed Web Crawler for the Hidden Web Using SIM+Hash and Redis Server

IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 117582-117592
Author(s):  
Sawroop Kaur ◽  
G. Geetha
Author(s):  
Rosy Madaan ◽  
Ashutosh Dixit ◽  
A. K. Sharma ◽  
Komal Kumar Bhatia

2021 ◽  
Vol 69 (3) ◽  
pp. 2933-2948
Author(s):  
Sawroop Kaur ◽  
Aman Singh ◽  
G. Geetha ◽  
Mehedi Masud ◽  
Mohammed A. Alzain

2014 ◽  
Vol 4 (2) ◽  
pp. 1-18
Author(s):  
Sonali Gupta ◽  
Komal Kumar Bhatia

A huge number of Hidden Web databases exist on the WWW, forming a massive source of high-quality information. Retrieving this information to enrich the search engine's repository is the prime target of a Hidden Web crawler. The crawler should also perform this task at an affordable cost and level of resource utilization. This paper proposes a Random ranking mechanism by which the queries to be issued by the Hidden Web crawler are ranked. By ranking the queries according to the proposed mechanism, the Hidden Web crawler is able to make an optimal choice among the candidate queries and efficiently retrieve content from the Hidden Web databases. The proposed Hidden Web crawler also has an extensible and scalable framework to improve the efficiency of crawling. The proposed approach is also compared with other Hidden Web crawling methods in the literature.


The Dark Web ◽  
2018 ◽  
pp. 319-333
Author(s):  
Sudhakar Ranjan ◽  
Komal Kumar Bhatia

Nowadays, with the advent of internet technologies and e-commerce, the need for smart search engines is rising. Traditional search engines are neither intelligent nor smart, which leads to a rise in search costs. In this paper, the architecture of a vertical search engine based on a domain-specific hidden web crawler is proposed. To build a least-cost vertical search engine, improvements are suggested in the following techniques: searching, indexing, ranking, transaction, and query interface. The domain term analyzer filters out useless information to the maximum extent and provides users with high-precision information. The experimental results show that the system accelerates access, computation, storage, and communication time, increases efficiency, and works professionally.


The Dark Web ◽  
2018 ◽  
pp. 65-83
Author(s):  
Sonali Gupta ◽  
Komal Kumar Bhatia

A huge number of Hidden Web databases exist on the WWW, forming a massive source of high-quality information. Retrieving this information to enrich the search engine's repository is the prime target of a Hidden Web crawler. The crawler should also perform this task at an affordable cost and level of resource utilization. This paper proposes a Random ranking mechanism by which the queries to be issued by the Hidden Web crawler are ranked. By ranking the queries according to the proposed mechanism, the Hidden Web crawler is able to make an optimal choice among the candidate queries and efficiently retrieve content from the Hidden Web databases. The proposed Hidden Web crawler also has an extensible and scalable framework to improve the efficiency of crawling. The proposed approach is also compared with other Hidden Web crawling methods in the literature.


2017 ◽  
Vol 7 (2) ◽  
pp. 19-33
Author(s):  
Sudhakar Ranjan ◽  
Komal Kumar Bhatia

Nowadays, with the advent of internet technologies and e-commerce, the need for smart search engines is rising. Traditional search engines are neither intelligent nor smart, which leads to a rise in search costs. In this paper, the architecture of a vertical search engine based on a domain-specific hidden web crawler is proposed. To build a least-cost vertical search engine, improvements are suggested in the following techniques: searching, indexing, ranking, transaction, and query interface. The domain term analyzer filters out useless information to the maximum extent and provides users with high-precision information. The experimental results show that the system accelerates access, computation, storage, and communication time, increases efficiency, and works professionally.


Author(s):  
Manvi ◽  
Komal Kumar Bhatia ◽  
Ashutosh Dixit
