A General Framework for Web Content Filtering

2009 ◽  
Vol 13 (3) ◽  
pp. 215-249 ◽  
Author(s):  
Elisa Bertino ◽  
Elena Ferrari ◽  
Andrea Perego
The Dark Web ◽  
2018 ◽  
pp. 114-137
Author(s):  
Dilip Kumar Sharma ◽  
A. K. Sharma

Web crawlers specialize in downloading web content and analyzing and indexing from surface web, consisting of interlinked HTML pages. Web crawlers have limitations if the data is behind the query interface. Response depends on the querying party's context in order to engage in dialogue and negotiate for the information. In this paper, the authors discuss deep web searching techniques. A survey of technical literature on deep web searching contributes to the development of a general framework. Existing frameworks and mechanisms of present web crawlers are taxonomically classified into four steps and analyzed to find limitations in searching the deep web.


2006 ◽  
pp. 112-132 ◽  
Author(s):  
Elisa Bertino ◽  
Elena Ferrari ◽  
Andrea Perego

The need to filter online information in order to protect users from possible harmful content can be considered as one of the most compelling social issues derived from the transformation of the Web into a public information space. Despite that Web rating and filtering systems have been developed and made publicly available quite early, no effective approach has been established so far, due to the inadequacy of the proposed solutions. Web filtering is then a challenging research area, needing the definition and enforcement of new strategies, considering both the current limitations and the future developments of Web technologies—in particular, the upcoming Semantic Web. In this chapter, we provide an overview of how Web filtering issues have been addressed by the available systems, bringing in relief both their advantages and shortcomings, and outlining future trends. As an example of how a more accurate and flexible filtering can be enforced, we devote the second part of this chapter to describing a multi-strategy approach, of which the main characteristics are the integration of both list- and metadata-based techniques and the adoption of sophisticated metadata schemes (e.g., conceptual hierarchies and ontologies) for describing both users’ characteristics and Web pages content.


IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 98069-98082
Author(s):  
Om Prakash Patel ◽  
Neha Bharill ◽  
Aruna Tiwari ◽  
Vikram Patel ◽  
Ojas Gupta ◽  
...  

2005 ◽  
Vol 7 (6) ◽  
pp. 1183-1190 ◽  
Author(s):  
P.Y. Lee ◽  
S.C. Hui ◽  
A.C.M. Fong

Kybernetes ◽  
2009 ◽  
Vol 38 (9) ◽  
pp. 1541-1555 ◽  
Author(s):  
A.C.M. Fong ◽  
S.C. Hui ◽  
P.Y. Lee

2002 ◽  
Vol 17 (5) ◽  
pp. 48-57 ◽  
Author(s):  
P.Y. Lee ◽  
S.C. Hui ◽  
A.C.M. Fong

Sign in / Sign up

Export Citation Format

Share Document