Clustering Deep Web Databases Semantically

Author(s):  
Ling Song ◽  
Jun Ma ◽  
Po Yan ◽  
Li Lian ◽  
Dongmei Zhang
Keyword(s):  
Deep Web ◽  
2011 ◽  
Vol 8 (3) ◽  
pp. 779-799 ◽  
Author(s):  
Ying Wang ◽  
Huilai Li ◽  
Wanli Zuo ◽  
Fengling He ◽  
Xin Wang ◽  
...  

Ontology plays an important role in locating Domain-Specific Deep Web contents, therefore, this paper presents a novel framework WFF for efficiently locating Domain-Specific Deep Web databases based on focused crawling and ontology by constructing Web Page Classifier(WPC), Form Structure Classifier(FSC) and Form Content Classifier(FCC) in a hierarchical fashion. Firstly, WPC discovers potentially interesting pages based on ontology-assisted focused crawler. Then, FSC analyzes the interesting pages and determines whether these pages subsume searchable forms based on structural characteristics. Lastly, FCC identifies searchable forms that belong to a given domain in the semantic level, and stores these URLs of Domain- Specific searchable forms to a database. Through a detailed experimental evaluation, WFF framework not only simplifies discovering process, but also effectively determines Domain-Specific databases.


2006 ◽  
Vol 9 (4) ◽  
pp. 585-622 ◽  
Author(s):  
James Caverlee ◽  
Ling Liu ◽  
Daniel Rocco
Keyword(s):  
Deep Web ◽  

Sign in / Sign up

Export Citation Format

Share Document