DWSpyder: a new schema extraction method for a deep web integration system

2019 ◽  
Vol 14 (2) ◽  
pp. 122
Author(s):  
Yasser Saissi ◽  
Ahmed Zellou ◽  
Ali Adri
2013 ◽  
Vol 756-759 ◽  
pp. 1855-1859
Author(s):  
Meng Juan Li ◽  
Lian Yin Jia ◽  
Jin Guo You ◽  
Jia Man Ding ◽  
Hai He Zhou

Deep web data integration has become the center of many research efforts in the recent few years. Near duplicate detection is very important for deep web integration system, there are seldom researches focusing on integrating deep web Integration and near duplicate detection together. In this paper, we develop a integration system, DWI-ndfree to solve this problem. The wrapper of DWI-ndfree consists of four parts: the form filler, the navigator, the extractor and the near duplicate detector. To find near duplicate records, we propose efficient algorithm CheckNearDuplicate. DWI-ndfree can integrate deep web data with near duplicate free and has been used to execute several web extraction and integration tasks efficiently.


2010 ◽  
Vol 3 (1-2) ◽  
pp. 1613-1616 ◽  
Author(s):  
Thomas Kabisch ◽  
Eduard C. Dragut ◽  
Clement Yu ◽  
Ulf Leser
Keyword(s):  
Deep Web ◽  

2009 ◽  
Vol 31 (8) ◽  
pp. 1412-1421
Author(s):  
Fang-Jiao JIANG ◽  
Xiao-Feng MENG ◽  
Lin-Lin JIA

Sign in / Sign up

Export Citation Format

Share Document