scholarly journals Selectivity Estimation for Exclusive Query Translation in Deep Web Data Integration

Author(s):  
Fangjiao Jiang ◽  
Weiyi Meng ◽  
Xiaofeng Meng
2013 ◽  
Vol 756-759 ◽  
pp. 1855-1859
Author(s):  
Meng Juan Li ◽  
Lian Yin Jia ◽  
Jin Guo You ◽  
Jia Man Ding ◽  
Hai He Zhou

Deep web data integration has become the center of many research efforts in the recent few years. Near duplicate detection is very important for deep web integration system, there are seldom researches focusing on integrating deep web Integration and near duplicate detection together. In this paper, we develop a integration system, DWI-ndfree to solve this problem. The wrapper of DWI-ndfree consists of four parts: the form filler, the navigator, the extractor and the near duplicate detector. To find near duplicate records, we propose efficient algorithm CheckNearDuplicate. DWI-ndfree can integrate deep web data with near duplicate free and has been used to execute several web extraction and integration tasks efficiently.


Sign in / Sign up

Export Citation Format

Share Document