scholarly journals Efficient and Effective Duplicate Detection in Hierarchical Data

2013 ◽  
Vol 25 (5) ◽  
pp. 1028-1041 ◽  
Author(s):  
L. Leitão ◽  
P. Calado ◽  
M. Herschel
2020 ◽  
Vol 17 (8) ◽  
pp. 3548-3552
Author(s):  
M. S. Roobini ◽  
P. B. S. Sumanth Kumar ◽  
P. B. Raviteja Reddy ◽  
Anitha Ponraj ◽  
J. Aruna

Although there is a long profession on distinguishing duplicates, just a handful in social data arrangements center around copy detection in ever more complex progressive systems, including XML data. Right now, present a novel technique for XML duplicate discovery, Renamed XMLDup. XMLDup utilizes a Bayesian algorithm defining the chance of duplicating two XML components, taking into account the data within the components, but also how data is structured. Likewise, to improve the effectiveness of Unit Review, Novel technique for pruning, equipped for noteworthy increases over the un-streamlined calculation rule, is introduced. We demonstrate through trials that our estimate is can accomplish high accuracy via trials we show our estimation is outflanking another cutting-edge duplicate discovery arrangement, both as far as proficiency and of adequacy.


Author(s):  
Duong Hoang ◽  
Brian Summa ◽  
Harsh Bhatia ◽  
Peter Lindstrom ◽  
Pavol Klacansky ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document