Using mutual information to determine relevance in Bayesian networks

Author(s):  
A. E. Nicholson ◽  
N. Jitnah
2014 ◽  
Vol 275 ◽  
pp. 115-126 ◽  
Author(s):  
Miguel Angel Gómez-Villegas ◽  
Paloma Main ◽  
Paola Viviani

2014 ◽  
Vol 25 (4) ◽  
pp. 1-16
Author(s):  
Boris Rabinovich ◽  
Mark Last

In this paper, the authors propose a five-step approach to the problem of identifying semantic correspondences between attributes of two database schemas. It is one of the key challenges in many database applications such as data integration and data warehousing. The authors' research is focused on uninterpreted schema matching, where the column names and column values are uninterpreted or unreliable. The approach implements Bayesian networks, Pearson's correlation and mutual information to identify inter-attribute dependencies. Additionally, the authors propose an extension to their algorithm that allows the user to manually enter the known mappings to improve the automated matching results. The five-step approach also allows data privacy preservation. The authors' evaluation experiments show that the proposed approach enhances the current set of schema matching techniques.


Sign in / Sign up

Export Citation Format

Share Document