Discovering Surprising Instances of Simpson's Paradox in Hierarchical Multidimensional Data

2008 ◽  
pp. 3235-3251
Author(s):  
Carem C. Fabris ◽  
Alex A. Freitas

This paper focuses on the discovery of surprising unexpected patterns based on a data mining method that consists of detecting instances of Simpson’s paradox. By its very nature, instances of this paradox tend to be surprising to the user. Previous work in the literature has proposed an algorithm for discovering instances of that paradox, but it addressed only flat data stored in a single relation. This work proposes a novel algorithm that considerably extends that previous work by discovering instances of Simpson’s paradox in hierarchical multidimensional data — the kind of data typically found in data warehouse and OLAP environments. Hence, the proposed algorithm can be regarded as integrating the areas of data mining and data warehousing by using an adapted data mining technique to discover surprising patterns from data warehouse and OLAP environments.

Author(s):  
Md. Sadeki Salman ◽  
Nazmun Naher Shila ◽  
Khalid Hasan ◽  
Piash Ahmed ◽  
Mumenunnessa Keya ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document