Characteristic Sets and Generalized Maximal Consistent Blocks in Mining Incomplete Data

Author(s):  
Patrick G. Clark ◽  
Cheng Gao ◽  
Jerzy W. Grzymala-Busse ◽  
Teresa Mroczek
2018 ◽  
Vol 453 ◽  
pp. 66-79 ◽  
Author(s):  
Patrick G. Clark ◽  
Cheng Gao ◽  
Jerzy W. Grzymala-Busse ◽  
Teresa Mroczek

2021 ◽  
pp. 3-17
Author(s):  
Patrick G. Clark ◽  
Jerzy W. Grzymala-Busse ◽  
Zdzislaw S. Hippe ◽  
Teresa Mroczek

2020 ◽  
Author(s):  
Patrick G Clark ◽  
Cheng Gao ◽  
Jerzy W Grzymala-Busse ◽  
Teresa Mroczek ◽  
Rafal Niemiec

Abstract In this paper, missing attribute values in incomplete data sets have three possible interpretations: lost values, attribute-concept values and ‘do not care’ conditions. For rule induction, we use characteristic sets and generalized maximal consistent blocks. Therefore, we apply six different approaches for data mining. As follows from our previous experiments, where we used an error rate evaluated by ten-fold cross validation as the main criterion of quality, no approach is universally the best. Thus, we decided to compare our six approaches using complexity of rule sets induced from incomplete data sets. We show that the smallest rule sets are induced from incomplete data sets with attribute-concept values, while the most complicated rule sets are induced from data sets with lost values. The choice between interpretations of missing attribute values is more important than the choice between characteristic sets and generalized maximal consistent blocks.


2006 ◽  
Vol 50 (2) ◽  
pp. 584
Author(s):  
Soo Jung Park ◽  
Dong Wan Shin ◽  
Byeong Uk Park ◽  
Woo Chul Kim ◽  
Man-Suk Oh

Sign in / Sign up

Export Citation Format

Share Document