Characteristic Sets and Generalized Maximal Consistent Blocks in Mining Incomplete Data

Rough Sets - Lecture Notes in Computer Science ◽

10.1007/978-3-319-60837-2_39 ◽

2017 ◽

pp. 477-486 ◽

Cited By ~ 5

Author(s):

Patrick G. Clark ◽

Cheng Gao ◽

Jerzy W. Grzymala-Busse ◽

Teresa Mroczek

Keyword(s):

Incomplete Data ◽

Characteristic Sets


A Comparison of Characteristic Sets and Generalized Maximal Consistent Blocks in Mining Incomplete Data

Communications in Computer and Information Science - Information Processing and Management of Uncertainty in Knowledge-Based Systems. Theory and Foundations ◽

10.1007/978-3-319-91476-3_40 ◽

2018 ◽

pp. 480-489

Author(s):

Patrick G. Clark ◽

Cheng Gao ◽

Jerzy W. Grzymala-Busse ◽

Teresa Mroczek

Keyword(s):

Incomplete Data ◽

Characteristic Sets


Complexity of Rule Sets in Mining Incomplete Data Using Characteristic Sets and Generalized Maximal Consistent Blocks

Lecture Notes in Computer Science - Hybrid Artificial Intelligent Systems ◽

10.1007/978-3-319-92639-1_8 ◽

2018 ◽

pp. 84-94

Author(s):

Patrick G. Clark ◽

Cheng Gao ◽

Jerzy W. Grzymala-Busse ◽

Teresa Mroczek ◽

Rafal Niemiec

Keyword(s):

Incomplete Data ◽

Characteristic Sets ◽

Rule Sets


Characteristic sets and generalized maximal consistent blocks in mining incomplete data

Information Sciences ◽

10.1016/j.ins.2018.04.025 ◽

2018 ◽

Vol 453 ◽

pp. 66-79 ◽

Cited By ~ 5

Author(s):

Patrick G. Clark ◽

Cheng Gao ◽

Jerzy W. Grzymala-Busse ◽

Teresa Mroczek

Keyword(s):

Incomplete Data ◽

Characteristic Sets


Mining Incomplete Data Using Global and Saturated Probabilistic Approximations Based on Characteristic Sets and Maximal Consistent Blocks

10.1007/978-3-030-87334-9_1 ◽

2021 ◽

pp. 3-17

Author(s):

Patrick G. Clark ◽

Jerzy W. Grzymala-Busse ◽

Zdzislaw S. Hippe ◽

Teresa Mroczek

Keyword(s):

Incomplete Data ◽

Characteristic Sets


Complexity of rule sets in mining incomplete data using characteristic sets and generalized maximal consistent blocks

Logic Journal of IGPL ◽

10.1093/jigpal/jzaa041 ◽

2020 ◽

Author(s):

Patrick G Clark ◽

Cheng Gao ◽

Jerzy W Grzymala-Busse ◽

Teresa Mroczek ◽

Rafal Niemiec

Keyword(s):

Data Mining ◽

Error Rate ◽

Incomplete Data ◽

Cross Validation ◽

Rule Induction ◽

Data Sets ◽

Main Criterion ◽

Characteristic Sets ◽

Rule Sets ◽

Fold Cross Validation

Abstract: In this paper, missing attribute values in incomplete data sets have three possible interpretations: lost values, attribute-concept values and ‘do not care’ conditions. For rule induction, we use two constructs: characteristic sets and generalized maximal consistent blocks. Combining the three interpretations with the two constructs gives six approaches to data mining. In our previous experiments, where the main criterion of quality was the error rate evaluated by ten-fold cross validation, no approach was universally the best. We therefore compare the six approaches by the complexity of the rule sets induced from incomplete data sets. We show that the smallest rule sets are induced from incomplete data sets with attribute-concept values, while the most complicated rule sets are induced from data sets with lost values. The choice between interpretations of missing attribute values is more important than the choice between characteristic sets and generalized maximal consistent blocks.
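
For orientation, the three interpretations of missing values can be made concrete with a small sketch of how characteristic sets might be computed. This follows the definitions commonly used in this line of work as we understand them; it is not code from the paper. The markers `?` (lost value), `*` (‘do not care’ condition) and `-` (attribute-concept value), the function name `characteristic_sets`, and the toy table are all assumptions introduced here for illustration. Generalized maximal consistent blocks are not covered.

```python
# Assumed markers for the three interpretations of a missing attribute value.
LOST, DONT_CARE, ATTR_CONCEPT = "?", "*", "-"
MISSING = {LOST, DONT_CARE, ATTR_CONCEPT}


def characteristic_sets(rows, decisions):
    """Return the characteristic set K(x) of every case x (hypothetical helper).

    rows: list of dicts mapping attribute name -> value or missing marker.
    decisions: list of decision values, one per case; equal values form a concept.
    """
    universe = set(range(len(rows)))
    attributes = list(rows[0].keys())

    def concept_values(x, a):
        # Specified values of attribute a among cases in the same concept as x.
        return {rows[y][a] for y in universe
                if decisions[y] == decisions[x] and rows[y][a] not in MISSING}

    def block(a, v):
        # Attribute-value block [(a, v)] for a specified value v.
        members = set()
        for y in universe:
            val = rows[y][a]
            if val == v or val == DONT_CARE:
                members.add(y)  # exact match, or 'do not care' joins every block
            elif val == ATTR_CONCEPT and v in concept_values(y, a):
                members.add(y)  # attribute-concept value joins its concept's blocks
            # a lost value joins no block
        return members

    def k(x, a):
        # Contribution of attribute a to the characteristic set of case x.
        val = rows[x][a]
        if val in (LOST, DONT_CARE):
            return set(universe)
        if val == ATTR_CONCEPT:
            vals = concept_values(x, a)
            if not vals:
                return set(universe)
            return set().union(*(block(a, v) for v in vals))
        return block(a, val)

    result = []
    for x in sorted(universe):
        kx = set(universe)
        for a in attributes:
            kx &= k(x, a)  # intersect over all attributes
        result.append(kx)
    return result


# Toy incomplete data set (invented for this sketch).
rows = [
    {"temperature": "high", "headache": "yes"},
    {"temperature": "?", "headache": "yes"},
    {"temperature": "*", "headache": "no"},
    {"temperature": "normal", "headache": "-"},
]
decisions = ["yes", "yes", "no", "no"]
K = characteristic_sets(rows, decisions)
```

In this sketch the interpretation of a missing value changes which attribute-value blocks a case belongs to, and therefore the characteristic sets of other cases. That is one way to see why the choice of interpretation can affect the induced rule sets.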
