AC-Stream: Associative classification over data streams using multiple class association rules

One of the main objectives of data mining as a promising multidisciplinary field in computer science is to provide a classification model to be used for decision support purposes. In the medical imaging domain, mammograms classification is a difficult diagnostic task which calls for development of automated classification systems. Associative classification, as a special case of association rules mining, has been adopted in classification problems for years. In this paper, an associative classification framework based on parallel mining of image blocks is proposed to be used for mammograms discrimination. Indeed, association rules mining is applied to a commonly used mammography image database to classify digital mammograms into three categories, namely normal, benign and malign. In order to do so, first images are preprocessed and then features are extracted from non-overlapping image blocks and discretized for rule discovery. Association rules are then discovered through parallel mining of transactional databases which correspond to the image blocks, and finally are used within a unique decision-making scheme to predict the class of unknown samples. Finally, experiments are conducted to assess the effectiveness of the proposed framework. Results show that the proposed framework proved successful in terms of accuracy, precision, and recall, and suggest that the framework could be used as the core of any future associative classifier to support mammograms discrimination.

Download Full-text

An Efficient Method for Associative Classification using Jaccard Measure

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.b1580.0982s1119 ◽

2019 ◽

Vol 8 (2S11) ◽

pp. 3448-3453

Keyword(s):

Association Rules ◽

Vital Role ◽

Classification Methods ◽

Associative Classification ◽

Data Mining Technique ◽

Target Class ◽

Interestingness Measures ◽

Mining Technique ◽

Major Disadvantage ◽

Traditional Classification

Classification is a data mining technique that categorizes the items in a database to target classes. The aim of classification is to accurately find the target class for each instance of the data. Associative classification is a classification method that uses Class Association Rules for classification. Associative classification is found to be often more accurate than some traditional classification methods. The major disadvantage of associative classification is the generation of redundant and weak class association rules. Weak class association rules results in increase in size and decrease in accuracy of the classifier. This paper proposes an efficient approach to build a compact and accurate classifier by using interestingness measures for pruning rules. Interestingness measures play a vital role in reducing the size and increasing the accuracy of classifier by pruning redundant or weak rules. Rules which are strong are retained and these rules are further used to build the classifier. The source of the data used in this paper is University of California Irvine Machine Learning Repository. The approach proposed in this paper is effective and the results show that the approach can produce a highly compact and accurate classifier

Download Full-text

Discovering Informative Association Rules for Associative Classification

2008 IEEE International Symposium on Knowledge Acquisition and Modeling Workshop ◽

10.1109/kamw.2008.4810675 ◽

2008 ◽

Author(s):

Zhitong Su ◽

Wei Song ◽

Danyang Cao ◽

Jinhong Li

Keyword(s):

Association Rules ◽

Associative Classification

Download Full-text

Finding Context Association Rules over Sensor-Actuator Data Streams

10.14257/astl.2014.62.19 ◽

2014 ◽

Cited By ~ 1

Author(s):

Ho Jin Woo ◽

Se Jung Shin ◽

Kil Hong Joo ◽

Won Suk Lee

Keyword(s):

Association Rules ◽

Data Streams ◽

Sensor Actuator

Download Full-text

Mining Positive and Negative Association Rules in Data Streams with a Sliding Window

2013 Fourth Global Congress on Intelligent Systems ◽

10.1109/gcis.2013.39 ◽

2013 ◽

Author(s):

Weimin Ouyang

Keyword(s):

Association Rules ◽

Data Streams ◽

Sliding Window ◽

Negative Association ◽

Negative Association Rules

Download Full-text

CMAR: accurate and efficient classification based on multiple class-association rules

Proceedings 2001 IEEE International Conference on Data Mining ◽

10.1109/icdm.2001.989541 ◽

2002 ◽

Cited By ~ 24

Author(s):

Wenmin Li ◽

Jiawei Han ◽

Jian Pei

Keyword(s):

Association Rules ◽

Multiple Class

Download Full-text

A Generic Approach for Mining Indirect Association Rules in Data Streams

Lecture Notes in Computer Science - Modern Approaches in Applied Intelligence ◽

10.1007/978-3-642-21822-4_11 ◽

2011 ◽

pp. 95-104 ◽

Cited By ~ 5

Author(s):

Wen-Yang Lin ◽

You-En Wei ◽

Chun-Hao Chen

Keyword(s):

Association Rules ◽

Data Streams ◽

Indirect Association

Download Full-text

Mining of Multiobjective Non-redundant Association Rules in Data Streams

Artificial Intelligence and Soft Computing - Lecture Notes in Computer Science ◽

10.1007/978-3-642-29350-4_9 ◽

2012 ◽

pp. 73-81

Author(s):

Anamika Gupta ◽

Naveen Kumar ◽

Vasudha Bhatnagar

Keyword(s):

Association Rules ◽

Data Streams

Download Full-text

Efficient Mining of Data Streams Using Associative Classification Approach

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s0218194015500059 ◽

2015 ◽

Vol 25 (03) ◽

pp. 605-631 ◽

Cited By ~ 6

Author(s):

Prasanna Lakshmi Kompalli ◽

Ramesh Kumar Cherku

Keyword(s):

Data Streams ◽

Processing Time ◽

Real Data ◽

Streaming Data ◽

Infinite Length ◽

Associative Classification ◽

Streaming Algorithm ◽

Scan Data ◽

Synthetic Datasets ◽

And Performance

Data stream associative classification poses many challenges to the data mining community. In this paper, we address four major challenges posed, namely, infinite length, extraction of knowledge with single scan, processing time, and accuracy. Since data streams are infinite in length, it is impractical to store and use all the historical data for training. Mining such streaming data for knowledge acquisition is a unique opportunity and even a tough task. A streaming algorithm must scan data once and extract knowledge. While mining data streams, processing time, and accuracy have become two important aspects. In this paper, we propose PSTMiner which considers the nature of data streams and provides an efficient classifier for predicting the class label of real data streams. It has greater potential when compared with many existing classification techniques. Additionally, we propose a compact novel tree structure called PSTree (Prefix Streaming Tree) for storing data. Extensive experiments conducted on 24 real datasets from UCI repository and synthetic datasets from MOA (Massive Online Analysis) show that PSTMiner is consistent. Empirical results show that performance of PSTMiner is highly competitive in terms of accuracy and performance time when compared with other approaches under windowed streaming model.

Download Full-text