Parallel Association Rules Pruning Algorithm on Hadoop MapReduce

Review and comparison of Apriori algorithm implementations on Hadoop-MapReduce and Spark

The Knowledge Engineering Review ◽

10.1017/s0269888918000127 ◽

2018 ◽

Vol 33 ◽

Cited By ~ 4

Author(s):

Eduardo P. S. Castro ◽

Thiago D. Maia ◽

Marluce R. Pereira ◽

Ahmed A. A. Esmin ◽

Denilson A. Pereira

Keyword(s):

Association Rules ◽

Data Sets ◽

Apriori Algorithm ◽

Mapreduce Framework ◽

Data Set ◽

Hadoop Mapreduce ◽

Detailed Assessment ◽

Mining Association Rules

Abstract: Several Apriori algorithm implementations for mining association rules have been proposed in the literature using the Hadoop-MapReduce framework and, more recently, Spark. However, none of these works has made a detailed assessment of their performance, for example by comparing them with other implementations across data sets with varying characteristics. In this work, we review the main algorithms proposed for Hadoop-MapReduce and compare their implementations in a single environment under several different conditions. Moreover, the implementations were adapted to Spark and compared under the same conditions. Based on the experimental results, we present a framework for recommending the most appropriate Apriori implementation for a given problem, according to the data set characteristics and the minimum required support. The results show that Spark implementations outperform Hadoop-MapReduce implementations in runtime in most experiments. However, no single implementation is the best in all the evaluated situations.


Clustering of Association Rules for Big Datasets using Hadoop MapReduce

International Journal of Advanced Computer Science and Applications ◽

10.14569/ijacsa.2021.0120364 ◽

2021 ◽

Vol 12 (3) ◽

Author(s):

Salahadin A. Moahmmed ◽

Mohamed A. ◽

El-Sayed M.

Keyword(s):

Association Rules ◽

Hadoop Mapreduce


Implementing Association Rules Technique to Predict Student Result based on Historical Data

PsycEXTRA Dataset ◽

10.1037/e667662012-005 ◽

2012 ◽

Author(s):

Azwa Abdul Aziz ◽

Julaily Aida Jusoh

Keyword(s):

Association Rules ◽

Historical Data


Interactive Data Mining: A Short Background Study on Effective Interaction and Visualization by Association Rules

2nd International conference on Innovative Engineering Technologies (ICIET'2015) August 7-8, 2015 Bangkok (Thailand) ◽

10.15242/iie.e0815001 ◽

2015 ◽

Keyword(s):

Data Mining ◽

Association Rules ◽

Effective Interaction ◽

Interactive Data Mining ◽

Interactive Data


An Effective Algorithm to Generate Positive and Negative Association Rules

International Journal of Innovative Research in Computer and Communication Engineering ◽

10.15680/ijircce.2014.0208031 ◽

2014 ◽

Vol 2 (8) ◽

pp. 5476-5481

Author(s):

Dr. B. Ramasubbareddy ◽

K. Srinivas ◽

B. Kavitharani

Keyword(s):

Association Rules ◽

Negative Association ◽

Effective Algorithm ◽

Negative Association Rules


Fast Pruning Algorithm and Task Scheduling under Map/Reduce

International Journal of Performability Engineering ◽

10.23940/ijpe.20.10.p14.16271636 ◽

2020 ◽

Vol 16 (10) ◽

pp. 1627

Author(s):

Pei Shujun ◽

Zhang Yu ◽

Liang Chao

Keyword(s):

Task Scheduling ◽

Map Reduce ◽

Pruning Algorithm ◽

And Task


Improved Macro-clusters generation using Top-k shared Micro-clusters in Data Streams

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse.v7i10.400 ◽

2017 ◽

Vol 7 (10) ◽

pp. 52

Author(s):

LAKSHMI PRANEETHA

Keyword(s):

Real Time ◽

Data Streams ◽

Bloom Filter ◽

Scientific Applications ◽

Pruning Algorithm ◽

Density Data ◽

Data Points ◽

Short Time ◽

Information Streams

Nowadays, data streams (information streams) are massive and change quickly. Their uses range from basic scientific applications to critical business and financial ones. In the online phase, useful information is abstracted from the stream and represented as micro-clusters. In the offline phase, micro-clusters are merged to form macro-clusters. The DBSTREAM technique captures the density between micro-clusters by means of a shared density graph in the online phase. The density data in this graph is then used in reclustering to improve cluster formation, but DBSTREAM takes more time when handling corrupted data points. In this paper, an early pruning algorithm is applied before pre-processing, and a Bloom filter is used to recognize corrupted information. Our experiments on real-time datasets show that this approach improves the efficiency of macro-clusters by 90% and generates more micro-clusters within a short time.


How Useful Can Be Data Mining For A Continuos Speech Therapist’s Education?

Balkan Region Conference on Engineering and Business Education ◽

10.2478/cplbu-2014-0050 ◽

2014 ◽

Vol 1 (1) ◽

pp. 339-342

Author(s):

Mirela Danubianu ◽

Dragos Mircea Danubianu

Keyword(s):

Data Mining ◽

Information And Communication Technology ◽

Association Rules ◽

Communication Technology ◽

Speech Therapy ◽

Proper Treatment ◽

Speech Impairments ◽

Information And Communication ◽

Specific Education

Abstract: Speech therapy can be viewed as a business in the logopaedic area that aims to offer services for correcting language. Proper treatment of speech impairments improves the efficiency of therapy; to achieve this, a therapist must continuously learn how to adjust their therapy methods to the patient's characteristics. Using Information and Communication Technology in this area has allowed a large amount of data to be collected on various aspects of treatment. These data can be used in a data mining process to find useful and usable patterns and models that help therapists improve their specific education. Clustering, classification, or association rules can provide unexpected information that helps to complete the therapist's knowledge and to adapt the therapy to the patient's needs.
