Load Balancing Approach Parallel Algorithm for Frequent Pattern Mining

A New Parallel Algorithm for Frequent Pattern Mining

Journal of Computational Intelligence and Electronic Systems ◽

10.1166/jcies.2013.1048 ◽

2013 ◽

Vol 2 (1) ◽

pp. 55-59 ◽

Cited By ~ 1

Author(s):

Saeid Masoumi ◽

Raziyeh Tabatabaei ◽

Mohammad-Reza Feizi-Derakhshi ◽

Khatereh Tabatabaei

Keyword(s):

Parallel Algorithm ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern

Download Full-text

Apriori-based High Efficiency Load Balancing Parallel Data Mining Algorithms on Multi-core Architectures

International Journal of Grid and High Performance Computing ◽

10.4018/ijghpc.2015040106 ◽

2015 ◽

Vol 7 (2) ◽

pp. 77-99 ◽

Cited By ~ 4

Author(s):

Kun-Ming Yu ◽

Sheng-Hui Liu ◽

Li-Wei Zhou ◽

Shu-Hao Wu

Keyword(s):

Data Mining ◽

Load Balancing ◽

Pattern Mining ◽

High Efficiency ◽

Computation Time ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Parallel Data ◽

Parallel Data Mining ◽

Mining Methods

Frequent pattern mining has been playing an essential role in knowledge discovery and data mining tasks that try to find usable patterns from databases. Efficiency is especially crucial for an algorithm in order to find frequent itemsets from a large database. Numerous methods have been proposed to solve this problem, such as Apriori and FP-growth. These are regarded as fundamental frequent pattern mining methods. In addition, parallel computing architectures, such as an on-cloud platform, a grid system, multi-core and GPU platform, have been popular in data mining. However, most of the algorithms have been proposed without considering the prevalent multi-core architectures. In this study, multi-core architectures were used as well as two high efficiency load balancing parallel data mining methods based on the Apriori algorithm. The main goal of the proposed algorithms was to reduce the massive number of duplicate candidates generated using previous methods. This goal was achieved for, in this detailed experimental study the algorithms performed better than the previous methods. The experimental results demonstrated that the proposed algorithms had dramatically reduced computation time when using more threads. Moreover, the observations showed that the workload was equally balanced among the computing units.

Download Full-text

A fast and parallel algorithm for frequent pattern mining from big data in many-task environments

International Journal of High Performance Computing and Networking ◽

10.1504/ijhpcn.2017.084244 ◽

2017 ◽

Vol 10 (3) ◽

pp. 157 ◽

Cited By ~ 1

Author(s):

Wei Tee Lin ◽

Chih Ping Chu

Keyword(s):

Big Data ◽

Parallel Algorithm ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Task Environments

Download Full-text

A Distributed Algorithm for Fast Mining Frequent Patterns in Limited and Varying Network Bandwidth Environments

Applied Sciences ◽

10.3390/app9091859 ◽

2019 ◽

Vol 9 (9) ◽

pp. 1859 ◽

Cited By ~ 1

Author(s):

Chun-Cheng Lin ◽

Wei-Ching Li ◽

Ju-Chin Chen ◽

Wen-Yu Chung ◽

Sheng-Hao Chung ◽

...

Keyword(s):

Load Balancing ◽

Pattern Mining ◽

Empirical Evaluation ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Frequent Patterns ◽

Computing Systems ◽

Network Bandwidth ◽

Large Databases ◽

Computing Environments

Data mining is a set of methods used to mine hidden information from data. It mainly includes frequent pattern mining, sequential pattern mining, classification, and clustering. Frequent pattern mining is used to discover the correlation among various sets of items within large databases. The rapid upward trend in data size slows the mining of frequent patterns. Numerous studies have attempted to develop algorithms that operate in distributed computing environments to accelerate the mining process. FLR-mining (Fast, Load balancing and Resource efficient mining algorithm) is one of the fastest methods of mining with efficient consideration of load balancing and resources. FLR-mining can automatically determine the appropriate number of computing nodes. However, FLR-mining and existing methods assume that the network bandwidth is constant. In practical distributed and many-task computing systems, this assumption fails because there are packet collisions caused by many mining tasks that run in a simultaneous manner. Therefore, a method that can consider the varying network bandwidth is necessary. In this study, we propose a method that can rapidly mine frequent patterns under the varying network bandwidth. The proposed method can also determine the appropriate number of computing nodes to efficiently utilize computing resources and achieve load balancing. Through empirical evaluation, the proposed method is shown to deliver excellent performance in terms of execution efficiency and load balancing.

Download Full-text

A fast and parallel algorithm for frequent pattern mining from big data in many-task environments

International Journal of High Performance Computing and Networking ◽

10.1504/ijhpcn.2017.10005138 ◽

2017 ◽

Vol 10 (3) ◽

pp. 157

Author(s):

Wei Tee Lin ◽

Chih Ping Chu

Keyword(s):

Big Data ◽

Parallel Algorithm ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Task Environments

Download Full-text

A novel parallel algorithm for frequent pattern mining with privacy preserved in cloud computing environments

International Journal of Ad Hoc and Ubiquitous Computing ◽

10.1504/ijahuc.2010.035533 ◽

2010 ◽

Vol 6 (4) ◽

pp. 205 ◽

Cited By ~ 20

Author(s):

Kawuu W. Lin ◽

Der Jiunn Deng

Keyword(s):

Cloud Computing ◽

Parallel Algorithm ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Computing Environments

Download Full-text

An Adaptive Data Distribution Through Tree Rules in Frequent Pattern Mining

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit183894 ◽

2018 ◽

pp. 300-305

Keyword(s):

Information Sharing ◽

Pattern Mining ◽

Data Distribution ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

General Development ◽

Secure Information ◽

Evaluation Parameters ◽

Secure Information Sharing

Information sharing among the associations is a general development in a couple of zones like business headway and exhibiting. As bit of the touchy principles that ought to be kept private may be uncovered and such disclosure of delicate examples may impacts the advantages of the association that have the data. Subsequently the standards which are delicate must be secured before sharing the data. In this paper to give secure information sharing delicate guidelines are bothered first which was found by incessant example tree. Here touchy arrangement of principles are bothered by substitution. This kind of substitution diminishes the hazard and increment the utility of the dataset when contrasted with different techniques. Examination is done on certifiable dataset. Results shows that proposed work is better as appear differently in relation to various past strategies on the introduce of evaluation parameters.

Download Full-text

Learning and Synchronized Privacy Preserving Frequent Pattern Mining

Journal of Software ◽

10.3724/sp.j.1001.2011.04000 ◽

2011 ◽

Vol 22 (8) ◽

pp. 1749-1760

Author(s):

Yu-Hong GUO ◽

Yun-Hai TONG ◽

Shi-Wei TANG ◽

Leng-Dong WU

Keyword(s):

Pattern Mining ◽

Frequent Pattern Mining ◽

Privacy Preserving ◽

Frequent Pattern

Download Full-text

RAKING: An Efficient K-Maximal Frequent Pattern Mining Algorithm on Uncertain Graph Database

Chinese Journal of Computers ◽

10.3724/sp.j.1016.2010.01387 ◽

2010 ◽

Vol 33 (8) ◽

pp. 1387-1395 ◽

Cited By ~ 4

Author(s):

Meng HAN ◽

Wei ZHANG ◽

Jian-Zhong LI

Keyword(s):

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Graph Database ◽

Uncertain Graph ◽

Mining Algorithm ◽

Maximal Frequent Pattern

Download Full-text

Sliding window based weighted maximal frequent pattern mining over data streams

Expert Systems with Applications ◽

10.1016/j.eswa.2013.07.094 ◽

2014 ◽

Vol 41 (2) ◽

pp. 694-708 ◽

Cited By ~ 64

Author(s):

Gangin Lee ◽

Unil Yun ◽

Keun Ho Ryu

Keyword(s):

Data Streams ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Sliding Window ◽

Frequent Pattern ◽

Maximal Frequent Pattern

Download Full-text