MR-ARM: A MAP-REDUCE ASSOCIATION RULE MINING FRAMEWORK

Association rule is one of the primary tasks in data mining that discovers correlations among items in a transactional database. The majority of vertical and horizontal association rule mining algorithms have been developed to improve the frequent items discovery step which necessitates high demands on training time and memory usage particularly when the input database is very large. In this paper, we overcome the problem of mining very large data by proposing a new parallel Map-Reduce (MR) association rule mining technique called MR-ARM that uses a hybrid data transformation format to quickly finding frequent items and generating rules. The MR programming paradigm is becoming popular for large scale data intensive distributed applications due to its efficiency, simplicity and ease of use, and therefore the proposed algorithm develops a fast parallel distributed batch set intersection method for finding frequent items. Two implementations (Weka, Hadoop) of the proposed MR association rule algorithm have been developed and a number of experiments against small, medium and large data collections have been conducted. The ground bases of the comparisons are time required by the algorithm for: data initialisation, frequent items discovery, rule generation, etc. The results show that MR-ARM is very useful tool for mining association rules from large datasets in a distributed environment.

Download Full-text

Association Rule and Quantitative Association Rule Mining among Infrequent Items

Rare Association Rule Mining and Knowledge Discovery ◽

10.4018/978-1-60566-754-6.ch002 ◽

2010 ◽

pp. 15-32 ◽

Cited By ~ 1

Author(s):

Ling Zhou ◽

Stephen Yau

Keyword(s):

Data Mining ◽

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Rule Mining ◽

Transactional Databases ◽

Frequent Items ◽

Increasing Demand ◽

Quantitative Association Rule

Association rule mining among frequent items has been extensively studied in data mining research. However, in recent years, there is an increasing demand for mining infrequent items (such as rare but expensive items). Since exploring interesting relationships among infrequent items has not been discussed much in the literature, in this chapter, the authors propose two simple, practical and effective schemes to mine association rules among rare items. Their algorithms can also be applied to frequent items with bounded length. Experiments are performed on the well-known IBM synthetic database. The authors’ schemes compare favorably to Apriori and FP-growth under the situation being evaluated. In addition, they explore quantitative association rule mining in transactional databases among infrequent items by associating quantities of items: some interesting examples are drawn to illustrate the significance of such mining.

Download Full-text

Ordering Policy and Inventory Classification Using Temporal Association Rule Mining

International Journal of Productivity Management and Assessment Technologies ◽

10.4018/ijpmat.2018010103 ◽

2018 ◽

Vol 6 (1) ◽

pp. 37-49 ◽

Cited By ~ 1

Author(s):

Reshu Agarwal

Keyword(s):

Inventory Management ◽

Association Rule ◽

Association Rule Mining ◽

Temporal Association ◽

Order Quantity ◽

Rule Mining ◽

Ordering Policy ◽

Temporal Association Rule ◽

Optimum Order ◽

Frequent Items

A modified framework that applies temporal association rule mining to inventory management is proposed in this article. The ordering policy of frequent items is determined and inventory is classified based on loss rule. This helps inventory managers to determine optimum order quantity of frequent items together with the most profitable item in each time-span. An example is illustrated to validate the results.

Download Full-text

Large-Scale Loop Detector Troubleshooting Using Clustering and Association Rule Mining

Journal of Transportation Engineering Part A Systems ◽

10.1061/jtepbs.0000387 ◽

2020 ◽

Vol 146 (7) ◽

pp. 04020064 ◽

Cited By ~ 1

Author(s):

Amin Ariannezhad ◽

Yao-Jan Wu

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Large Scale ◽

Rule Mining ◽

Loop Detector

Download Full-text

Parallel SQL Based Association Rule Mining on Large Scale PC Cluster: Performance Comparison with Directly Coded C Implementation

Methodologies for Knowledge Discovery and Data Mining - Lecture Notes in Computer Science ◽

10.1007/3-540-48912-6_12 ◽

1999 ◽

pp. 94-99 ◽

Cited By ~ 4

Author(s):

Iko Pramudiono ◽

Takahiko Shintani ◽

Takayuki Tamura ◽

Masaru Kitsuregawa

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Large Scale ◽

Performance Comparison ◽

Rule Mining ◽

Cluster Performance ◽

Pc Cluster

Download Full-text

Performance Analysis for Parallel Generalized Association Rule Mining on a Large Scale PC Cluster

Euro-Par’99 Parallel Processing - Lecture Notes in Computer Science ◽

10.1007/3-540-48311-x_206 ◽

1999 ◽

pp. 1455-1459 ◽

Cited By ~ 3

Author(s):

Takahiko Shintani ◽

Masato Oguchi ◽

Masaru Kitsuregawa

Keyword(s):

Performance Analysis ◽

Association Rule ◽

Association Rule Mining ◽

Large Scale ◽

Rule Mining ◽

Pc Cluster

Download Full-text

A Fault Diagnosis Method Based on Constrained Frequent Pattern Trees

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.39.449 ◽

2010 ◽

Vol 39 ◽

pp. 449-454

Author(s):

Jiang Hui Cai ◽

Wen Jun Meng ◽

Zhi Mei Chen

Keyword(s):

Fault Diagnosis ◽

Association Rule ◽

Association Rule Mining ◽

Large Scale ◽

Predicate Logic ◽

Frequent Pattern ◽

Rule Mining ◽

First Order ◽

Broad Term ◽

Diagnosis Method

Data mining is a broad term used to describe various methods for discovering patterns in data. A kind of pattern often considered is association rules, probabilistic rules stating that objects satisfying description A also satisfy description B with certain support and confidence. In this study, we first make use of the first-order predicate logic to represent knowledge derived from celestial spectra data. Next, we propose a concept of constrained frequent pattern trees (CFP) along with an algorithm used to construct CFPs, aiming to improve the efficiency and pertinence of association rule mining. The running results show that it is feasible and valuable to apply this method to mining the association rule and the improved algorithm can decrease related computation quantity in large scale and improve the efficiency of the algorithm. Finally, the simulation results of knowledge acquisition for fault diagnosis also show the validity of CFP algorithm.

Download Full-text

Ordering Policy Using Multi-Level Association Rule Mining

International Journal of Information Systems and Supply Chain Management ◽

10.4018/ijisscm.2018100105 ◽

2018 ◽

Vol 11 (4) ◽

pp. 84-101 ◽

Cited By ~ 1

Author(s):

Reshu Agarwal ◽

Sarla Pareek ◽

Biswajit Sarkar ◽

Mandeep Mittal

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Order Quantity ◽

Rule Mining ◽

Ordering Policy ◽

Optimum Order ◽

Frequent Items ◽

Multi Level ◽

Real Scenario ◽

Frequent Item Sets

In this article, an inventory model for a retailer's ordering policy is studied. Multi-level association rule mining is used to find frequent item-sets at each level by applying different threshold at different levels. During order quantity estimation, category, content, and brand of the items are considered, which leads to the discovery of more specific and concrete knowledge of the required order quantity. At each level, optimum order quantity of frequent items is determined. This assists inventory manager to order optimal quantity of items as per the actual requirement of the item with respect to their category, content and brand. An example is devised to explain the new approach. Further, to understand the effect of above approach in the real scenario, experiments are conducted on the exiting dataset.

Download Full-text

Research on Association Rule Mining Algorithm Based on Distributed Data

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.998-999.899 ◽

2014 ◽

Vol 998-999 ◽

pp. 899-902 ◽

Cited By ~ 1

Author(s):

Cheng Luo ◽

Ying Chen

Keyword(s):

Data Mining ◽

Association Rule ◽

Association Rule Mining ◽

Large Scale ◽

Frequent Itemsets ◽

Network Communication ◽

Data Mining Algorithm ◽

Distributed Data ◽

Rule Mining ◽

Mining Algorithm

Existing data miming algorithms have mostly implemented data mining under centralized environment, but the large-scale database exists in the distributed form. According to the existing problem of the distributed data mining algorithm FDM and its improved algorithms, which exist the problem that the frequent itemsets are lost and network communication cost too much. This paper proposes a association rule mining algorithm based on distributed data (ARADD). The mapping marks the array mechanism is included in the ARADD algorithm, which can not only keep the integrity of the frequent itemsets, but also reduces the cost of network communication. The efficiency of algorithm is proved in the experiment.

Download Full-text