Parallel SQL Based Association Rule Mining on Large Scale PC Cluster: Performance Comparison with Directly Coded C Implementation

An effective data mining method to automatically extract association rules between manufacturing capabilities and product features from the available historical data is essential for an efficient and cost-effective product development and production. This paper proposes a new binary particle swarm optimization- (BPSO-) based association rule mining (BPSO-ARM) method for discovering the hidden relationships between machine capabilities and product features. In particular, BPSO-ARM does not need to predefine thresholds of minimum support and confidence, which improves its applicability in real-world industrial cases. Moreover, a novel overlapping measure indication is further proposed to eliminate those lower quality rules to further improve the applicability of BPSO-ARM. The effectiveness of BPSO-ARM is demonstrated on a benchmark case and an industrial case about the automotive part manufacturing. The performance comparison indicates that BPSO-ARM outperforms other regular methods (e.g., Apriori) for ARM. The experimental results indicate that BPSO-ARM is capable of discovering important association rules between machine capabilities and product features. This will help support planners and engineers for the new product design and manufacturing.

Download Full-text

Large-Scale Loop Detector Troubleshooting Using Clustering and Association Rule Mining

Journal of Transportation Engineering Part A Systems ◽

10.1061/jtepbs.0000387 ◽

2020 ◽

Vol 146 (7) ◽

pp. 04020064 ◽

Cited By ~ 1

Author(s):

Amin Ariannezhad ◽

Yao-Jan Wu

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Large Scale ◽

Rule Mining ◽

Loop Detector

Download Full-text

A Fault Diagnosis Method Based on Constrained Frequent Pattern Trees

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.39.449 ◽

2010 ◽

Vol 39 ◽

pp. 449-454

Author(s):

Jiang Hui Cai ◽

Wen Jun Meng ◽

Zhi Mei Chen

Keyword(s):

Fault Diagnosis ◽

Association Rule ◽

Association Rule Mining ◽

Large Scale ◽

Predicate Logic ◽

Frequent Pattern ◽

Rule Mining ◽

First Order ◽

Broad Term ◽

Diagnosis Method

Data mining is a broad term used to describe various methods for discovering patterns in data. A kind of pattern often considered is association rules, probabilistic rules stating that objects satisfying description A also satisfy description B with certain support and confidence. In this study, we first make use of the first-order predicate logic to represent knowledge derived from celestial spectra data. Next, we propose a concept of constrained frequent pattern trees (CFP) along with an algorithm used to construct CFPs, aiming to improve the efficiency and pertinence of association rule mining. The running results show that it is feasible and valuable to apply this method to mining the association rule and the improved algorithm can decrease related computation quantity in large scale and improve the efficiency of the algorithm. Finally, the simulation results of knowledge acquisition for fault diagnosis also show the validity of CFP algorithm.

Download Full-text

Research on Association Rule Mining Algorithm Based on Distributed Data

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.998-999.899 ◽

2014 ◽

Vol 998-999 ◽

pp. 899-902 ◽

Cited By ~ 1

Author(s):

Cheng Luo ◽

Ying Chen

Keyword(s):

Data Mining ◽

Association Rule ◽

Association Rule Mining ◽

Large Scale ◽

Frequent Itemsets ◽

Network Communication ◽

Data Mining Algorithm ◽

Distributed Data ◽

Rule Mining ◽

Mining Algorithm

Existing data miming algorithms have mostly implemented data mining under centralized environment, but the large-scale database exists in the distributed form. According to the existing problem of the distributed data mining algorithm FDM and its improved algorithms, which exist the problem that the frequent itemsets are lost and network communication cost too much. This paper proposes a association rule mining algorithm based on distributed data (ARADD). The mapping marks the array mechanism is included in the ARADD algorithm, which can not only keep the integrity of the frequent itemsets, but also reduces the cost of network communication. The efficiency of algorithm is proved in the experiment.

Download Full-text

Dynamic load balancing of large-scale distributed association rule mining

2011 IEEE International Conference on Computer Applications and Industrial Electronics (ICCAIE) ◽

10.1109/iccaie.2011.6162196 ◽

2011 ◽

Author(s):

Raja Tlili ◽

Yahya Slimani

Keyword(s):

Load Balancing ◽

Dynamic Load ◽

Association Rule ◽

Association Rule Mining ◽

Large Scale ◽

Dynamic Load Balancing ◽

Rule Mining ◽

Distributed Association

Download Full-text

Human Resource Allocation Based on Fuzzy Data Mining Algorithm

Complexity ◽

10.1155/2021/9489114 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

You Wu ◽

Zheng Wang ◽

Shengqi Wang

Keyword(s):

Data Mining ◽

Human Resource ◽

Business Process ◽

Association Rule ◽

Association Rule Mining ◽

Large Scale ◽

Data Mining Algorithm ◽

Rule Mining ◽

Mining Algorithm ◽

Database Technology

Data mining is currently a frontier research topic in the field of information and database technology. It is recognized as one of the most promising key technologies. Data mining involves multiple technologies, such as mathematical statistics, fuzzy theory, neural networks, and artificial intelligence, with relatively high technical content. The realization is also difficult. In this article, we have studied the basic concepts, processes, and algorithms of association rule mining technology. Aiming at large-scale database applications, in order to improve the efficiency of data mining, we proposed an incremental association rule mining algorithm based on clustering, that is, using fast clustering. First, the feasibility of realizing performance appraisal data mining is studied; then, the business process needed to realize the information system is analyzed, the business process-related links and the corresponding data input interface are designed, and then the data process to realize the data processing is designed, including data foundation and database model. Aiming at the high efficiency of large-scale database mining, database development tools are used to implement the specific system settings and program design of this algorithm. Incorporated into the human resource management system of colleges and universities, they carried out successful association broadcasting, realized visualization, and finally discovered valuable information.

Download Full-text