A Novel and Improved Apriori Algorithm

The Apriori algorithm is the classical algorithm for mining association rules. Because it must scan the database many times, it runs slowly. To improve running efficiency, this paper proposes an improved Apriori algorithm based on an analysis of the original. The idea is to transform the transaction database into a corresponding 0-1 matrix. The inner product of each vector with each subsequent vector gives the support. Each support is compared with the given minimum support (minsupport), and rows and columns whose values fall below minsupport are deleted, which reduces the size of the matrix and improves the running speed. Because the improved algorithm scans the database only once, it runs faster. Experiments also show that the improved algorithm is efficient and feasible.
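
The inner-product idea in this abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: the layout (rows as transactions, columns as items), the function names, and the rule of dropping rows with fewer than two remaining items are all assumptions made for the example.

```python
from itertools import combinations

def column(matrix, j):
    """Return column j of a 0-1 matrix, i.e. the occurrence vector of item j."""
    return [row[j] for row in matrix]

def inner(u, v):
    """Inner product of two 0-1 vectors: the number of transactions containing both items."""
    return sum(a * b for a, b in zip(u, v))

def prune(matrix, items, min_support):
    """Delete infrequent item columns, then rows that can no longer support any pair.

    Assumption: a row with fewer than 2 remaining items cannot contribute
    to a 2-itemset, so it is dropped before pair counting.
    """
    keep = [j for j in range(len(items)) if sum(column(matrix, j)) >= min_support]
    kept_items = [items[j] for j in keep]
    reduced = [[row[j] for j in keep] for row in matrix]
    reduced = [row for row in reduced if sum(row) >= 2]
    return reduced, kept_items

def frequent_pairs(matrix, items, min_support):
    """Support of every item pair via inner products on the pruned matrix."""
    matrix, items = prune(matrix, items, min_support)
    cols = [column(matrix, j) for j in range(len(items))]
    result = {}
    for i, j in combinations(range(len(items)), 2):
        s = inner(cols[i], cols[j])
        if s >= min_support:
            result[(items[i], items[j])] = s
    return result

# Hypothetical toy database: 5 transactions over items a, b, c, d.
ITEMS = ["a", "b", "c", "d"]
MATRIX = [
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [1, 1, 1, 0],
]

PAIRS = frequent_pairs(MATRIX, ITEMS, min_support=3)
# PAIRS == {("a", "b"): 3, ("a", "c"): 3}
```

Here the column for item d (count 1) is deleted first, which in turn makes the fourth transaction too short to matter, so the pair counting runs on a smaller matrix.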

Improvement of Apriori Algorithm Based on Boolean Matrix

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.159.144 ◽

2010 ◽

Vol 159 ◽

pp. 144-148

Author(s):

Bin Li Wang ◽

Yan Guang Shen

Keyword(s):

Data Mining ◽

Association Rules ◽

Boolean Matrix ◽

Apriori Algorithm ◽

System Cost ◽

Transaction Database

This paper introduces association rules and the Apriori algorithm in data mining. Considering the disadvantages of the Apriori algorithm, it proposes a new improved Apriori algorithm based on a Boolean matrix. The algorithm scans the transaction database only once, which reduces system cost and increases the efficiency of data mining.
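
As a rough illustration of the single-scan claim, the sketch below builds a Boolean matrix in one pass over the transactions and then answers support queries from the matrix alone. The function names and the toy data are hypothetical; the abstract does not describe the paper's actual data structures.

```python
def build_boolean_matrix(transactions):
    """Build (items, matrix) with exactly one pass over the transactions."""
    index = {}   # item -> column position, assigned the first time the item is seen
    rows = []    # per transaction: set of column positions that are True
    for t in transactions:           # the only scan of the database
        cols = set()
        for item in t:
            if item not in index:
                index[item] = len(index)
            cols.add(index[item])
        rows.append(cols)
    items = list(index)              # dict insertion order gives the column order
    # Expanding to a dense matrix works on the in-memory rows, not the database.
    matrix = [[j in cols for j in range(len(items))] for cols in rows]
    return items, matrix

def support_count(matrix, items, itemset):
    """Count rows in which every item of the itemset is True."""
    cols = [items.index(i) for i in itemset]
    return sum(all(row[j] for j in cols) for row in matrix)

# Hypothetical toy database.
TRANSACTIONS = [
    ["bread", "milk"],
    ["bread", "butter"],
    ["milk"],
    ["bread", "milk", "butter"],
]

ITEMS, MATRIX = build_boolean_matrix(TRANSACTIONS)
BREAD_MILK = support_count(MATRIX, ITEMS, ["bread", "milk"])  # 2
```

Once the matrix exists, every later support query reads only the matrix, which is where the saving in database I/O would come from.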

Improved Apriori Algorithm Based on Compressing Transactional Matrix Multiplication

Key Engineering Materials ◽

10.4028/www.scientific.net/kem.460-461.409 ◽

2011 ◽

Vol 460-461 ◽

pp. 409-413

Author(s):

Yue Shun He ◽

Ping Du

Keyword(s):

Association Rules ◽

Matrix Multiplication ◽

Apriori Algorithm ◽

Frequent Item ◽

Classical Algorithm ◽

Mining Association Rules ◽

Low Efficiency ◽

Frequent Item Sets ◽

Improved Algorithm

The Apriori algorithm is one of the most classical algorithms for association rules, but its efficiency is low. First, it must scan the database repeatedly, which is costly in I/O. Second, it creates a large number of candidate 2-itemsets when generating frequent 2-itemsets. Third, it does not discard useless itemsets when generating frequent k-itemsets. This paper describes an improved algorithm based on compressed matrices that improves the generation of frequent k-itemsets in these three respects: it scans the database only once, compresses the transactional matrix, and obtains the frequent itemsets by matrix multiplication, which effectively improves the efficiency of mining association rules.
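
One common way to obtain itemset supports by matrix multiplication is to multiply the transposed 0-1 transaction matrix by itself. The sketch below shows that formulation; it is an assumption about what the abstract means, the helper names are hypothetical, and the paper's compression step is not reproduced.

```python
def transpose(m):
    """Swap rows and columns of a list-of-lists matrix."""
    return [list(col) for col in zip(*m)]

def matmul(a, b):
    """Plain matrix product of two list-of-lists matrices."""
    bt = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in bt] for row in a]

def cooccurrence(m):
    """M^T * M for a 0-1 matrix M (rows = transactions, columns = items).

    Entry [i][j] is the number of transactions containing both item i and
    item j; the diagonal holds the support count of each single item.
    """
    return matmul(transpose(m), m)

def frequent_pairs(c, min_support):
    """Read frequent 2-itemsets off the upper triangle of the co-occurrence matrix."""
    n = len(c)
    return [(i, j, c[i][j]) for i in range(n) for j in range(i + 1, n)
            if c[i][j] >= min_support]

# Hypothetical toy matrix: 4 transactions, 3 items.
M = [
    [1, 1, 0],
    [1, 1, 1],
    [1, 0, 1],
    [1, 1, 0],
]

C = cooccurrence(M)              # [[4, 3, 2], [3, 3, 1], [2, 1, 2]]
PAIRS = frequent_pairs(C, 2)     # [(0, 1, 3), (0, 2, 2)]
```

A single product yields the supports of all candidate 2-itemsets at once, so no candidate list has to be generated and counted separately.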

Improved Apriori Algorithm in the Power System Fault

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.556-562.1510 ◽

2014 ◽

Vol 556-562 ◽

pp. 1510-1514

Author(s):

Li Qiang Lin ◽

Hong Wen Yan

Keyword(s):

Power System ◽

Association Rules ◽

Fault Analysis ◽

The Other ◽

Apriori Algorithm ◽

Extreme Climate ◽

Time Space ◽

Low Efficiency ◽

Candidate Item ◽

Improved Algorithm

To address the low efficiency of candidate itemset generation in the Apriori algorithm, this paper presents a method based on property division. The improved Apriori algorithm is compared with another algorithm and applied to power system accident cases in extreme climate. The experimental results show that the improved algorithm significantly reduces the time needed to generate candidate itemsets. It can also find association rules among time, space, disasters, and faulty facilities in power system accident cases in extreme climate, which is very useful in power system fault analysis.

ANALISIS POLA PENJUALAN DENGAN MENGGUNAKAN ALGORITMA APRIORI PADA KOPERASI KARYAWAN YAYASAN ANAKKU [Sales Pattern Analysis Using the Apriori Algorithm at the Anakku Foundation Employee Cooperative]

Jurnal Riset Informatika ◽

10.34288/jri.v2i1.68 ◽

2019 ◽

Vol 2 (1) ◽

pp. 31-36

Author(s):

Arfianto Darmawan ◽

Titin Kristiana

Keyword(s):

Data Mining ◽

Strong Association ◽

Apriori Algorithm ◽

Business Unit ◽

Savings And Loans ◽

Transaction Data ◽

Existing Problems ◽

A Value ◽

Transaction Database ◽

Using Data

The Anakku Foundation Cooperative is a multi-business cooperative consisting of a shop, savings and loans, and student shuttle services. Every sale of goods or services is entered directly into the data of the relevant business unit. The cooperative still has problems: store transactions cannot yet show which items are sold most often, and it is difficult to determine which stock items are still available or almost running out. Data mining techniques have been widely used to overcome such problems, one of which is the Apriori algorithm, applied to obtain information about the associations between products from a transaction database. Transaction data on school equipment sales at the Anakku Foundation Employee Cooperative can be reprocessed with data mining applications to produce strong association rules between itemsets of school supplies. These rules can provide recommendations for item arrangement and simplify the placement of strongly interdependent items. The rule with the highest support and confidence is: if a customer buys MUSLIM L1.5P1, they also buy AL-IZHAR II LOGO, with 14.5% support and 79.5% confidence.
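
For readers unfamiliar with the two measures reported here, the sketch below computes support and confidence for a rule "if A then B" using their standard definitions. The item names and transactions are hypothetical toy data and do not reproduce the paper's dataset or its 14.5% and 79.5% figures.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= set(t) for t in transactions) / len(transactions)

def confidence(transactions, antecedent, consequent):
    """support(A and B) / support(A): how often B appears when A does."""
    both = support(transactions, set(antecedent) | set(consequent))
    ante = support(transactions, antecedent)
    return both / ante if ante else 0.0

# Hypothetical toy transactions (not from the paper).
TRANSACTIONS = [
    {"uniform", "badge"},
    {"uniform", "badge"},
    {"uniform"},
    {"notebook"},
    {"badge", "notebook"},
]

RULE_SUPPORT = support(TRANSACTIONS, {"uniform", "badge"})           # 2 of 5 transactions
RULE_CONFIDENCE = confidence(TRANSACTIONS, {"uniform"}, {"badge"})   # 2 of the 3 with "uniform"
```

A rule counts as strong when both values meet thresholds chosen by the analyst; the thresholds themselves are not part of the definitions.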

IMPLEMENTASI ALGORITMA APRIORI TERHADAP DATA PENJUALAN PADA PERUSAHAAN RETAIL [Implementation of the Apriori Algorithm on Sales Data at a Retail Company]

Jurnal Pilar Nusa Mandiri ◽

10.33480/pilar.v15i1.113 ◽

2019 ◽

Vol 15 (1) ◽

pp. 85-90 ◽

Cited By ~ 3

Author(s):

Jordy Lasmana Putra ◽

Mugi Raharjo ◽

Tommi Alfian Armawan Sandi ◽

Ridwan Ridwan ◽

Rizal Prasetyo

Keyword(s):

Data Mining ◽

Association Rules ◽

A Priori ◽

Apriori Algorithm ◽

Business World ◽

Transaction Data ◽

Retail Company ◽

Special Strategy

The business world is developing rapidly, so a company, in this case a retail company, needs a special strategy to increase its turnover. One way to increase turnover is the data mining process, for example using the Apriori algorithm. The Apriori algorithm finds association rules that can later be used as patterns of consumer purchases. This study uses a repository of 209 records consisting of 23 transactions and 164 attributes. The results show that the item named CREAM CUPID HEART COAT HANGER is the product most often purchased by consumers. Knowing the purchasing patterns of consumers, company management can increase turnover by referring to the results of processing sales transaction data with the Apriori algorithm.

Finding Active Membership Functions for Genetic-Fuzzy Data Mining

International Journal of Information Technology & Decision Making ◽

10.1142/s0219622015500297 ◽

2015 ◽

Vol 14 (06) ◽

pp. 1215-1242 ◽

Cited By ~ 3

Author(s):

Chun-Hao Chen ◽

Tzung-Pei Hong ◽

Yeong-Chyi Lee ◽

Vincent S. Tseng

Keyword(s):

Data Mining ◽

Association Rules ◽

Fuzzy Data ◽

Membership Functions ◽

Linguistic Terms ◽

Fuzzy Association Rules ◽

Fitness Functions ◽

The Given ◽

Binary Strings ◽

Active Membership

Since transactions may contain quantitative values, many approaches have been proposed to derive membership functions for mining fuzzy association rules using genetic algorithms (GAs), a process known as genetic-fuzzy data mining. However, existing approaches assume that the number of linguistic terms is predefined. Thus, this study proposes a genetic-fuzzy mining approach for extracting an appropriate number of linguistic terms and their membership functions used in fuzzy data mining for the given items. The proposed algorithm adjusts membership functions using GAs and then uses them to fuzzify the quantitative transactions. Each individual in the population represents a possible set of membership functions for the items and is divided into two parts, control genes (CGs) and parametric genes (PGs). CGs are encoded into binary strings and used to determine whether membership functions are active. Each set of membership functions for an item is encoded as PGs with real-number schema. In addition, seven fitness functions are proposed, each of which is used to evaluate the goodness of the obtained membership functions and used as the evolutionary criteria in GA. After the GA process terminates, a better set of association rules with a suitable set of membership functions is obtained. Experiments are made to show the effectiveness of the proposed approach.

Mining Allocating Patterns in Investment Portfolios

Data Mining Applications for Empowering Knowledge Societies ◽

10.4018/978-1-59904-657-0.ch007 ◽

2009 ◽

pp. 110-135

Author(s):

Yanbo J. Wang ◽

Xinwei Zheng ◽

Frans Coenen

Keyword(s):

Data Mining ◽

Portfolio Management ◽

Association Rule ◽

Common Type ◽

Experimental Results ◽

Transaction Database ◽

Investment Portfolios ◽

The Given

An association rule (AR) is a common type of mined knowledge in data mining that describes an implicative co-occurring relationship between two sets of binary-valued transaction-database attributes, expressed in the form of an ⟨antecedent⟩ ⇒ ⟨consequent⟩ rule. A variation of ARs is the weighted association rule (WAR), which addresses the weighting issue in ARs. In this chapter, the authors introduce the concept of the "one-sum" WAR and name such WARs allocating patterns (ALPs). An algorithm is proposed to extract hidden and interesting ALPs from data. The authors further indicate that ALPs can be applied in portfolio management. First, a collection of investment portfolios is modelled as a one-sum weighted transaction database that contains hidden ALPs. Second, the authors show that ALPs mined from the given portfolio data can be applied to guide future investment activities. The experimental results show good performance, which demonstrates the effectiveness of using ALPs in the proposed application.

Research of Association Rules Algorithm Based on Matrix under Cloud Computing

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.568-570.798 ◽

2014 ◽

Vol 568-570 ◽

pp. 798-801

Author(s):

Ye Qing Xiong ◽

Shu Dong Zhang

Keyword(s):

Data Mining ◽

Cloud Computing ◽

Big Data ◽

Association Rules ◽

Apriori Algorithm ◽

Computing Technology ◽

Binary Matrix ◽

Transaction Data ◽

Big Data Mining ◽

Parallel Mining

Traditional association rule algorithms run into time and space performance bottlenecks when applied to big data mining. This paper proposes a matrix-based parallel algorithm under cloud computing to improve the Apriori algorithm. The algorithm stores transaction data in a binary matrix, replaces the join between itemsets with a matrix "and" operation, and uses cloud computing technology to mine frequent itemsets in parallel. Simulations under different conditions show that it improves efficiency, resolves the performance bottleneck, and can be widely used in big data mining with strong scalability and stability.
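
The "and" operation mentioned in this abstract can be illustrated on a single machine with bitmaps: each itemset keeps one bit per transaction, and joining two itemsets is a bitwise AND of their bitmaps. The sketch below is an assumption-laden simplification with hypothetical function names; it uses Python integers as bit vectors and leaves out the cloud and parallel parts entirely.

```python
from itertools import combinations

def item_bitmaps(transactions):
    """One pass: map each item to an integer whose bit t is set if transaction t has it."""
    bitmaps = {}
    for tid, t in enumerate(transactions):
        for item in t:
            bitmaps[item] = bitmaps.get(item, 0) | (1 << tid)
    return bitmaps

def popcount(x):
    """Number of set bits, i.e. the support count encoded in a bitmap."""
    return bin(x).count("1")

def frequent_itemsets(transactions, min_support):
    """Level-wise mining where the itemset join is a bitwise AND of bitmaps."""
    bitmaps = item_bitmaps(transactions)
    level = {frozenset([i]): b for i, b in bitmaps.items()
             if popcount(b) >= min_support}
    result = {}
    while level:
        result.update({s: popcount(b) for s, b in level.items()})
        nxt = {}
        for (s1, b1), (s2, b2) in combinations(level.items(), 2):
            union = s1 | s2
            # Join only itemsets that differ in exactly one item.
            if len(union) == len(s1) + 1 and union not in nxt:
                b = b1 & b2          # transactions containing every item of the union
                if popcount(b) >= min_support:
                    nxt[union] = b
        level = nxt
    return result

# Hypothetical toy database.
TRANSACTIONS = [
    {"a", "b", "c"},
    {"a", "b"},
    {"a", "c"},
    {"b", "d"},
    {"a", "b", "c"},
]

FREQUENT = frequent_itemsets(TRANSACTIONS, min_support=3)
# {a}: 4, {b}: 4, {c}: 3, {a, b}: 3, {a, c}: 3
```

Because each AND touches only two bitmaps, the joins at one level are independent of each other, which is what would make them natural units of parallel work.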

The Application of Apriori Algorithm in Analysis on Admitted Students of Colleges and Universities

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.321-324.2578 ◽

2013 ◽

Vol 321-324 ◽

pp. 2578-2582

Author(s):

Qian Zhang

Keyword(s):

Data Mining ◽

Association Rules ◽

Colleges And Universities ◽

Apriori Algorithm ◽

Data Mining Techniques ◽

Minimum Support ◽

Sample Data ◽

Mining Association Rules

This paper examines the application of the Apriori algorithm to extracting association rules from sample data on student enrollments. It studies data mining techniques for extracting association rules, analyzes the correlation between specialties and the characteristics of admitted students, and evaluates the algorithm for mining association rules with a minimum support of 30% and a minimum confidence of 40%.

Research into the Algorithm of Frequent Pattern Mining Based on across Linker

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.195-196.984 ◽

2012 ◽

Vol 195-196 ◽

pp. 984-986

Author(s):

Ming Ru Zhao ◽

Yuan Sun ◽

Jian Guo ◽

Ping Ping Dong

Keyword(s):

Data Mining ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Itemsets ◽

Frequent Pattern ◽

Apriori Algorithm ◽

Important Data ◽

Classical Algorithm ◽

Frequent Itemsets Mining ◽

Mining Frequent Itemsets

Frequent itemset mining is an important data mining task and a focus of data mining research. The Apriori algorithm is one of the most important algorithms for mining frequent itemsets. However, it scans the database too many times, so its efficiency is relatively low. This paper therefore studies a frequent itemset mining algorithm based on an across linker. Compared with the classical algorithm, the improved algorithm has obvious advantages.
