Research on Data Mining Based on Apriori Algorithm

2012 ◽  
Vol 532-533 ◽  
pp. 1675-1679
Author(s):  
Pei Ji Wang ◽  
Yu Lin Zhao

With the availability of inexpensive storage and progress in data collection tools, many organizations have created large databases of business and scientific data, which creates a pressing need and great opportunities for mining interesting knowledge from data. Mining association rules is an important topic in data mining research. This paper studies an algorithm for mining frequent itemsets based on a recognizable matrix and an algorithm for mining association rules based on an improved measure system. The methods are then used to mine association rules from a student data table under Visual FoxPro 6.0.

2005 ◽  
Vol 1 (3) ◽  
pp. 129-135
Author(s):  
Jun Luo ◽  
Sanguthevar Rajasekaran

Association rule mining is an important data mining problem that has been studied extensively. In this paper, a simple but Fast algorithm for Intersecting attributes lists using hash Tables (FIT) is presented. FIT is designed for efficiently computing all the frequent itemsets in large databases. It deploys an idea similar to Eclat but has much better computational performance than Eclat for two reasons: 1) FIT makes fewer comparisons in total for each intersection operation between two attributes lists, and 2) FIT significantly reduces the total number of intersection operations. Our experimental results demonstrate that the performance of FIT is much better than that of the Eclat and Apriori algorithms.
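The Eclat-style idea that the abstract refers to can be illustrated with a minimal sketch: each item is mapped to the set of transaction ids that contain it, and the support of a pair is the size of the intersection of two such lists. This is not the FIT algorithm itself, whose details the abstract does not give. The function names `vertical_layout` and `frequent_pairs` and the toy transactions are assumptions made for illustration, and Python's built-in `set` stands in for the hash tables.

```python
from itertools import combinations

def vertical_layout(transactions):
    """Map each item to the set of transaction ids (its tid-list)."""
    tidlists = {}
    for tid, items in enumerate(transactions):
        for item in items:
            tidlists.setdefault(item, set()).add(tid)
    return tidlists

def frequent_pairs(transactions, min_count):
    """Return frequent 2-itemsets by intersecting tid-lists (hypothetical helper)."""
    tidlists = vertical_layout(transactions)
    # Keep only items that are frequent on their own.
    frequent = {i: t for i, t in tidlists.items() if len(t) >= min_count}
    result = {}
    for a, b in combinations(sorted(frequent), 2):
        common = frequent[a] & frequent[b]  # hash-based set intersection
        if len(common) >= min_count:
            result[(a, b)] = len(common)
    return result

# Toy data, invented for this example.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "milk"},
    {"bread", "butter"},
]
pairs = frequent_pairs(transactions, min_count=2)
print(pairs)
```

With this layout the database is read once to build the tid-lists, and all later support counts come from intersections alone.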


2013 ◽  
Vol 321-324 ◽  
pp. 2578-2582
Author(s):  
Qian Zhang

This paper examined the application of the Apriori algorithm to extracting association rules in data mining, using sample data on student enrollments. It studied the data mining techniques for extraction of association rules, analyzed the correlation between specialties and characteristics of admitted students, and evaluated the algorithm for mining association rules, in which the minimum support was 30% and the minimum confidence was 40%.
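How a rule is checked against the two thresholds can be sketched as follows. Only the 30% minimum support and 40% minimum confidence come from the abstract. The enrollment records, the attribute labels, and the helper names `support`, `confidence`, and `rule_passes` are hypothetical.

```python
def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = frozenset(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(antecedent, consequent, transactions):
    """support(antecedent and consequent together) / support(antecedent)."""
    s_a = support(antecedent, transactions)
    if s_a == 0:
        return 0.0
    return support(set(antecedent) | set(consequent), transactions) / s_a

def rule_passes(antecedent, consequent, transactions,
                min_support=0.30, min_confidence=0.40):
    """True if the rule antecedent -> consequent meets both thresholds."""
    s = support(set(antecedent) | set(consequent), transactions)
    c = confidence(antecedent, consequent, transactions)
    return s >= min_support and c >= min_confidence

# Hypothetical enrollment records (specialty plus one student characteristic).
records = [
    {"major=CS", "region=urban"},
    {"major=CS", "region=urban"},
    {"major=CS", "region=rural"},
    {"major=Art", "region=urban"},
    {"major=Art", "region=rural"},
]
print(rule_passes({"region=urban"}, {"major=CS"}, records))
print(rule_passes({"major=Art"}, {"region=rural"}, records))
```

In this toy data the first rule has support 2/5 and confidence 2/3, so it passes; the second has support 1/5 and is rejected on support alone.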


2012 ◽  
Vol 195-196 ◽  
pp. 984-986
Author(s):  
Ming Ru Zhao ◽  
Yuan Sun ◽  
Jian Guo ◽  
Ping Ping Dong

Frequent itemset mining is an important data mining task and a central theme in data mining research. The Apriori algorithm is one of the most important algorithms for mining frequent itemsets. However, the Apriori algorithm scans the database too many times, so its efficiency is relatively low. The paper therefore studies a frequent itemset mining algorithm based on an across linker. Compared with the classical algorithm, the improved algorithm has obvious advantages.
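The scanning cost of classical Apriori can be seen in a minimal level-wise sketch that counts one pass over the transactions per candidate level. This is the textbook algorithm, not the linker-based improvement, which the abstract does not describe. The function name `apriori`, the `scans` counter, and the toy data are assumptions made for illustration.

```python
from itertools import combinations

def apriori(transactions, min_count):
    """Level-wise Apriori; returns (frequent itemsets with counts, counting passes)."""
    transactions = [frozenset(t) for t in transactions]
    # Level-1 candidates: every distinct item (collecting them is not counted as a pass).
    candidates = {frozenset([i]) for t in transactions for i in t}
    frequent = {}
    scans = 0
    k = 1
    while candidates:
        scans += 1  # one full pass over the database per level
        counts = {c: 0 for c in candidates}
        for t in transactions:
            for c in candidates:
                if c <= t:
                    counts[c] += 1
        level = {c: n for c, n in counts.items() if n >= min_count}
        frequent.update(level)
        k += 1
        # Join step: merge frequent (k-1)-itemsets, then prune any candidate
        # that has an infrequent (k-1)-subset.
        candidates = set()
        for a, b in combinations(level, 2):
            u = a | b
            if len(u) == k and all(frozenset(s) in level
                                   for s in combinations(u, k - 1)):
                candidates.add(u)
    return frequent, scans

# Toy data, invented for this example.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "milk"},
    {"bread", "butter"},
]
frequent, scans = apriori(transactions, min_count=2)
print(len(frequent), scans)
```

Here three passes are needed even for four transactions: the third pass only discovers that the single 3-item candidate is infrequent.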


SinkrOn ◽  
2020 ◽  
Vol 4 (2) ◽  
pp. 1 ◽  
Author(s):  
Rusdiansyah Rusdiansyah ◽  
Nining Suharyanti ◽  
Triningsih Triningsih ◽  
Muhammad Darussalam

Pizza is a processed food that originated in Italy and has spread to many other countries, including Indonesia. It is currently sought after by many groups of people, which makes pizza a very profitable food business opportunity. The pizza business currently has very favorable prospects compared with other businesses, and its target market spans all walks of life, from children to adults. Pizza sales transactions produce sales data every day, but this data has not been put to full use: it is only stored as an archive, so it becomes a pile of data. Data mining is therefore used to address this problem. The Apriori algorithm is a data mining method that uses minimum support and minimum confidence parameters; here it is applied to each month of sales transactions. This study produces association rules from the collected sales transaction data. From the association rules it can be concluded that, in the pattern of pizza sales, consumers most often buy Meatzza and Cheese Mania, as shown by the results of calculations using the Apriori algorithm and RapidMiner 5.3, with a support of 30% and a confidence of 60%.


In the area of data mining, many algorithms exist for finding frequent itemsets in huge databases, and the Apriori algorithm is the basis of all of them. In the Uapriori algorithm, each item's existential probability is examined against a given support count: if it is greater than or equal to that count, the item is known as a frequent item; otherwise it is known as an infrequent item. In this paper, matrix technology is introduced into the Uapriori algorithm, which reduces the execution time and computational complexity of finding frequent itemsets in an uncertain transactional database. In the modern era, the volume of data is increasing exponentially, and highly optimized algorithms are needed to process such large amounts of data in less time. The proposed algorithm can be used in the field of data mining to retrieve frequent itemsets from a large database with very low computational complexity.
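Frequency in an uncertain transactional database can be sketched with the expected-support measure commonly used with U-Apriori: the sum, over transactions, of the product of the existential probabilities of the items in the itemset, with items assumed independent. The abstract does not spell out its measure or its matrix representation, so this definition, the function names `expected_support` and `is_frequent`, and the toy probabilities are all assumptions.

```python
from math import prod

def expected_support(itemset, uncertain_db):
    """Sum over transactions of the product of the items' existential probabilities.

    Each transaction is a dict {item: probability}; a missing item has probability 0.
    Items are assumed independent (an assumption of this sketch).
    """
    return sum(prod(t.get(item, 0.0) for item in itemset) for t in uncertain_db)

def is_frequent(itemset, uncertain_db, min_expected_support):
    """True if the expected support reaches the threshold."""
    return expected_support(itemset, uncertain_db) >= min_expected_support

# Toy uncertain database, invented for this example.
db = [
    {"a": 0.5, "b": 1.0},
    {"a": 1.0, "b": 0.5},
    {"a": 0.5},
]
print(expected_support(["a"], db))
print(expected_support(["a", "b"], db))
print(is_frequent(["a", "b"], db, 1.5))
```

In this toy database item "a" has expected support 2.0, while the pair of "a" and "b" has only 1.0 and falls below a threshold of 1.5.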


2008 ◽  
pp. 3222-3234
Author(s):  
Yun Sing Koh ◽  
Nathan Rountree ◽  
Richard O’Keefe

Discovering association rules efficiently is an important data mining problem. We define sporadic rules as those with low support but high confidence; for example, a rare association of two symptoms indicating a rare disease. To find such rules using the well-known Apriori algorithm, minimum support has to be set very low, producing a large number of trivial frequent itemsets. To alleviate this problem, we propose a new method of discovering sporadic rules without having to produce all other rules above the minimum support threshold. The new method, called Apriori-Inverse, is a variation of the Apriori algorithm that uses the notion of maximum support instead of minimum support to generate candidate itemsets. Candidate itemsets of interest to us fall below a maximum support value but above a minimum absolute support value. Rules above maximum support are considered frequent rules, which are of no interest to us, whereas rules that occur by chance fall below the minimum absolute support value. We define two classes of sporadic rule: perfectly sporadic rules (those that consist only of items falling below maximum support) and imperfectly sporadic rules (those that may contain items over the maximum support threshold). This article is an expanded version of Koh and Rountree (2005).
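The selection criterion for perfectly sporadic itemsets can be sketched as follows: keep only items whose support is below the maximum support and whose count reaches the minimum absolute support, then count combinations of those items. This is a simplified illustration limited to single items and pairs, not the authors' Apriori-Inverse implementation. The function name `sporadic_itemsets`, the handling of the threshold boundaries, and the symptom data are assumptions.

```python
from itertools import combinations

def sporadic_itemsets(transactions, max_support, min_abs_count):
    """Return 1- and 2-itemsets made only of rare-but-not-chance items."""
    n = len(transactions)
    counts = {}
    for t in transactions:
        for item in t:
            counts[item] = counts.get(item, 0) + 1
    # Boundary choice (strictly below max, at least min) is an assumption.
    rare = {i for i, c in counts.items()
            if c / n < max_support and c >= min_abs_count}
    result = {frozenset([i]): counts[i] for i in rare}
    for a, b in combinations(sorted(rare), 2):
        c = sum(1 for t in transactions if a in t and b in t)
        if c >= min_abs_count:
            result[frozenset([a, b])] = c
    return result

# Hypothetical symptom records: two common symptoms, two rare ones.
transactions = [
    {"cough"}, {"cough"}, {"cough", "fever"}, {"cough", "fever"}, {"fever"},
    {"cough"}, {"cough", "fever"}, {"cough", "rash", "numbness"},
    {"rash", "numbness"}, {"fever"},
]
result = sporadic_itemsets(transactions, max_support=0.3, min_abs_count=2)
print(result)
```

Because the common items are excluded before any pair is counted, the rare pair is found without generating the frequent itemsets that a very low minimum support would produce.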


2011 ◽  
Vol 145 ◽  
pp. 292-296
Author(s):  
Lee Wen Huang

Data mining is the process of nontrivial extraction of implicit, previously unknown, and potentially useful information from data in databases. Mining closed large itemsets is a further step in mining association rules, which aims to find the necessary subset of the large itemsets that can represent all large itemsets. In this paper, we design a hybrid approach that takes the character of the data into account to mine the closed large itemsets efficiently. Two features of market basket analysis are considered: the number of items is large, and the number of associated items for each item is small. By combining the cut-point method and the hash concept, the new algorithm can find the closed large itemsets efficiently. The simulation results show that the new algorithm outperforms the FP-CLOSE algorithm in execution time and storage space.
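What makes a large itemset closed can be shown with a brute-force filter that uses the standard definition: an itemset is closed if no proper superset has the same support count. This is not the cut-point and hash approach of the paper, which the abstract does not detail. The function name `closed_itemsets` and the toy counts are assumptions.

```python
def closed_itemsets(frequent):
    """Keep itemsets that have no proper superset with the same support count.

    frequent maps frozenset -> support count and must contain all large itemsets.
    """
    closed = {}
    for itemset, count in frequent.items():
        has_equal_superset = any(
            itemset < other and count == other_count  # '<' is proper subset
            for other, other_count in frequent.items()
        )
        if not has_equal_superset:
            closed[itemset] = count
    return closed

# Toy support counts, invented for this example.
frequent = {
    frozenset({"a"}): 3,
    frozenset({"b"}): 2,
    frozenset({"a", "b"}): 2,
}
closed = closed_itemsets(frequent)
print(closed)
```

Here the itemset containing only "b" is dropped because its superset has the same count, so its support can be recovered from the closed set.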


2019 ◽  
Vol 15 (1) ◽  
pp. 85-90 ◽  
Author(s):  
Jordy Lasmana Putra ◽  
Mugi Raharjo ◽  
Tommi Alfian Armawan Sandi ◽  
Ridwan Ridwan ◽  
Rizal Prasetyo

The business world is developing increasingly rapidly, so companies, in this case retail companies, need a special strategy to increase turnover. One way to increase a company's turnover is the data mining process, for example using the Apriori algorithm. The Apriori algorithm can find association rules that can later be used as patterns of consumer purchases. This study uses a repository of 209 records consisting of 23 transactions and 164 attributes. The results show that the item named CREAM CUPID HEART COAT HANGER is the product most often purchased by consumers. By knowing consumers' purchasing patterns, company management can increase turnover by referring to the results of processing sales transaction data with the Apriori algorithm.

