Research on Improved Apriori Algorithm Based on Data Mining in Electronic Cases

Author(s):  
Xiaoli Wang ◽  
Kui Su ◽  
Lirong Su

This article improves on the commonly used Apriori algorithm and proposes a new Apriori algorithm based on event ID. Association rules are mined from massive medical data with the new algorithm and then applied in a prediction system for lifestyle-related diseases. The aim is for the prediction system to better serve individuals, families, and society as a whole. Once a person's basic information is entered, the system automatically reports health-related information about the user and offers advice based on the resulting data, helping people take a self-determined part in managing their health.

The demand for data mining in the medical industry is now unavoidable because of its many applications in predicting diseases at an early stage. Data mining methods make it easy to extract useful patterns and quick to recognize task-based outcomes. Classification models are useful for building classes over medical data sets so that future analysis is accurate. In addition, association rules are a promising technique for finding hidden patterns in a medical data set and have been applied successfully to market basket data, census data, and financial data. The Apriori algorithm, considered a classic, mines frequent itemsets from a database containing a large number of transactions and also generates the relevant association rules. Association rules capture the relationships among items present in a data set. When the data set contains continuous attributes, the existing algorithms may not work; discretization can therefore be applied so that relations among the patterns in the data set can still be found. In this paper, Discretized Apriori is used to predict the by-disease in people found to have diabetic syndrome, and the extracted rules are analyzed. In the discretization step, numerical data are discretized and fed to the Apriori algorithm to obtain better association rules for predicting the diseases.
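The discretization step described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' procedure: the attribute names, cut points, and labels are hypothetical and are not clinical thresholds. It shows how a numeric record might be turned into a set of categorical items that an Apriori implementation can consume.

```python
# Hypothetical bins for illustration only; these are not clinical thresholds.
BINS = {
    "glucose": ([100, 126], ["glucose=low", "glucose=mid", "glucose=high"]),
    "bmi": ([25, 30], ["bmi=low", "bmi=mid", "bmi=high"]),
}

def discretize(value, cut_points, labels):
    """Return the label of the first bin whose upper cut point exceeds value."""
    for cut, label in zip(cut_points, labels):
        if value < cut:
            return label
    return labels[-1]  # value is at or above the last cut point

def to_transaction(record):
    """Turn a dict of numeric attributes into a set of categorical items."""
    return {discretize(v, *BINS[name]) for name, v in record.items() if name in BINS}

# Hypothetical patient record
patient = {"glucose": 130, "bmi": 27.5}
items = to_transaction(patient)
```

Each patient record then becomes one transaction, so frequent itemsets over the labels play the role that frequent baskets play in market basket data.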


2019 ◽  
Vol 15 (1) ◽  
pp. 85-90 ◽  
Author(s):  
Jordy Lasmana Putra ◽  
Mugi Raharjo ◽  
Tommi Alfian Armawan Sandi ◽  
Ridwan Ridwan ◽  
Rizal Prasetyo

The business world is developing ever more rapidly, so companies, in this case retail companies, need a specific strategy to increase turnover. One way to increase turnover is the data mining process, for example with the Apriori algorithm. The Apriori algorithm finds association rules that can later be used as patterns of consumer purchasing. This study uses a repository of 209 records consisting of 23 transactions and 164 attributes. The results show that the item named CREAM CUPID HEART COAT HANGER is the product most often purchased by consumers. Knowing consumers' purchasing patterns, company management can increase turnover by drawing on the results of processing sales transaction data with the Apriori algorithm.


2014 ◽  
Vol 721 ◽  
pp. 543-546 ◽  
Author(s):  
Dong Juan Gu ◽  
Lei Xia

The Apriori algorithm is the classical algorithm for mining association rules. Because it needs to scan the database many times, it runs slowly. To improve running efficiency, this paper improves the Apriori algorithm based on an analysis of it. The idea is to transform the transaction database into a corresponding 0-1 matrix, in which the inner product of each vector with a subsequent vector gives the support. The support is compared with the given minsupport, and rows and columns whose vectors fall below minsupport are deleted, which reduces the size of the matrix and improves running speed. Because the improved algorithm needs to scan the database only once, it runs faster. The experiment also shows that the improved algorithm is efficient and feasible.
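The matrix idea in this abstract can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the paper's algorithm: rows are taken to be items and columns transactions, only 2-itemsets are computed, and the function names and sample data are hypothetical.

```python
def to_matrix(transactions, items):
    """Build one 0-1 row per item; column j is 1 if transaction j contains the item."""
    return [[1 if item in t else 0 for t in transactions] for item in items]

def inner(u, v):
    """Inner product of two 0-1 vectors = number of transactions containing both items."""
    return sum(a * b for a, b in zip(u, v))

def frequent_pairs(transactions, min_count):
    """Return {(a, b): support_count} for item pairs meeting min_count."""
    items = sorted({i for t in transactions for i in t})
    rows = dict(zip(items, to_matrix(transactions, items)))  # single pass over the data
    # Prune rows (items) whose own support is below the threshold.
    rows = {i: r for i, r in rows.items() if sum(r) >= min_count}
    kept = sorted(rows)
    result = {}
    for idx, a in enumerate(kept):
        for b in kept[idx + 1:]:
            count = inner(rows[a], rows[b])
            if count >= min_count:
                result[(a, b)] = count
    return result

# Hypothetical transactions
transactions = [{"A", "B", "C"}, {"A", "B"}, {"A", "C"}, {"B", "D"}]
pairs = frequent_pairs(transactions, min_count=2)
```

After the matrix is built, the transactions are never read again: every support count comes from the stored vectors, which is where the single-scan property comes from.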


2014 ◽  
Vol 568-570 ◽  
pp. 798-801
Author(s):  
Ye Qing Xiong ◽  
Shu Dong Zhang

Traditional association rule algorithms run into time and space performance bottlenecks when applied to big data mining. This paper proposes a matrix-based parallel algorithm under cloud computing to improve the Apriori algorithm. The algorithm stores transaction data in a binary matrix, replaces the join between itemsets with a matrix "and" operation, and uses cloud computing technology to mine frequent itemsets in parallel. Simulations under different conditions show that it improves efficiency, resolves the performance bottleneck, and can be widely used in big data mining with strong scalability and stability.
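The "and" operation mentioned in this abstract can be illustrated on a single machine. The sketch below is an assumption-laden simplification: it stores each item's binary row as a Python integer bitmap (bit j set when transaction j contains the item) and omits the parallel, cloud-based part entirely. The function names and sample data are hypothetical.

```python
def item_bitmaps(transactions):
    """Map each item to an integer whose bit j is set if transaction j contains it."""
    bitmaps = {}
    for tid, t in enumerate(transactions):
        for item in t:
            bitmaps[item] = bitmaps.get(item, 0) | (1 << tid)
    return bitmaps

def support_count(itemset, bitmaps):
    """AND the bitmaps of all items in a non-empty itemset and count the set bits."""
    it = iter(itemset)
    acc = bitmaps.get(next(it), 0)
    for item in it:
        acc &= bitmaps.get(item, 0)  # unseen items contribute an all-zero row
    return bin(acc).count("1")

# Hypothetical transactions
transactions = [{"A", "B", "C"}, {"A", "B"}, {"A", "C"}, {"B", "D"}]
bitmaps = item_bitmaps(transactions)
ab = support_count(["A", "B"], bitmaps)
abc = support_count(["A", "B", "C"], bitmaps)
```

Because the bitmap of a k-itemset is the AND of the bitmaps of its parts, candidate itemsets can be evaluated independently of one another, which is what makes the approach amenable to parallel execution.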


2013 ◽  
Vol 321-324 ◽  
pp. 2578-2582
Author(s):  
Qian Zhang

This paper examined the application of the Apriori algorithm to extracting association rules in data mining, using sample data on student enrollments. It studied the data mining techniques for extracting association rules, analyzed the correlation between specialties and characteristics of admitted students, and evaluated the algorithm for mining association rules with a minimum support of 30% and a minimum confidence of 40%.
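To make the two thresholds concrete: the support of an itemset is the fraction of transactions containing it, and the confidence of a rule A → B is support(A ∪ B) / support(A). The sketch below applies the 30% and 40% thresholds from the abstract to pair rules only; the enrollment data and function names are hypothetical and do not come from the paper.

```python
from itertools import permutations

def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def pair_rules(transactions, min_support=0.30, min_confidence=0.40):
    """Return (antecedent, consequent, support, confidence) for single-item rules."""
    items = sorted({i for t in transactions for i in t})
    rules = []
    for a, b in permutations(items, 2):
        s = support({a, b}, transactions)
        if s < min_support:
            continue
        c = s / support({a}, transactions)  # confidence of a -> b
        if c >= min_confidence:
            rules.append((a, b, s, c))
    return rules

# Hypothetical enrollment records: student characteristic plus admitted specialty
records = [
    {"urban", "engineering"},
    {"urban", "engineering"},
    {"rural", "arts"},
    {"urban", "arts"},
]
rules = pair_rules(records)
```

In this toy data only the pair {urban, engineering} reaches 30% support, so the two rules between those items are the only ones returned.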


SinkrOn ◽  
2020 ◽  
Vol 4 (2) ◽  
pp. 1 ◽  
Author(s):  
Rusdiansyah Rusdiansyah ◽  
Nining Suharyanti ◽  
Triningsih Triningsih ◽  
Muhammad Darussalam

Pizza is a processed food that originated in Italy and has spread to many other countries, including Indonesia. It is currently sought after by many groups of people, which makes pizza a very profitable food business opportunity. The pizza business currently has very favorable prospects compared with other businesses; moreover, its target market spans all walks of life, from children to adults. Pizza sales generate transaction data every day, but this data has not been put to full use: it is only stored as an archive and becomes a pile of data. Data mining is therefore used to address this problem. The Apriori algorithm is a data mining method that uses minimum support and minimum confidence parameters; here it is applied to sales transactions on a monthly basis. This study produces association rules from the collected sales transaction data. From these rules, the pattern of pizza sales can be concluded: consumers most often buy Meatzza and Cheese Mania, as shown by calculations using the Apriori algorithm and RapidMiner 5.3, with 30% support and 60% confidence.


Author(s):  
Asep Budiman Kusdinar ◽  
Daris Riyadi ◽  
Asriyanik Asriyanik

A buffet restaurant is a restaurant that provides buffet food served directly at the dining table so that customers can order more food according to their needs. This study uses the association rule method, one of the methods of data mining, together with the Apriori algorithm. Data mining is the process of discovering patterns or rules in data, in which the process must be automatic or semi-automatic. Association rules are a data mining technique used to look for relationships between items in a dataset. The Apriori algorithm is a very well-known algorithm for finding high-frequency patterns and is used for association rule mining. High-frequency patterns are patterns of items in the database whose frequency, or support, is above a given threshold. These high-frequency patterns are used to develop rules and also in some other data mining techniques. The food menu in the Asgar restaurant is currently arranged randomly, without regard to how menu items relate to one another. The result of this research supports the arrangement of the food menu at the Asgar restaurant so that it is easier to take related menu items together.


2020 ◽  
Vol 4 (1) ◽  
pp. 112
Author(s):  
Siti Awaliyah Rachmah Sutomo ◽  
Frisma Handayanna

Data mining methods can process data to obtain information and assist in decision making. The volume of sales transaction data from drug purchases can lead to data accumulation and various problems, such as in drug stock inventory and sales transaction data. With data mining techniques, consumers' drug purchase patterns can be analyzed to find which drugs are commonly purchased by most people, and the Apriori algorithm is expected to help form combinations of itemsets. Drug purchase patterns are determined by applying the Apriori algorithm and looking at consumers' tendency to buy drugs based on combinations of 3 items. By analyzing high-frequency patterns and forming association rules with a minimum support of 30%, one 3-itemset is found, namely MOLAGIT PER TAB (M1), VIT C TABLET (V2), and PARACETAMOL 500 MG TABLET (P2), with 33.33% support; with a minimum confidence of 65%, there are 6 final association rules.
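As background for the rule-formation step: a frequent 3-itemset can be split into an antecedent and a consequent in at most 2^3 - 2 = 6 ways, and each candidate split is then checked against the minimum confidence. The sketch below only enumerates those candidate splits, using the item codes from the abstract; it is an illustration, and whether the paper's 6 final rules are exactly these splits is not stated in the abstract.

```python
from itertools import combinations

def candidate_rules(itemset):
    """Enumerate every (antecedent, consequent) split of an itemset into two non-empty parts."""
    items = sorted(itemset)
    rules = []
    for k in range(1, len(items)):
        for antecedent in combinations(items, k):
            consequent = tuple(i for i in items if i not in antecedent)
            rules.append((antecedent, consequent))
    return rules

# Item codes as given in the abstract
splits = candidate_rules({"M1", "V2", "P2"})
n_splits = len(splits)
```

Each split would be kept as a rule only if support(itemset) / support(antecedent) reaches the chosen minimum confidence.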


2011 ◽  
Vol 282-283 ◽  
pp. 770-773
Author(s):  
Rong Liang Luo

The development of data mining technology makes it easier to analyze tobacco consumers' behavior. After a brief introduction to the content and categories of data mining technology, a survey on tobacco consumption behavior by Shaoxin Tobacco Company is analyzed with association rules using the data mining software Weka, and the factors that affect tobacco consumption are mined with the Apriori algorithm, so as to provide valuable references for the tobacco company on brand communication channels, product design, improvement of taste and flavor, packaging, price, and other aspects.

