Clustering and Pattern Mining of Customer Transaction Data using Apriori Algorithm

2019 ◽  
Vol 8 (3) ◽  
pp. 8035-8040

Clustering customer transaction data is an important procedure for analyzing customer behavior in retail and e-commerce. Clustering transaction data and then mining patterns with the Apriori algorithm helps to develop a market strategy and increase profit. The system uses the Apriori algorithm to find patterns, taking the output of the customer transaction clustering algorithm as its input. In the system, customer transaction data is represented as transaction trees, and the distance between them is calculated. The customer transaction data is then clustered using the customer transaction clustering algorithm. The system selects frequent customers as representatives of customer groups. Finally, the system forwards the clustering output to the Apriori algorithm to find patterns.
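The pattern-mining step described in this abstract can be illustrated with a minimal sketch of the classic level-wise Apriori procedure. This is not the paper's implementation: the function name `apriori`, the `min_support` threshold, and the sample transactions are all assumptions made for illustration, and the transaction-tree clustering stage is not modeled. The input is simply assumed to be the transactions of one customer cluster.

```python
from itertools import combinations

def apriori(transactions, min_support):
    """Return {frozenset(itemset): support} for all frequent itemsets.

    transactions: list of sets of items; min_support: fraction in (0, 1].
    """
    n = len(transactions)
    # Level 1: candidate itemsets are the individual items.
    candidates = {frozenset([item]) for t in transactions for item in t}
    frequent = {}
    k = 1
    while candidates:
        # Count how many transactions contain each candidate.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        level = {c: cnt / n for c, cnt in counts.items() if cnt / n >= min_support}
        frequent.update(level)
        # Join step: merge frequent k-itemsets into (k+1)-item candidates.
        k += 1
        joined = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-item subset of a candidate must be frequent.
        candidates = {
            c for c in joined
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
    return frequent

# Hypothetical transactions for one customer cluster (illustrative data only).
cluster_transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "butter"},
]
frequent_itemsets = apriori(cluster_transactions, min_support=0.5)
```

With this sample data, every single item and every pair meets the 50% threshold, while the three-item set appears in only one of four transactions and is dropped.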

2021 ◽  
Vol 13 (3) ◽  
pp. 1089
Author(s):  
Hailin Zheng ◽  
Qinyou Hu ◽  
Chun Yang ◽  
Jinhai Chen ◽  
Qiang Mei

Since the spread of the coronavirus disease 2019 (COVID-19) pandemic, the transportation of cargo by ship has been seriously impacted. In order to prevent and control maritime COVID-19 transmission, it is of great significance to track and predict ship sailing behavior. As the nodes of cargo ship transportation networks, ports of call can reflect the sailing behavior of the cargo ship. Accurate hierarchical division of ports of call can help to clarify the navigation law of ships with different ship types and scales. Among typical cargo ships, those with a deadweight tonnage over 10,000 account for 95.77% of total deadweight, and 592,244 ship berthing records were mined from automatic identification system (AIS) data from January to October 2020. Considering ship type and ship scale, port hierarchy classification models are constructed to divide these ports into three kinds of specialized ports: bulk, container, and tanker ports. For all types of specialized ports (considering ship scale), port call probability for the corresponding ship type is higher than for other ships, positively correlated with ship deadweight if port scale is bigger than ship scale, and negatively correlated with ship deadweight if port scale is smaller than ship scale. Moreover, port call probability for the corresponding ship type is positively correlated with ship deadweight, while port call probability for other ship types is negatively correlated with ship deadweight. Results indicate that a specialized port hierarchical clustering algorithm can divide the hierarchical structure of typical cargo ship calling ports, and is an effective method to track the maritime transmission path of the COVID-19 pandemic.


JOUTICA ◽  
2018 ◽  
Vol 3 (1) ◽  
pp. 117 ◽  
Author(s):  
Elly Muningsih ◽  
Sri Kiswati

Customers are a very important asset for a company, and having loyal customers is essential to the company's progress. This study aims to help companies, especially online shops, manage customers better by identifying and grouping them into several clusters to learn the characteristics of their loyalty to the company. The method used is K-Means, one of the best and most popular clustering algorithms. To overcome the weakness of K-Means in determining the number of clusters, we use the Elbow method, which compares candidate numbers of clusters by calculating the SSE (Sum of Squared Errors) for each value. The research starts by collecting the necessary data for processing. Of 478 transaction records in total, 73 remained after data cleaning. The data were then processed with RapidMiner software for 2 up to 10 clusters to find the center of each cluster. From the calculated SSE values, the best number of clusters was found to be 3. The end result of the research is a Visual Basic application that is expected to make grouping, or clustering, customers easier. The software was developed using the Waterfall method.
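The Elbow method mentioned in this abstract can be sketched as follows. The study itself used RapidMiner; the pure-Python, one-dimensional k-means below is only an assumed stand-in, and the function name `kmeans_sse`, the evenly spaced initial centroids, and the sample values are illustrative choices rather than anything taken from the paper.

```python
def kmeans_sse(points, k, iterations=20):
    """Run a basic 1-D k-means and return the final SSE.

    Initial centroids are taken at evenly spaced positions in the sorted
    data, a simplification chosen here to keep the sketch deterministic.
    """
    data = sorted(points)
    centroids = [data[(2 * i + 1) * len(data) // (2 * k)] for i in range(k)]
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in data:
            nearest = min(range(k), key=lambda i: (p - centroids[i]) ** 2)
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    # SSE: squared distance from each point to its nearest centroid.
    return sum(min((p - c) ** 2 for c in centroids) for p in data)

# Hypothetical per-customer values with three visible groups.
values = [1, 2, 3, 10, 11, 12, 20, 21, 22]
sse_by_k = {k: kmeans_sse(values, k) for k in range(1, 5)}
```

For this sample, the SSE falls sharply up to k = 3 and barely changes at k = 4, which is the "elbow" one would look for when plotting SSE against the number of clusters.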


2019 ◽  
Vol 2 (1) ◽  
pp. 31-36
Author(s):  
Arfianto Darmawan ◽  
Titin Kristiana

The Anakku Foundation Cooperative is a multi-business cooperative consisting of a shop, a savings and loan unit, and a student shuttle service. Every sale of goods or services is entered directly as data in each business unit. The cooperative still has problems: its store transaction records cannot yet show which items sell most often, and it is still difficult to determine which stock items are available or nearly sold out. Data mining techniques are widely used to address such problems, one of which is the Apriori algorithm, applied to obtain information about associations between products from a transaction database. Transaction data on school equipment sales at the Cooperative Employees of Anakku Foundation can be reprocessed with data mining applications to produce strong association rules between itemsets of school supplies sales, which can provide recommendations for item arrangement and simplify the placement of strongly interdependent items. The results show that the rule with the highest support and confidence is: if a customer buys MUSLIM L1.5P1, they also buy AL-IZHAR II LOGO, with 14.5% support and 79.5% confidence.
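For readers unfamiliar with the two measures reported above, the following minimal sketch shows how support and confidence are usually computed for a rule "antecedent -> consequent". The function names and the basket data are hypothetical placeholders and do not reproduce the cooperative's transactions or its reported figures.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t) / len(transactions)

def confidence(transactions, antecedent, consequent):
    """Estimated P(consequent | antecedent) for antecedent -> consequent."""
    both = support(transactions, set(antecedent) | set(consequent))
    return both / support(transactions, antecedent)

# Hypothetical baskets; item names are placeholders, not the study's data.
baskets = [
    {"uniform", "logo"},
    {"uniform", "logo", "pen"},
    {"uniform", "pen"},
    {"pen"},
    {"logo"},
]
rule_support = support(baskets, {"uniform", "logo"})
rule_confidence = confidence(baskets, {"uniform"}, {"logo"})
```

Here the rule's support is the share of all baskets containing both items, while its confidence is that share divided by the share of baskets containing the antecedent alone.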


2019 ◽  
Vol 15 (1) ◽  
pp. 85-90 ◽  
Author(s):  
Jordy Lasmana Putra ◽  
Mugi Raharjo ◽  
Tommi Alfian Armawan Sandi ◽  
Ridwan Ridwan ◽  
Rizal Prasetyo

The business world is developing ever more rapidly, so companies, in this case retail companies, need a specific strategy to increase turnover. One way to increase turnover is data mining, for example with the Apriori algorithm. The Apriori algorithm finds association rules that can then be used as patterns of consumer purchasing. This study uses a repository of 209 records consisting of 23 transactions and 164 attributes. The results show that the item named CREAM CUPID HEART COAT HANGER is the product most often purchased by consumers. Knowing consumers' purchasing patterns, company management can increase turnover by referring to the results of processing sales transaction data with the Apriori algorithm.


Author(s):  
Juhi Singh ◽  
Mandeep Mittal ◽  
Sarla Pareek

Due to the increased availability of individual customer data, it is possible to predict customer buying patterns. Customers can be segmented using clustering algorithms based on parameters such as Recency, Frequency, and Monetary value (RFM). The data can be further analyzed to infer rules among two or more purchases by a customer. In this chapter we present a clustering algorithm, an enhanced k-means algorithm based on k-means, to divide customers into segments. After segmentation, each segment is mined with the Apriori algorithm to infer rules so that the customer's purchase behavior can be predicted. From a large number of association rules with sufficient coverage, the customer's purchasing pattern can be predicted. An experiment on a real database is conducted to evaluate the effectiveness and utility of the approach. The results show that the proposed approach gives good insight into customer segmentation, and thus customer behavior can be predicted.
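As a rough illustration of the RFM parameters named in this abstract, the sketch below derives recency, frequency, and monetary value per customer from a purchase log. The function name `rfm_scores`, the tuple layout, and the sample purchases are assumptions for illustration; the chapter's enhanced k-means segmentation step is not shown.

```python
from datetime import date

def rfm_scores(purchases, today):
    """Map customer -> (recency_days, frequency, monetary).

    purchases: iterable of (customer_id, purchase_date, amount) tuples.
    """
    table = {}
    for customer, day, amount in purchases:
        last, freq, total = table.get(customer, (day, 0, 0.0))
        # Track the latest purchase date, purchase count, and total spend.
        table[customer] = (max(last, day), freq + 1, total + amount)
    return {
        customer: ((today - last).days, freq, total)
        for customer, (last, freq, total) in table.items()
    }

# Hypothetical purchase log (illustrative data only).
purchases = [
    ("c1", date(2024, 1, 5), 20.0),
    ("c1", date(2024, 1, 20), 35.0),
    ("c2", date(2023, 12, 1), 10.0),
]
rfm = rfm_scores(purchases, today=date(2024, 2, 1))
```

The resulting three-value vectors are the kind of features a clustering algorithm could then group into customer segments.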


2014 ◽  
Vol 568-570 ◽  
pp. 798-801
Author(s):  
Ye Qing Xiong ◽  
Shu Dong Zhang

Traditional association rule algorithms run into time and space performance bottlenecks when applied to big data mining. This paper proposes a matrix-based parallel algorithm under cloud computing to improve the Apriori algorithm. The algorithm stores transaction data in a binary matrix, replaces the join between itemsets with a matrix "and" operation, and combines cloud computing technology to mine frequent itemsets in parallel. Simulations under different conditions show that it improves efficiency, solves the performance bottleneck problem, and can be widely used in big data mining with strong scalability and stability.
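The binary-matrix idea in this abstract can be sketched on a single machine: each item gets one bit per transaction, and the support count of an itemset is the number of set bits after AND-ing the items' bit rows. This is only an assumed illustration of that idea, using Python integers as bit rows. The function names are hypothetical, and the paper's parallel cloud implementation is not modeled.

```python
def item_bitmaps(transactions):
    """Build one bitmap per item: bit j is set if transaction j has the item."""
    bitmaps = {}
    for j, t in enumerate(transactions):
        for item in t:
            bitmaps[item] = bitmaps.get(item, 0) | (1 << j)
    return bitmaps

def support_count(bitmaps, itemset, n_transactions):
    """Count transactions containing all items by AND-ing their bitmaps."""
    combined = (1 << n_transactions) - 1  # start with every transaction set
    for item in itemset:
        combined &= bitmaps.get(item, 0)
    return bin(combined).count("1")

# Hypothetical transactions (illustrative data only).
transactions = [{"a", "b"}, {"a", "c"}, {"a", "b", "c"}, {"b"}]
bitmaps = item_bitmaps(transactions)
count_ab = support_count(bitmaps, {"a", "b"}, len(transactions))
```

Because the bitmaps are built in one pass, candidate itemsets can be counted without rescanning the transaction list.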


2012 ◽  
Vol 195-196 ◽  
pp. 984-986
Author(s):  
Ming Ru Zhao ◽  
Yuan Sun ◽  
Jian Guo ◽  
Ping Ping Dong

Frequent itemset mining is an important data mining task and a central theme in data mining research. The Apriori algorithm is one of the most important algorithms for mining frequent itemsets. However, it scans the database too many times, so its efficiency is relatively low. This paper therefore studies a frequent itemset mining algorithm based on an across linker. Compared with the classical algorithm, the improved algorithm shows obvious advantages.

