On a Data Mining Framework for the Identification of Frequent Pattern Trends

Author(s):  
Marina Resta
Keyword(s):  
2021 ◽  
Vol 11 (4) ◽  
pp. 1715
Author(s):  
Jieh-Ren Chang ◽  
You-Shyang Chen ◽  
Chien-Ku Lin ◽  
Ming-Fu Cheng

Storage devices in the computer industry have gradually transformed from the hard disk drive (HDD) to the solid-state drive (SSD), of which the key component is error correction in not-and (NAND) flash memory. While NAND flash memory is under development, it is still limited by the “program and erase” cycle (PE cycle). Therefore, the improvement of quality and the formulation of customer service strategy are topics worthy of discussion at this stage. This study is based on computer company A as the research object and collects more than 8000 items of SSD error data of its customers, which are then calculated with data mining and frequent pattern growth (FP-Growth) of the association rule algorithm to identify the association rule of errors by setting the minimum support degree of 90 and the minimum trust degree of 10 as the threshold. According to the rules, three improvement strategies of production control are suggested: (1) use of the association rule to speed up the judgment of the SSD error condition by customer service personnel, (2) a quality strategy, and (3) a customer service strategy.


Author(s):  
Anne Denton

Time series data is of interest to most science and engineering disciplines and analysis techniques have been developed for hundreds of years. There have, however, in recent years been new developments in data mining techniques, such as frequent pattern mining, that take a different perspective of data. Traditional techniques were not meant for such pattern-oriented approaches. There is, as a result, a significant need for research that extends traditional time-series analysis, in particular clustering, to the requirements of the new data mining algorithms.


2013 ◽  
Vol 443 ◽  
pp. 402-406 ◽  
Author(s):  
Shang Gao ◽  
Mei Mei Li

With the rapid development of the number of mobile phone users has accumulated a large number of graph data, graph data mining has gradually become a hot area of research. Traditional data such as clustering, classification, frequent pattern mining gradually extended to the field of graph data mining research. Introduced at this stage graph data mining technology research progress, summarizes the characteristics of the graphical data mining, practical significance, the main problem, and scenarios to discuss and forecast chart data, especially research on uncertain graph data become trends and hot spots.


2017 ◽  
Vol 10 (13) ◽  
pp. 191
Author(s):  
Nikhil Jamdar ◽  
A Vijayalakshmi

There are many algorithms available in data mining to search interesting patterns from transactional databases of precise data. Frequent pattern mining is a technique to find the frequently occurred items in data mining. Most of the techniques used to find all the interesting patterns from a collection of precise data, where items occurred in each transaction are certainly known to the system. As well as in many real-time applications, users are interested in a tiny portion of large frequent patterns. So the proposed user constrained mining approach, will help to find frequent patterns in which user is interested. This approach will efficiently find user interested frequent patterns by applying user constraints on the collections of uncertain data. The user can specify their own interest in the form of constraints and uses the Map Reduce model to find uncertain frequent pattern that satisfy the user-specified constraints 


2012 ◽  
Vol 195-196 ◽  
pp. 984-986
Author(s):  
Ming Ru Zhao ◽  
Yuan Sun ◽  
Jian Guo ◽  
Ping Ping Dong

Frequent itemsets mining is an important data mining task and a focused theme in data mining research. Apriori algorithm is one of the most important algorithm of mining frequent itemsets. However, the Apriori algorithm scans the database too many times, so its efficiency is relatively low. The paper has therefore conducted a research on the mining frequent itemsets algorithm based on a across linker. Through comparing with the classical algorithm, the improved algorithm has obvious advantages.


2017 ◽  
Author(s):  
Andysah Putera Utama Siahaan ◽  
Mesran Mesran ◽  
Andre Hasudungan Lubis ◽  
Ali Ikhwan ◽  
Supiyandi

Sales transaction data on a company will continue to increase day by day. Large amounts of data can be problematic for a company if it is not managed properly. Data mining is a field of science that unifies techniques from machine learning, pattern processing, statistics, databases, and visualization to handle the problem of retrieving information from large databases. The relationship sought in data mining can be a relationship between two or more in one dimension. The algorithm included in association rules in data mining is the Frequent Pattern Growth (FP-Growth) algorithm is one of the alternatives that can be used to determine the most frequent itemset in a data set.


2019 ◽  
Vol 3 (2) ◽  
pp. 115
Author(s):  
Mardiah Mardiah

<span><em>The importance of inventory systems at a pharmacy and the type of goods which</em><br /><span><em>are a top priority that must be in stock. It is useful to anticipate the void stuff. Due to the</em><br /><span><em>lack of inventory may affect customer service and asset to the pharmacy. Therefore, this</em><br /><span><em>study was conducted to help resolve those problems by designing a data mining</em><br /><span><em>application that serves to predict sales of the drug is needed most knowable a priori</em><br /><span><em>algorithm with the help of Tools Tanagra. One of the interesting association analysis</em><br /><span><em>phase analysis algorithm that generates a high frequency patterns (frequent pattern</em><br /><span><em>mining).</em><br /><span><em>Keywords: Data Mining, Apriori Algorithm, Association Rule</em></span></span></span></span></span></span></span></span><br /><br class="Apple-interchange-newline" /></span>


2017 ◽  
Author(s):  
◽  
Michael Phinney

Frequent pattern mining is a classic data mining technique, generally applicable to a wide range of application domains, and a mature area of research. The fundamental challenge arises from the combinatorial nature of frequent itemsets, scaling exponentially with respect to the number of unique items. Apriori-based and FPTree-based algorithms have dominated the space thus far. Initial phases of this research relied on the Apriori algorithm and utilized a distributed computing environment; we proposed the Cartesian Scheduler to manage Apriori's candidate generation process. To address the limitation of bottom-up frequent pattern mining algorithms such as Apriori and FPGrowth, we propose the Frequent Hierarchical Pattern Tree (FHPTree): a tree structure and new frequent pattern mining paradigm. The classic problem is redefined as frequent hierarchical pattern mining where the goal is to detect frequent maximal pattern covers. Under the proposed paradigm, compressed representations of maximal patterns are mined using a top-down FHPTree traversal, FHPGrowth, which detects large patterns before their subsets, thus yielding significant reductions in computation time. The FHPTree memory footprint is small; the number of nodes in the structure scales linearly with respect to the number of unique items. Additionally, the FHPTree serves as a persistent, dynamic data structure to index frequent patterns and enable efficient searches. When the search space is exponential, efficient targeted mining capabilities are paramount; this is one of the key contributions of the FHPTree. This dissertation will demonstrate the performance of FHPGrowth, achieving a 300x speed up over state-of-the-art maximal pattern mining algorithms and approximately a 2400x speedup when utilizing FHPGrowth in a distributed computing environment. In addition, we allude to future research opportunities, and suggest various modifications to further optimize the FHPTree and FHPGrowth. Moreover, the methods we offer will have an impact on other data mining research areas including contrast set mining as well as spatial and temporal mining.


Author(s):  
Moch. Syahrir ◽  
Fatimatuzzahra Fatimatuzzahra

Data mining dengan peran asosiasi sudah banyak digunakan oleh dunia usaha, salah satu algoritma yang sering digunakan untuk aturan asosiasi adalah apriori. Namun apriori memiliki kelemahan dalam hal performa, karena pada setiap penentuan frequent k-itemset harus melakukan scan database. Hal ini akan menjadi masalah apabila kandidat k-itemset memiliki dimensi yang banyak. proses scan database yang besar akan memakan waktu yang lama dan berpengaruh pada penggunaan memori dan prosesor. Apriori sudah sering dikembangkan, salah satu yang populer adalah Frequent Pattern (fp-growth), apriori dan fp-growth sama-sama merupakan algoritma untuk aturan asosiasi, hanya saja fp-growth menggunakan pendekatan yang berbeda dengan apriori yakni menggunakan pendekatan Frequent Pattern Tree (fp-tree). Meski fp-growth memiiki performa yang bagus ketika scan database namun rules yang di hasilkan oleh fp-growth tidak sebaik yang di hasilkan oleh apriori. Alternatif lain yang bisa digunakan adalah metode hashing, hal ini bisa menjadi solusi untuk mengatasi masalah dalam proses pencarian dan penentuan frequent k-itemset, sehingga proses scan database bisa lebih cepat. Tujuan penelitian adalah memperbaiki kinerja apriori dalam proses pencarian frekuensi itemset sehingga waktu scan database bisa lebih cepat


Sign in / Sign up

Export Citation Format

Share Document