frequent itemset mining Latest Research Papers

FPGA/GPU-based Acceleration for Frequent Itemsets Mining: A Comprehensive Review

ACM Computing Surveys ◽

10.1145/3472289 ◽

2022 ◽

Vol 54 (9) ◽

pp. 1-35

Author(s):

Lázaro Bustio-Martínez ◽

René Cumplido ◽

Martín Letras ◽

Raudel Hernández-León ◽

Claudia Feregrino-Uribe ◽

...

Keyword(s):

Graphics Processing Units ◽

Hardware Acceleration ◽

Frequent Itemsets ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Comprehensive Review ◽

Development Platform ◽

Itemset Mining ◽

Modern Development ◽

Frequent Itemsets Mining

In data mining, Frequent Itemsets Mining is a technique used in several domains with notable results. However, the large volume of data in modern datasets increases the processing time of Frequent Itemset Mining algorithms, making them unsuitable for many real-world applications. Accordingly, proposing new methods for Frequent Itemset Mining to obtain frequent itemsets in a realistic amount of time is still an open problem. A successful alternative is to employ hardware acceleration using Graphics Processing Units (GPU) and Field Programmable Gates Arrays (FPGA). In this article, a comprehensive review of the state of the art of Frequent Itemsets Mining hardware acceleration is presented. Several approaches (FPGA and GPU based) were contrasted to show their weaknesses and strengths. This survey gathers the most relevant and the latest research efforts for improving the performance of Frequent Itemsets Mining regarding algorithms advances and modern development platforms. Furthermore, this survey organizes the current research on Frequent Itemsets Mining from the hardware perspective considering the source of the data, the development platform, and the baseline algorithm.

Association Rule Mining Algorithms for Big Data using RDD-ECLAT Algorithms

10.21203/rs.3.rs-935690/v1 ◽

2021 ◽

Author(s):

Martha ◽

Ramdas Vankdothu ◽

Hameed Mohd Abdul ◽

Rekha Gangula

Keyword(s):

Data Mining ◽

Big Data ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

New Paradigm ◽

Rule Mining ◽

Data Intensive ◽

Itemset Mining ◽

Real World Datasets ◽

Mining Algorithms

Abstract The revolution in technology for storing and processing big data leads to data intensive computing as a new paradigm. To find the valuable and precise big data knowledge, efficient and scalable data mining techniques are required. In data mining, different techniques are applied depending on the kind of knowledge to be mined. Association rules are generated from the frequent itemsets computed by frequent itemset mining (FIM) algorithms. The problem of designing scalable and efficient frequent itemset mining algorithms on the Spark RDD framework. The research done in this thesis aims to improve the performance (in terms of execution time) of the existing Spark-based frequent itemset mining algorithms and efficiently re-design other frequent itemset mining algorithms on Spark. The particular problem of interest is re-designing the Eclat algorithm in the distributed computing environment of the Spark. The paper proposes and implements a parallel Eclat algorithm using the Spark RDD architecture, dubbed RDD-Eclat. EclatV1 is the earliest version, followed by EclatV2, EclatV3, EclatV4, and EclatV5. Each version is the consequence of a different technique and heuristic being applied to the preceding variant. Following EclatV1, the filtered transaction technique is used, followed by heuristics for equivalence class partitioning in EclatV4 and EclatV5. EclatV2 and EclatV3 are slightly different algorithmically, as are EclatV4 and EclatV5. Experiments on synthetic and real-world datasets.

Comparative Analysis on Frequent Itemset Mining Algorithms in Vertically Partitioned Cloud Data

10.1007/978-981-16-4625-6_38 ◽

2021 ◽

pp. 395-402

Author(s):

M. Yogasini ◽

B. N. Prathibha

Keyword(s):

Comparative Analysis ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Cloud Data ◽

Itemset Mining ◽

Mining Algorithms

A Database Reconstruction Approach for the Inverse Frequent Itemset Mining Problem

10.1007/978-3-030-80571-5_4 ◽

2021 ◽

pp. 45-58

Author(s):

Panteleimon Krasadakis ◽

Evangelos Sakkopoulos ◽

Vassilios S. Verykios

Keyword(s):

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining

Frequent Itemset Mining Using Genetic Approach

10.1007/978-981-16-3071-2_43 ◽

2021 ◽

pp. 533-540

Author(s):

Renji George Amballoor ◽

Shankar B. Naik

Keyword(s):

Frequent Itemset ◽

Frequent Itemset Mining ◽

Genetic Approach ◽

Itemset Mining

Frequent Itemset Mining Algorithms—A Literature Survey

10.1007/978-981-16-2422-3_13 ◽

2021 ◽

pp. 159-166

Author(s):

M. Sinthuja ◽

D. Evangeline ◽

S. Pravinth Raja ◽

G. Shanmugarathinam

Keyword(s):

Literature Survey ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining ◽

Mining Algorithms

Stable Periodic Frequent Itemset Mining on Uncertain Datasets

10.1109/ccet52649.2021.9544352 ◽

2021 ◽

Author(s):

Ruimeng He ◽

Jinchao Chen ◽

Chenglie Du ◽

Yuxin Duan

Keyword(s):

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining ◽

Stable Periodic

Human resource recommendation algorithm based on improved frequent itemset mining

Future Generation Computer Systems ◽

10.1016/j.future.2021.08.017 ◽

2021 ◽

Author(s):

Liu Zhaoshan ◽

Ma Yiming ◽

Zheng Huihua ◽

Liu Dege ◽

Liu Jing

Keyword(s):

Human Resource ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Recommendation Algorithm ◽

Itemset Mining

A Synopsis Based Approach for Itemset Frequency Estimation over Massive Multi-Transaction Stream

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3465238 ◽

2021 ◽

Vol 16 (2) ◽

pp. 1-30

Author(s):

Guangtao Wang ◽

Gao Cong ◽

Ying Zhang ◽

Zhen Hai ◽

Jieping Ye

Keyword(s):

Frequency Estimation ◽

Frequent Itemsets ◽

Frequent Itemset ◽

Experimental Results ◽

Closure Property ◽

Frequent Itemset Mining ◽

Itemset Mining ◽

Minimum Value ◽

Downward Closure ◽

Bounded Size

The streams where multiple transactions are associated with the same key are prevalent in practice, e.g., a customer has multiple shopping records arriving at different time. Itemset frequency estimation on such streams is very challenging since sampling based methods, such as the popularly used reservoir sampling, cannot be used. In this article, we propose a novel k -Minimum Value (KMV) synopsis based method to estimate the frequency of itemsets over multi-transaction streams. First, we extract the KMV synopses for each item from the stream. Then, we propose a novel estimator to estimate the frequency of an itemset over the KMV synopses. Comparing to the existing estimator, our method is not only more accurate and efficient to calculate but also follows the downward-closure property. These properties enable the incorporation of our new estimator with existing frequent itemset mining (FIM) algorithm (e.g., FP-Growth) to mine frequent itemsets over multi-transaction streams. To demonstrate this, we implement a KMV synopsis based FIM algorithm by integrating our estimator into existing FIM algorithms, and we prove it is capable of guaranteeing the accuracy of FIM with a bounded size of KMV synopsis. Experimental results on massive streams show our estimator can significantly improve on the accuracy for both estimating itemset frequency and FIM compared to the existing estimators.

Marginal frequent itemset mining for fault prevention of railway overhead contact system

ISA Transactions ◽

10.1016/j.isatra.2021.07.018 ◽

2021 ◽

Author(s):

Kaiyi Qian ◽

Shibin Gao ◽

Long Yu

Keyword(s):

Frequent Itemset ◽

Contact System ◽

Frequent Itemset Mining ◽

Itemset Mining

frequent itemset mining
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

FPGA/GPU-based Acceleration for Frequent Itemsets Mining: A Comprehensive Review

Association Rule Mining Algorithms for Big Data using RDD-ECLAT Algorithms

Comparative Analysis on Frequent Itemset Mining Algorithms in Vertically Partitioned Cloud Data

A Database Reconstruction Approach for the Inverse Frequent Itemset Mining Problem

Frequent Itemset Mining Using Genetic Approach

Frequent Itemset Mining Algorithms—A Literature Survey

Stable Periodic Frequent Itemset Mining on Uncertain Datasets

Human resource recommendation algorithm based on improved frequent itemset mining

A Synopsis Based Approach for Itemset Frequency Estimation over Massive Multi-Transaction Stream

Marginal frequent itemset mining for fault prevention of railway overhead contact system

Export Citation Format

frequent itemset miningRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

FPGA/GPU-based Acceleration for Frequent Itemsets Mining: A Comprehensive Review

Association Rule Mining Algorithms for Big Data using RDD-ECLAT Algorithms

Comparative Analysis on Frequent Itemset Mining Algorithms in Vertically Partitioned Cloud Data

A Database Reconstruction Approach for the Inverse Frequent Itemset Mining Problem

Frequent Itemset Mining Using Genetic Approach

Frequent Itemset Mining Algorithms—A Literature Survey

Stable Periodic Frequent Itemset Mining on Uncertain Datasets

Human resource recommendation algorithm based on improved frequent itemset mining

A Synopsis Based Approach for Itemset Frequency Estimation over Massive Multi-Transaction Stream

Marginal frequent itemset mining for fault prevention of railway overhead contact system

frequent itemset mining
Recently Published Documents