An efficient closed frequent itemset miner for the MOA stream mining system

AI Communications ◽

10.3233/aic-140615 ◽

2015 ◽

Vol 28 (1) ◽

pp. 143-158 ◽

Author(s):

Massimo Quadrana ◽

Albert Bifet ◽

Ricard Gavaldà

Keyword(s):

Frequent Itemset ◽

Stream Mining ◽

Mining System ◽

Closed Frequent Itemset

Download Full-text

A new closed frequent itemset mining algorithm based on GPU and improved vertical structure

Concurrency and Computation Practice and Experience ◽

10.1002/cpe.3904 ◽

2016 ◽

Vol 29 (6) ◽

pp. e3904 ◽

Author(s):

Yun Li ◽

Jie Xu ◽

Yun-Hao Yuan ◽

Ling Chen

Keyword(s):

Vertical Structure ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining ◽

Mining Algorithm ◽

Closed Frequent Itemset

Download Full-text

VEDAS: A Mobile and Distributed Data Stream Mining System for Real-Time Vehicle Monitoring

Proceedings of the 2004 SIAM International Conference on Data Mining ◽

10.1137/1.9781611972740.28 ◽

2004 ◽

Author(s):

Hillol Kargupta ◽

Ruchita Bhargava ◽

Kun Liu ◽

Michael Powers ◽

Patrick Blair ◽

...

Keyword(s):

Real Time ◽

Data Stream ◽

Data Stream Mining ◽

Distributed Data ◽

Stream Mining ◽

Mining System ◽

Vehicle Monitoring

Download Full-text

Towards Scalable Algorithm for Closed Itemset Mining in High-Dimensional Data

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v8.i2.pp487-494 ◽

2017 ◽

Vol 8 (2) ◽

pp. 487

Author(s):

Fatimah Audah Md. Zaki ◽

Nurul Fariza Zulkurnain

Keyword(s):

High Dimensional Data ◽

Search Tree ◽

Frequent Itemsets ◽

Main Memory ◽

Frequent Itemset ◽

High Dimensional ◽

Major Drawback ◽

Scalable Algorithm ◽

Support Threshold ◽

Closed Frequent Itemset

<p>Mining frequent itemsets from large dataset has a major drawback in which the explosive number of itemsets requires additional mining process which might filter the interesting ones. Therefore, as the solution, the concept of closed frequent itemset was introduced that is lossless and condensed representation of all the frequent itemsets and their corresponding supports. Unfortunately, many algorithms are not memory-efficient since it requires the storage of closed itemsets in main memory for duplication checks. This paper presents BFF, a scalable algorithm for discovering closed frequent itemsets from high-dimensional data. Unlike many well-known algorithms, BFF traverses the search tree in breadth-first manner resulted to a minimum use of memory and less running time. The tests conducted on a number of microarray datasets show that the performance of this algorithm improved significantly as the support threshold decreases which is crucial in generating more interesting rules.</p>

Download Full-text

Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining

Mathematical Problems in Engineering ◽

10.1155/2015/874032 ◽

2015 ◽

Vol 2015 ◽

pp. 1-10

Author(s):

Yanhuang Jiang ◽

Qiangli Zhao ◽

Yutong Lu

Keyword(s):

Data Stream ◽

Data Stream Mining ◽

Memory Retention ◽

Stream Mining ◽

Mining System ◽

Complex Concept ◽

Knowledge Repository ◽

Component Classifier ◽

Concept Drifts ◽

Forgetting Mechanism

Combining several classifiers on sequential chunks of training instances is a popular strategy for data stream mining with concept drifts. This paper introduces human recalling and forgetting mechanisms into a data stream mining system and proposes a Memorizing Based Data Stream Mining (MDSM) model. In this model, each component classifier is regarded as a piece of knowledge that a human obtains through learning some materials and has a memory retention value reflecting its usefulness in the history. The classifiers with high memory retention values are reserved in a “knowledge repository.” When a new data chunk comes, most useful classifiers will be selected (recalled) from the repository and compose the current target ensemble. Based on MDSM, we put forward a new algorithm, MAE (Memorizing Based Adaptive Ensemble), which uses Ebbinghaus forgetting curve as the forgetting mechanism and adopts ensemble pruning as the recalling mechanism. Compared with four popular data stream mining approaches on the datasets with different concept drifts, the experimental results show that MAE achieves high and stable predicting accuracy, especially for the applications with recurring or complex concept drifts. The results also prove the effectiveness of MDSM model.

Download Full-text

MineFleet : The Vehicle Data Stream Mining System for Ubiquitous Environments

Ubiquitous Knowledge Discovery - Lecture Notes in Computer Science ◽

10.1007/978-3-642-16392-0_14 ◽

2010 ◽

pp. 235-254 ◽

Author(s):

Hillol Kargupta ◽

Michael Gilligan ◽

Vasundhara Puttagunta ◽

Kakali Sarkar ◽

Martin Klein ◽

...

Keyword(s):

Data Stream ◽

Data Stream Mining ◽

Stream Mining ◽

Mining System ◽

Download Full-text

A Completeness on an Online ϵ-Approximation Algorithm for Closed Frequent Itemset Mining in a Transactional Stream

Transactions of the Japanese Society for Artificial Intelligence ◽

10.1527/tjsai.b-g52 ◽

2016 ◽

Vol 31 (5) ◽

pp. B-G52_1-10

Author(s):

Koji Iwanuma ◽

Yoshitaka Yamamoto ◽

Shoshi Fukuda

Keyword(s):

Approximation Algorithm ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining ◽

Closed Frequent Itemset

Download Full-text

SAT‐based and CP‐based declarative approaches for Top‐Rank‐ K closed frequent itemset mining

International Journal of Intelligent Systems ◽

10.1002/int.22294 ◽

2020 ◽

Vol 36 (1) ◽

pp. 112-151

Author(s):

Sa'ed Abed ◽

Areej A. Abdelaal ◽

Mohammad H. Al‐Shayeji ◽

Imtiaz Ahmad

Keyword(s):

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining ◽

Closed Frequent Itemset

Download Full-text

A Scalable Distributed Stream Mining System for Highway Traffic Data

Lecture Notes in Computer Science - Knowledge Discovery in Databases: PKDD 2006 ◽

10.1007/11871637_31 ◽

2006 ◽

pp. 309-321 ◽

Author(s):

Ying Liu ◽

Alok Choudhary ◽

Jianhong Zhou ◽

Ashfaq Khokhar

Keyword(s):

Traffic Data ◽

Highway Traffic ◽

Stream Mining ◽

Download Full-text

A Prime Number Based Approach for Closed Frequent Itemset Mining in Big Data

Lecture Notes in Computer Science - Database and Expert Systems Applications ◽

10.1007/978-3-319-22849-5_35 ◽

2015 ◽

pp. 509-516 ◽

Author(s):

Mehdi Zitouni ◽

Reza Akbarinia ◽

Sadok Ben Yahia ◽

Florent Masseglia

Keyword(s):

Big Data ◽

Prime Number ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining ◽

Closed Frequent Itemset

Download Full-text

Closed Frequent Itemset Mining with Arbitrary Side Constraints

2018 IEEE International Conference on Data Mining Workshops (ICDMW) ◽

10.1109/icdmw.2018.00175 ◽

2018 ◽

Author(s):

Gokberk Kocak ◽

Ozgur Akgun ◽

Ian Miguel ◽

Peter Nightingale

Keyword(s):

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining ◽

Side Constraints ◽

Closed Frequent Itemset

Download Full-text