Mining algorithm for weighted FP-tree frequent item sets based on two-dimensional table

The first step of the Apriori association rule mining algorithm generates many candidate item sets that are not frequent item sets, and processing all of them incurs substantial system overhead. To solve this problem, this paper presents an improved algorithm, based on Apriori, that refines the Apriori pruning step. The method effectively reduces the number of useless candidate item sets and also reduces the number of checks needed to decide whether an item set is frequent. Experimental results show that the improved algorithm is more efficient than the classic Apriori algorithm.
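For orientation, the classic Apriori join, prune, and count steps that the abstract takes as its baseline can be sketched as follows. This is a minimal illustration of the standard algorithm, not the paper's improved pruning step, which the abstract does not specify; the function name `apriori` and the absolute-count `min_support` parameter are assumptions made for this example.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {frozenset: count} for all frequent item sets (classic Apriori).

    min_support is an absolute transaction count (an assumption of this sketch).
    """
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count single items in one pass.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    frequent = dict(current)

    k = 2
    while current:
        prev = set(current)
        # Join step: unions of frequent (k-1)-sets that have exactly k items.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k}
        # Prune step: discard a candidate if any (k-1)-subset is not frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in prev for s in combinations(c, k - 1))
        }
        # Count step: scan the data once for the surviving candidates.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        current = {s: n for s, n in counts.items() if n >= min_support}
        frequent.update(current)
        k += 1
    return frequent
```

The prune step is where useless candidates are discarded before the costly count step, which is the part of the algorithm the abstract says it improves.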

Algorithm of Frequent Item Sets Mining Based on Index Table

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.373-375.1076 ◽

2013 ◽

Vol 373-375 ◽

pp. 1076-1079

Author(s):

Lin Zhang ◽

Nan Zhen Yao ◽

Jian Li Zhang

Keyword(s):

Low Cost ◽

Apriori Algorithm ◽

Two Dimensional ◽

One Dimensional ◽

Single Index ◽

Frequent Item ◽

The One ◽

Index Value ◽

Frequent Item Sets ◽

Index Table

This paper presents a new frequent item set mining algorithm based on an index table. It targets two problems with the Apriori algorithm: the repeated database scans, which cause a heavy I/O load, and the cost of generating large candidate sets. The algorithm first generates a one-dimensional index table by scanning the database once, and then generates a two-dimensional index table from the one-dimensional index table. Once the two-dimensional index table has been generated, a method similar to the Floyd algorithm inserts each single index entry, one at a time, into the two-dimensional index table. If the count of the new index value is greater than or equal to Minsupport after a single index entry has been inserted, the item set of the new index entry is a frequent item set. After all single index entries have been inserted into the two-dimensional index table, the index entries in the table are the maximal frequent item sets. Analysis shows that this algorithm has a lower cost and higher accuracy than the Apriori algorithm and can serve as a reference for work on related rules.
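One way to read the index-table idea is sketched below. The abstract does not define the table layout, so everything here is an assumption made for illustration: the one-dimensional table maps each item to the ids of the transactions containing it, the two-dimensional table holds frequent pairs built from that table alone, and single entries are then inserted into existing entries while the support stays at or above the threshold. The names `build_index`, `mine_with_index`, and `maximal` are hypothetical, and this is not claimed to be the authors' algorithm.

```python
from itertools import combinations


def build_index(transactions):
    """Single database scan: map each item to the ids of transactions holding it."""
    index = {}
    for tid, t in enumerate(transactions):
        for item in t:
            index.setdefault(item, set()).add(tid)
    return index


def mine_with_index(transactions, min_support):
    """Return {frozenset: support} using only the index tables after one scan."""
    one_d = {item: tids for item, tids in build_index(transactions).items()
             if len(tids) >= min_support}

    # Two-dimensional table: frequent pairs, derived from the 1-D table alone.
    table = {}
    for a, b in combinations(sorted(one_d), 2):
        tids = one_d[a] & one_d[b]
        if len(tids) >= min_support:
            table[frozenset((a, b))] = tids

    # Insert single entries into existing entries; keep frequent extensions.
    frontier = dict(table)
    while frontier:
        extended = {}
        for itemset, tids in frontier.items():
            for item, item_tids in one_d.items():
                if item in itemset:
                    continue
                new_tids = tids & item_tids
                if len(new_tids) >= min_support:
                    extended[itemset | {item}] = new_tids
        table.update(extended)
        frontier = extended

    result = {frozenset([item]): len(tids) for item, tids in one_d.items()}
    result.update({s: len(tids) for s, tids in table.items()})
    return result


def maximal(frequent):
    """Keep only item sets that have no frequent proper superset."""
    return {s for s in frequent if not any(s < other for other in frequent)}
```

Because supports are computed by intersecting transaction-id sets, the database itself is read only once, which is the I/O saving the abstract describes.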

SIBA: A fast frequent item sets mining algorithm based on sampling and improved bat algorithm

2015 Chinese Automation Congress (CAC) ◽

10.1109/cac.2015.7382471 ◽

2015 ◽

Cited By ~ 1

Author(s):

Ying Wei ◽

Jian Huang ◽

Zhongjie Zhang ◽

Jiangtao Kong

Keyword(s):

Bat Algorithm ◽

Frequent Item ◽

Mining Algorithm ◽

Frequent Item Sets

A Hash based Mining Algorithm for Maximal Frequent Item Sets using Hashing

IJARCCE ◽

10.17148/ijarcce.2017.63241 ◽

2017 ◽

Vol 6 (3) ◽

pp. 1040-1044 ◽

Cited By ~ 1

Author(s):

Vaishali Galav ◽

Deepak Jain

Keyword(s):

Frequent Item ◽

Mining Algorithm ◽

Frequent Item Sets

Mining Algorithm for Weighted FP-Growth Frequent Item Sets based on Ordered FP-Tree

International Journal of Engineering and Management Research ◽

10.31033/ijemr.9.5.22 ◽

2019 ◽

Vol 09 (05) ◽

pp. 154-158

Author(s):

Yuanyuan Li ◽

Shaohong Yin

Keyword(s):

Frequent Item ◽

Mining Algorithm ◽

Frequent Item Sets

Frequent Item Sets and Association Rules Mining Algorithm Based on Floyd Algorithm

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2015.4065 ◽

2015 ◽

Vol 12 (9) ◽

pp. 2574-2578

Author(s):

Zhang Lin ◽

Zhang Jianli

Keyword(s):

Association Rules ◽

Association Rules Mining ◽

Frequent Item ◽

Mining Algorithm ◽

Frequent Item Sets

A New Mining Algorithm Based on Frequent Item Sets

First International Workshop on Knowledge Discovery and Data Mining (WKDD 2008) ◽

10.1109/wkdd.2008.86 ◽

2008 ◽

Author(s):

Yun Wen

Keyword(s):

Frequent Item ◽

Mining Algorithm ◽

Frequent Item Sets

Biomedical Text Summarization Based on the Itemset Mining Approach

10.4018/978-1-7998-8061-5.ch007 ◽

2021 ◽

pp. 140-152

Author(s):

Supriya Gupta ◽

Aakanksha Sharaff ◽

Naresh Kumar Nagwani

Keyword(s):

Text Mining ◽

Text Summarization ◽

Biomedical Literature ◽

Biomedical Text ◽

Frequent Patterns ◽

Itemset Mining ◽

Frequent Item ◽

Mining Algorithm ◽

Frequent Item Sets ◽

The Given

The expanding amount of text-based biomedical information has prompted the mining of valuable or interesting frequent patterns (words/terms) from extremely large bodies of content, which remains a very challenging task. In this chapter, the authors devise a practical methodology for text mining based on frequent item sets. The chapter presents a strategy that uses item set mining and graph-based summarization to summarize biomedical literature. The authors address the difficulty of recognizing important subjects or concepts in a given biomedical document and show the relations between terms by choosing highly pertinent sentences from biomedical literature using the Apriori itemset mining algorithm. The method uses essential criteria to identify the significant concepts and events, for example the main subjects of the input document. The selected sentences are judged to be highly informative and relevant, and are used to create the final summary.

An Efficient Closed Frequent Item Sets Mining Algorithm-For Mining Closed Frequent Item Sets from Data Streams

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2016.5741 ◽

2016 ◽

Vol 13 (10) ◽

pp. 7467-7474

Author(s):

Venu Madhav Kuthadi ◽

Rajalakshmi Selvaraj

Keyword(s):

Data Streams ◽

Data Stream ◽

Processing Time ◽

Frequent Itemset ◽

Memory Usage ◽

Data Set ◽

Frequent Item ◽

Mining Algorithm ◽

Data Elements ◽

Frequent Item Sets

A data stream is a continuous sequence of data elements generated from a specified source. Mining frequent item sets in dynamic databases and data streams poses challenges that make the task harder than in static databases. Many methods have been developed for frequent itemset mining, but they share the familiar problems of memory usage and processing time, because data elements in a stream arrive at a rapid rate and the incoming data is unbounded and possibly infinite. Given the high speed and large volume of incoming data, a frequent item set mining algorithm must work within limited memory and processing time. To reduce this drawback of existing methods, this paper proposes a new algorithm, named CFIM, for mining closed frequent item sets from data streams based on their utility and consistency. During closed frequent item set mining, a hash table is maintained to check whether a given item set is closed. Computing closed frequent item sets from the data stream minimizes memory usage and processing time. The performance of the proposed technique is analyzed using a synthetic data set and compared with existing mining techniques.
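The closedness check can be illustrated with a small sketch. An item set is closed when no proper superset has the same support, so a hash table keyed by support narrows the comparison to item sets that could actually violate closedness. This works on a static dictionary of supports rather than a stream, and the function name `closed_itemsets` and the table layout are assumptions for this example; the abstract does not describe how CFIM organizes its hash table.

```python
from collections import defaultdict


def closed_itemsets(frequent):
    """frequent: {frozenset: support}. Return the subset that is closed."""
    # Hash table: support value -> item sets with exactly that support.
    by_support = defaultdict(list)
    for itemset, support in frequent.items():
        by_support[support].append(itemset)

    closed = {}
    for itemset, support in frequent.items():
        # Only a proper superset with identical support makes a set non-closed.
        if not any(itemset < other for other in by_support[support]):
            closed[itemset] = support
    return closed
```

For example, with supports `{a}: 3`, `{b}: 2`, and `{a, b}: 2`, the set `{b}` is not closed because `{a, b}` contains it and has the same support, while `{a}` and `{a, b}` are closed.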

Algorithm for discovering frequent item sets based on optimized and regrouped item sets

Journal of Computer Applications ◽

10.3724/sp.j.1087.2010.02332 ◽

2010 ◽

Vol 30 (9) ◽

pp. 2332-2334

Author(s):

Ming WANG ◽

Shun-lin SONG

Keyword(s):

Frequent Item ◽

Frequent Item Sets
