Equivalence Class Based Parallel Algorithm for Mining MFI

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.713-715.1712 ◽

2015 ◽

Vol 713-715 ◽

pp. 1712-1715

Author(s):

Hui Wang

Keyword(s):

Parallel Algorithm ◽

Equivalence Class ◽

Load Balance ◽

Search Space ◽

Frequent Itemsets ◽

Novel Technologies ◽

Pruning Strategy ◽

Maximal Frequent Itemsets ◽

Frequency Counting ◽

Serial Algorithm

We present PMFI, a novel parallel algorithm for mining all maximal frequent itemsets from a large database. PMFI uses several techniques to reduce I/O overhead drastically. The key principle is to decompose the search space into prefix-based equivalence classes, and work is distributed among the processors according to equivalence class weights. The database is re-represented in vertical format, so frequency counting reduces to simple tid-list intersection operations. PMFI is built on a novel serial algorithm, MaxMining, which uses a multiple-level backtrack pruning strategy; by selectively duplicating pieces of the database, each processor can mine its maximal frequent itemsets independently. These techniques eliminate the need for synchronization. PMFI also applies a dynamic load-balancing scheme, which is expected to yield better performance.
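
The abstract's three ingredients (vertical format, prefix-based equivalence classes, weight-based distribution) can be illustrated with a minimal Python sketch. It is not the PMFI implementation: the function names, the choice of class weight (number of members in the class), and the greedy assignment rule are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import combinations


def to_vertical(transactions):
    """Map each item to its tid-list: the set of transaction ids containing it."""
    tidlists = defaultdict(set)
    for tid, items in enumerate(transactions):
        for item in items:
            tidlists[item].add(tid)
    return dict(tidlists)


def prefix_classes(tidlists, minsup):
    """Group frequent 2-itemsets by their first item (the shared prefix)."""
    frequent = sorted(i for i, t in tidlists.items() if len(t) >= minsup)
    classes = defaultdict(list)
    for a, b in combinations(frequent, 2):
        tids = tidlists[a] & tidlists[b]  # support counting by intersection
        if len(tids) >= minsup:
            classes[a].append((b, tids))
    return dict(classes)


def assign(classes, n_procs):
    """Greedy static assignment: heaviest class first, to the least-loaded processor.

    The weight used here (class size) is an assumed heuristic, not the
    weight defined in the paper.
    """
    loads = [0] * n_procs
    plan = [[] for _ in range(n_procs)]
    for prefix in sorted(classes, key=lambda p: (-len(classes[p]), p)):
        target = loads.index(min(loads))
        plan[target].append(prefix)
        loads[target] += len(classes[prefix])
    return plan
```

Because each class carries the tid-lists of its own members, a processor that owns a class can keep intersecting locally, which is the intuition behind the claim that synchronization is unnecessary.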

New Policy of Maximal Frequent Itemsets in Data Stream Mining

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.26-28.118 ◽

2010 ◽

Vol 26-28 ◽

pp. 118-122

Author(s):

Chong Huan Xu ◽

Chun Hua Ju

Keyword(s):

Stream Flow ◽

Data Stream ◽

Search Space ◽

Frequent Itemsets ◽

Stream Flows ◽

Algorithm Construct ◽

Maximal Frequent Itemsets ◽

Core Idea ◽

Basic Window ◽

Pruning Techniques

Based on the characteristics of data streams and a sliding-window model, a new algorithm, A-MFI, is proposed for mining maximal frequent itemsets in data streams using a self-adjusting and orderly-compound policy. The algorithm works on basic windows: it updates its information from fragments of the stream flow, scans the stream only once, and stores the results in a frequent itemsets list as the stream flows. The core idea is to construct a self-adjusting, orderly-compound FP-tree, use mixed subset pruning techniques to reduce the search space, and merge nodes with equal minsup in the same branch, compressing them to generate the orderly-compound FP-tree so that superset checking is avoided when mining maximal frequent itemsets. The experimental results show that the algorithm is more efficient in time and space and scales well.
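
The basic-window idea can be sketched in Python as follows. This is only an illustration of one-pass sliding-window bookkeeping over single items; it does not build the FP-tree described in the abstract, and the class name, its parameters, and the per-window `Counter` summaries are assumptions.

```python
from collections import Counter, deque


class SlidingItemCounts:
    """Keep item counts over the last `n_basic_windows` basic windows."""

    def __init__(self, basic_window_size, n_basic_windows):
        self.basic_window_size = basic_window_size
        self.n_basic_windows = n_basic_windows
        self.windows = deque()   # one Counter per closed basic window
        self.current = []        # transactions of the open basic window
        self.totals = Counter()  # counts over all closed windows in range

    def add(self, transaction):
        """Consume one transaction; each transaction is seen exactly once."""
        self.current.append(frozenset(transaction))
        if len(self.current) == self.basic_window_size:
            self._close_basic_window()

    def _close_basic_window(self):
        counts = Counter(item for t in self.current for item in t)
        if len(self.windows) == self.n_basic_windows:
            expired = self.windows.popleft()  # oldest basic window slides out
            self.totals.subtract(expired)
        self.windows.append(counts)
        self.totals.update(counts)
        self.current = []

    def frequent_items(self, minsup):
        return {item for item, c in self.totals.items() if c >= minsup}
```

Summarizing each basic window separately means an expired fragment can be subtracted without rescanning the stream.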

EOBAA: Enhanced Ontology Based Alignment Algorithm for Mining Frequent Patterns

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i3.12.15908 ◽

2018 ◽

Vol 7 (3.12) ◽

pp. 157

Author(s):

D Srinivasa Rao ◽

V Sucharitha ◽

K V.V Satyanarayana

Keyword(s):

Real Time ◽

High Performance ◽

Computation Time ◽

Search Space ◽

Search Tree ◽

Frequent Itemsets ◽

Alignment Algorithm ◽

Frequent Patterns ◽

Pruning Strategy ◽

Real Time Applications

Frequent pattern mining is widely used in applications such as supermarkets, diagnostics, and other real-time applications. The performance of an algorithm is measured by its computational cost, and computing frequent patterns is tedious. Many algorithms and techniques have been implemented and studied to obtain high-performance algorithms, such as PrePost+, which employs the N-list to represent itemsets and directly discovers frequent itemsets using a set-enumeration search tree. However, because of its pruning strategy, it is known to need more computation time to process the search space. It enumerates all itemsets from the datasets exhaustively and does not sort them by utility; it only provides statistical evidence of the most recurring itemsets. This paper proposes the Enhanced Ontology Based Alignment Algorithm (EOBAA) to identify, extract, and sort out the high-utility itemsets (HUIs) from the frequent itemsets (FIs). To improve the similarity measure, the proposed system adopts cosine similarity. Experiments were conducted on 1 real dataset and show the performance of EOBAA in terms of computation time and accuracy.
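
For reference, cosine similarity between two numeric vectors can be computed as in the Python sketch below. How EOBAA encodes itemsets as vectors is not stated in the abstract, so the plain-list inputs and the zero-vector convention here are assumptions.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # assumed convention: similarity with a zero vector is 0
    return dot / (norm_u * norm_v)
```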

MaxMining: A Novel Algorithm for Mining Maximal Frequent Itemset

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.713-715.1765 ◽

2015 ◽

Vol 713-715 ◽

pp. 1765-1768

Author(s):

Hui Wang

Keyword(s):

Iterative Method ◽

Search Space ◽

Frequent Itemsets ◽

Frequent Itemset ◽

Transaction Database ◽

Maximal Frequent Itemsets ◽

Novel Algorithm

We present MaxMining, a new algorithm for mining maximal frequent itemsets from large transaction databases. MaxMining employs depth-first traversal and an iterative method. It re-represents the transaction database in vertical tidset format and traverses the search space with effective pruning strategies that reduce the search space dramatically. MaxMining removes all non-maximal frequent itemsets to obtain the exact set of maximal frequent itemsets directly, with no need to enumerate all frequent itemsets from smaller ones step by step. It backtracks directly to the proper ancestor rather than level by level, skipping redundant frequent itemsets. We found that MaxMining can find all maximal frequent itemsets in large databases more effectively than many previously proposed algorithms that use ordinary pruning strategies.
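
A generic depth-first search for maximal frequent itemsets over vertical tidsets can be sketched in Python as below. It is a textbook-style illustration, not MaxMining itself: it is recursive rather than iterative, it does not implement the paper's direct backtracking to an ancestor, and the function name and the subset-based pruning rule are assumptions.

```python
def maximal_frequent_itemsets(transactions, minsup):
    """Return the maximal frequent itemsets as a list of frozensets."""
    # Vertical representation: item -> set of transaction ids (tidset).
    tidsets = {}
    for tid, items in enumerate(transactions):
        for item in items:
            tidsets.setdefault(item, set()).add(tid)
    items = sorted(i for i, t in tidsets.items() if len(t) >= minsup)
    maximal = []

    def dfs(prefix, prefix_tids, candidates):
        # Prune: if prefix plus every remaining candidate already lies inside
        # a known maximal itemset, this subtree cannot yield anything new.
        reachable = set(prefix) | set(candidates)
        if any(reachable <= m for m in maximal):
            return
        extended = False
        for idx, item in enumerate(candidates):
            tids = prefix_tids & tidsets[item]  # support via intersection
            if len(tids) >= minsup:
                extended = True
                dfs(prefix + [item], tids, candidates[idx + 1:])
        # A prefix that cannot grow is maximal unless an earlier result covers it.
        if not extended and prefix and not any(set(prefix) <= m for m in maximal):
            maximal.append(frozenset(prefix))

    dfs([], set(range(len(transactions))), items)
    return maximal
```

Because items are explored in a fixed order, any frequent superset of a leaf that uses an earlier item has already been visited, so the subset check against `maximal` is enough to discard non-maximal leaves.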

Parallel algorithm for mining frequent itemsets

2005 International Conference on Machine Learning and Cybernetics ◽

10.1109/icmlc.2005.1527295 ◽

2005 ◽

Cited By ~ 2

Author(s):

You-Lin Ruan ◽

Gan Liu ◽

Qing-Hua Li

Keyword(s):

Parallel Algorithm ◽

Frequent Itemsets ◽

Mining Frequent Itemsets

A mining algorithm for distributed global maximal frequent itemsets based on Sorted SCan-Tree

2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC) ◽

10.1109/imcec.2016.7867533 ◽

2016 ◽

Author(s):

Yulei Huang ◽

Jinhuan Wang ◽

Yan Li ◽

Qing Lin

Keyword(s):

Frequent Itemsets ◽

Mining Algorithm ◽

Maximal Frequent Itemsets

Mining Maximal Frequent Itemsets for Intrusion Detection

Grid and Cooperative Computing - GCC 2004 Workshops - Lecture Notes in Computer Science ◽

10.1007/978-3-540-30207-0_53 ◽

2004 ◽

pp. 422-429 ◽

Cited By ~ 3

Author(s):

Hui Wang ◽

Qing-Hua Li ◽

Huanyu Xiong ◽

Sheng-Yi Jiang

Keyword(s):

Intrusion Detection ◽

Frequent Itemsets ◽

Maximal Frequent Itemsets

Role of machine learning and artificial intelligence algorithms for teaching reform of linguistics

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189365 ◽

2020 ◽

pp. 1-12

Author(s):

Wang Li

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Search Space ◽

Teaching Experiment ◽

Machine Learning Algorithms ◽

Teaching Process ◽

Root Cause ◽

Pruning Strategy ◽

Intelligence Models ◽

Artificial Intelligence Models

The teaching of linguistics is constrained by various factors, which leads to poor teaching results and makes the teaching process difficult to evaluate. To improve the efficiency of linguistics teaching, this paper uses improved machine learning algorithms to construct an artificial intelligence teaching model for linguistics. According to the teaching needs of linguistics, the efficiency of the teaching process is improved, teaching evaluation is performed, and the root cause analysis algorithm based on MCTS is optimized. Moreover, drawing on the frequent itemset algorithm in data mining, a layered pruning strategy is proposed to further reduce the search space and improve the efficiency of the model. In addition, this study uses a comparative teaching experiment to examine the efficiency of artificial intelligence models in linguistics teaching. The statistical results show that the proposed model has a certain effect.

A Hybrid Method for Discovering Maximal Frequent Itemsets

2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery ◽

10.1109/fskd.2008.347 ◽

2008 ◽

Cited By ~ 1

Author(s):

Fu-zan Chen ◽

Min-qiang Li

Keyword(s):

Hybrid Method ◽

Frequent Itemsets ◽

Maximal Frequent Itemsets

Efficiently Mining Maximal Frequent Itemsets Based on Digraph

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.610.291 ◽

2014 ◽

Vol 610 ◽

pp. 291-295

Author(s):

Qiang Wu ◽

Ding We Wu ◽

Qin Wang ◽

Shao Min Wen ◽

Rong Tu

Keyword(s):

Data Mining ◽

Frequent Itemsets ◽

Maximal Frequent Itemsets ◽

Novel Algorithm

This paper presents a novel algorithm for mining maximal frequent itemsets. In a pre-processing phase, a digraph is constructed that represents the frequent 2-itemsets, which play an important role in the performance of data mining. The search for maximal frequent itemsets is then carried out in the digraph. Experiments show that the proposed algorithm is efficient for all types of data.
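
One plausible reading of the pre-processing phase is sketched below in Python. The abstract does not specify the edge direction or how the search uses the graph, so the ordering rule (edge from the smaller item to the larger), the function names, and the candidate-extension helper are assumptions.

```python
from collections import Counter
from itertools import combinations


def build_digraph(transactions, minsup):
    """Adjacency sets with an edge a -> b for each frequent 2-itemset {a, b}, a < b."""
    pair_counts = Counter()
    for items in transactions:
        for a, b in combinations(sorted(items), 2):
            pair_counts[(a, b)] += 1
    graph = {}
    for (a, b), count in pair_counts.items():
        if count >= minsup:
            graph.setdefault(a, set()).add(b)
            graph.setdefault(b, set())  # make sure sink nodes appear too
    return graph


def extension_candidates(graph, itemset):
    """Items that form a frequent pair with every member of `itemset`."""
    successors = [graph.get(item, set()) for item in itemset]
    return set.intersection(*successors) if successors else set()
```

Since every subset of a frequent itemset is frequent, an itemset can only be extended by an item adjacent to all of its members; the candidates still need a support check against the database.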

An Efficient Algorithm for Privacy Preserving Maximal Frequent Itemsets Mining

2011 Fourth International Symposium on Parallel Architectures, Algorithms and Programming ◽

10.1109/paap.2011.62 ◽

2011 ◽

Author(s):

Yuqing Miao ◽

Xiaohua Zhang ◽

Kongling Wu ◽

Jie Su

Keyword(s):

Efficient Algorithm ◽

Privacy Preserving ◽

Frequent Itemsets ◽

Frequent Itemsets Mining ◽

Maximal Frequent Itemsets
