A Parallel Direct‐Vertical Map Reduce Programming model for an effective frequent pattern mining in a dispersed environment

There are many algorithms available in data mining to search interesting patterns from transactional databases of precise data. Frequent pattern mining is a technique to find the frequently occurred items in data mining. Most of the techniques used to find all the interesting patterns from a collection of precise data, where items occurred in each transaction are certainly known to the system. As well as in many real-time applications, users are interested in a tiny portion of large frequent patterns. So the proposed user constrained mining approach, will help to find frequent patterns in which user is interested. This approach will efficiently find user interested frequent patterns by applying user constraints on the collections of uncertain data. The user can specify their own interest in the form of constraints and uses the Map Reduce model to find uncertain frequent pattern that satisfy the user-specified constraints

Download Full-text

Frequent Pattern Mining Based on Pattern Space Division in Map/Reduce Cluster

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.588-589.2038 ◽

2012 ◽

Vol 588-589 ◽

pp. 2038-2041

Author(s):

Qian Liu ◽

Ming Chen

Keyword(s):

Pattern Mining ◽

Recursive Algorithm ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Map Reduce ◽

Data Set ◽

Combinatorial Explosion ◽

Space Division ◽

Pattern Space ◽

The Many

By means of pattern space division and based on Map/Reduce, the problem of processing the many-to-many corresponding relationship between the data set and the patterns set is converted to the problem of processing the many-to-many corresponding relationship between the data subsets and the pattern subspaces associated with the frequent 1-itemsets. Thus, the scale of the intermediate key/value pairs set is reduced so dramatically that the problem of single Map node bottleneck which results from combinatorial explosion of candidate patterns space is avoided. Over three rounds of Map/Reduce tasks, the pattern space is constructed and divided, the filtering rules is established and employed, father more, the mining of frequent patterns is realized in each pattern subspace independently. By making the best of both the universal trait of the entire pattern space and the individuality of each pattern subspace, the optimized non-recursive algorithm is designed and implemented to improve the efficiency of mining phase.

Download Full-text

An Adaptive Data Distribution Through Tree Rules in Frequent Pattern Mining

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit183894 ◽

2018 ◽

pp. 300-305

Keyword(s):

Information Sharing ◽

Pattern Mining ◽

Data Distribution ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

General Development ◽

Secure Information ◽

Evaluation Parameters ◽

Secure Information Sharing

Information sharing among the associations is a general development in a couple of zones like business headway and exhibiting. As bit of the touchy principles that ought to be kept private may be uncovered and such disclosure of delicate examples may impacts the advantages of the association that have the data. Subsequently the standards which are delicate must be secured before sharing the data. In this paper to give secure information sharing delicate guidelines are bothered first which was found by incessant example tree. Here touchy arrangement of principles are bothered by substitution. This kind of substitution diminishes the hazard and increment the utility of the dataset when contrasted with different techniques. Examination is done on certifiable dataset. Results shows that proposed work is better as appear differently in relation to various past strategies on the introduce of evaluation parameters.

Download Full-text

Learning and Synchronized Privacy Preserving Frequent Pattern Mining

Journal of Software ◽

10.3724/sp.j.1001.2011.04000 ◽

2011 ◽

Vol 22 (8) ◽

pp. 1749-1760

Author(s):

Yu-Hong GUO ◽

Yun-Hai TONG ◽

Shi-Wei TANG ◽

Leng-Dong WU

Keyword(s):

Pattern Mining ◽

Frequent Pattern Mining ◽

Privacy Preserving ◽

Frequent Pattern

Download Full-text

RAKING: An Efficient K-Maximal Frequent Pattern Mining Algorithm on Uncertain Graph Database

Chinese Journal of Computers ◽

10.3724/sp.j.1016.2010.01387 ◽

2010 ◽

Vol 33 (8) ◽

pp. 1387-1395 ◽

Cited By ~ 4

Author(s):

Meng HAN ◽

Wei ZHANG ◽

Jian-Zhong LI

Keyword(s):

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Graph Database ◽

Uncertain Graph ◽

Mining Algorithm ◽

Maximal Frequent Pattern

Download Full-text

Sliding window based weighted maximal frequent pattern mining over data streams

Expert Systems with Applications ◽

10.1016/j.eswa.2013.07.094 ◽

2014 ◽

Vol 41 (2) ◽

pp. 694-708 ◽

Cited By ~ 64

Author(s):

Gangin Lee ◽

Unil Yun ◽

Keun Ho Ryu

Keyword(s):

Data Streams ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Sliding Window ◽

Frequent Pattern ◽

Maximal Frequent Pattern

Download Full-text

Deep learning frequent pattern mining on static semi structured data streams for improving fast speed and complex data streams

2021 7th International Conference on Optimization and Applications (ICOA) ◽

10.1109/icoa51614.2021.9442621 ◽

2021 ◽

Author(s):

G. Suseendran ◽

D. Balaganesh ◽

D. Akila ◽

Souvik Pal

Keyword(s):

Deep Learning ◽

Data Streams ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Structured Data ◽

Frequent Pattern ◽

Complex Data ◽

Fast Speed

Download Full-text

Genotype Pattern Mining for Pairs of Interacting Variants Underlying Digenic Traits

Genes ◽

10.3390/genes12081160 ◽

2021 ◽

Vol 12 (8) ◽

pp. 1160

Author(s):

Atsuko Okazaki ◽

Sukanya Horpaopan ◽

Qingrun Zhang ◽

Matthew Randesi ◽

Jurg Ott

Keyword(s):

Null Hypothesis ◽

Pattern Mining ◽

Genetic Diseases ◽

Frequent Pattern Mining ◽

Case Control ◽

Frequent Pattern ◽

Permutation Testing ◽

Case Control Studies ◽

P Values ◽

Dna Variants

Some genetic diseases (“digenic traits”) are due to the interaction between two DNA variants, which presumably reflects biochemical interactions. For example, certain forms of Retinitis Pigmentosa, a type of blindness, occur in the presence of two mutant variants, one each in the ROM1 and RDS genes, while the occurrence of only one such variant results in a normal phenotype. Detecting variant pairs underlying digenic traits by standard genetic methods is difficult and is downright impossible when individual variants alone have minimal effects. Frequent pattern mining (FPM) methods are known to detect patterns of items. We make use of FPM approaches to find pairs of genotypes (from different variants) that can discriminate between cases and controls. Our method is based on genotype patterns of length two, and permutation testing allows assigning p-values to genotype patterns, where the null hypothesis refers to equal pattern frequencies in cases and controls. We compare different interaction search approaches and their properties on the basis of published datasets. Our implementation of FPM to case-control studies is freely available.

Download Full-text