Summarization in Pattern Mining

Author(s):  
Mohammad Al Hasan

The research on mining interesting patterns from transactional or scientific datasets has matured over the last two decades. At present, numerous algorithms exist to mine patterns of varying complexity, such as sets, sequences, trees, and graphs. Collectively, they are referred to as Frequent Pattern Mining (FPM) algorithms. FPM is useful in most prominent knowledge discovery tasks, like classification, clustering, and outlier detection. The mined patterns can further be used in database tasks, such as indexing and hashing, when storing a large collection of patterns. However, the use of FPM in real-life knowledge discovery systems remains considerably lower than its potential. The prime reason is the lack of interpretability caused by the enormity of the output set. For instance, a moderately sized graph dataset of merely a thousand graphs can produce millions of frequent graph patterns at a reasonable support value. This is expected, given the combinatorial search space of pattern mining. However, classification, clustering, and similar knowledge discovery tasks should not use that many patterns as their knowledge nuggets (features), as doing so would increase the time and memory complexity of the system. Moreover, it can degrade task quality because of the well-known “curse of dimensionality” effect. So, in recent years, researchers have felt the need to summarize the output set of FPM algorithms so that the summary set is small, non-redundant, and discriminative. There are different summarization techniques: lossless, profile-based, cluster-based, statistical, etc. In this article, we overview the main concepts of these summarization techniques, with a comparative discussion of their strengths, weaknesses, applicability, and computational cost.
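
As a concrete illustration of the lossless category mentioned above, the following sketch (an illustration with assumed input format, not code from the article) filters a set of frequent itemsets down to the closed ones, i.e. those with no proper superset of identical support. The dictionary of itemset-to-support counts is an assumption made for simplicity.

```python
# Minimal sketch of lossless summarization via closed frequent itemsets.
# Input format (frozenset -> support count) is assumed for illustration.

def closed_itemsets(frequent):
    """frequent: dict mapping frozenset(items) -> support count."""
    closed = {}
    for itemset, sup in frequent.items():
        # An itemset is closed if no proper superset has the same support.
        if not any(itemset < other and sup == other_sup
                   for other, other_sup in frequent.items()):
            closed[itemset] = sup
    return closed

if __name__ == "__main__":
    freq = {
        frozenset("A"): 4, frozenset("B"): 4, frozenset("AB"): 4,
        frozenset("C"): 3, frozenset("AC"): 2,
    }
    # {A} and {B} are absorbed by {A, B}: the summary is smaller, yet every
    # original support can still be recovered from the closed patterns.
    print(closed_itemsets(freq))
```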

Author(s):  
Unil Yun ◽  
Eunchul Yoon

Building on frequent pattern mining, closed frequent pattern mining and weighted frequent pattern mining have been studied to reduce the search space and discover important patterns. In previous definitions of weighted closed patterns, only the supports of patterns are considered when computing their closures, which means that the closures of weighted frequent patterns cannot be checked completely. Moreover, the usefulness of weighted closed frequent patterns depends on the presence of frequent patterns that have supersets with exactly the same weighted support. However, errors such as noise cause slight changes in items' supports or weights, and these changes have significantly negative effects on the mining results; they can break the original characteristics of items and patterns and thus prevent exact and valid analysis. In this paper, to solve these problems, we propose the concept of robust weighted closed frequent pattern mining and, on the basis of this concept, define an approximate bound that relaxes the requirement of precise equality among patterns' weighted supports. We then propose a weighted approximate closed frequent pattern mining algorithm that not only combines the two approaches but also supports fault-tolerant pattern mining under noise constraints. To mine weighted approximate closed frequent patterns efficiently, we suggest pruning and subset-checking methods that reduce the search space. We also report an extensive performance study demonstrating the effectiveness, efficiency, memory usage, scalability, and pattern quality of our algorithm.
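
The following sketch (an illustration under assumed definitions, not the authors' algorithm) shows how such an approximate bound might relax the exact-equality test in weighted closure checking. The weight table, the tolerance epsilon, and the weighted-support definition (average item weight times support) are all assumptions made for illustration.

```python
# Sketch of an approximate weighted-closure check with a relative tolerance.
# Weighted support here = (average item weight) * support, an assumed definition.

def weighted_support(itemset, support, weights):
    avg_weight = sum(weights[i] for i in itemset) / len(itemset)
    return avg_weight * support

def approx_closed(itemset, frequent, weights, epsilon=0.05):
    """Treat `itemset` as approximately closed unless some proper superset
    has a weighted support within relative tolerance `epsilon`."""
    wsup = weighted_support(itemset, frequent[itemset], weights)
    for other, sup in frequent.items():
        if itemset < other:
            other_wsup = weighted_support(other, sup, weights)
            # Exact closure would require other_wsup == wsup; the approximate
            # bound tolerates small noise-induced differences.
            if abs(other_wsup - wsup) <= epsilon * wsup:
                return False
    return True
```

The point of the tolerance is that a pattern whose superset differs in weighted support only by noise is still absorbed, instead of being reported as a separate closed pattern.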


Information sharing among organizations is a common practice in several areas such as business development and marketing. However, some of the sensitive rules that ought to be kept private may be revealed, and such disclosure of sensitive patterns may harm the interests of the organization that owns the data. Therefore, sensitive rules must be protected before the data is shared. In this paper, to enable secure data sharing, the sensitive rules, discovered using a frequent pattern tree, are first perturbed; the sensitive set of rules is perturbed by substitution. This kind of substitution reduces the disclosure risk and increases the utility of the dataset compared with other techniques. Experiments are performed on a real-world dataset, and the results show that the proposed work outperforms various previous methods on the evaluation parameters.
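
A minimal sketch of the general idea, under assumptions of our own (the paper's exact substitution rule and data format are not given here): transactions supporting a sensitive itemset have one of its items replaced by a non-sensitive substitute until the itemset's support drops below the mining threshold.

```python
# Illustrative sketch only: hiding a sensitive itemset by substitution.
# Function name, parameters, and the substitution policy are assumptions.

def hide_by_substitution(transactions, sensitive, substitute, min_support):
    """transactions: list of item sets; sensitive: itemset to hide;
    substitute: non-sensitive replacement item; min_support: absolute threshold."""
    hidden = [set(t) for t in transactions]
    supporting = [t for t in hidden if sensitive <= t]
    # Substitute in just enough transactions to push support below the threshold.
    to_modify = len(supporting) - (min_support - 1)
    for t in supporting[:max(to_modify, 0)]:
        victim = next(iter(sensitive))  # pick one sensitive item to replace
        t.discard(victim)
        t.add(substitute)               # substitution keeps the transaction length
    return hidden
```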


2011 ◽  
Vol 22 (8) ◽  
pp. 1749-1760
Author(s):  
Yu-Hong GUO ◽  
Yun-Hai TONG ◽  
Shi-Wei TANG ◽  
Leng-Dong WU
