Search Space Partition-Based Rule Mining

Classification and associative rule mining are two substantial areas in data mining. Some scientists attempt to integrate these two field called rule-based classifiers. Rule-based classifiers can play a very important role in applications such as fraud detection, medical diagnosis, etc. Numerous previous studies have shown that this type of classifier achieves a higher classification accuracy than traditional classification algorithms. However, they still suffer from a fundamental limitation. Many rule-based classifiers used various greedy techniques to prune the redundant rules that lead to missing some important rules. Another challenge that must be considered is related to the enormous set of mined rules that result in high processing overhead. The result of these approaches is that the final selected rules may not be the global best rules. These algorithms are not successful at exploiting search space effectively in order to select the best subset of candidate rules. We merged the Apriori algorithm, Harmony Search, and classification-based association rules (CBA) algorithm in order to build a rule-based classifier. We applied a modified version of the Apriori algorithm with multiple minimum support for extracting useful rules for each class in the dataset. Instead of using a large number of candidate rules, binary Harmony Search was utilized for selecting the best subset of rules that appropriate for building a classification model. We applied the proposed method on a seventeen benchmark dataset and compared its result with traditional association rule classification algorithms. The statistical results show that our proposed method outperformed other rule-based approaches.

Download Full-text

Efficient Implementations for UWEP Incremental Frequent Itemset Mining Algorithm

International Journal of Applied Logistics ◽

10.4018/ijal.2021010102 ◽

2021 ◽

Vol 11 (1) ◽

pp. 18-37

Author(s):

Mehmet Bicer ◽

Daniel Indictor ◽

Ryan Yang ◽

Xiaowen Zhang

Keyword(s):

Association Rules ◽

Association Rule ◽

Search Space ◽

Frequent Itemset ◽

Incremental Algorithm ◽

Frequent Patterns ◽

Lazy Evaluation ◽

Rule Mining ◽

Implementation Techniques ◽

Common Technique

Association rule mining is a common technique used in discovering interesting frequent patterns in data acquired in various application domains. The search space combinatorically explodes as the size of the data increases. Furthermore, the introduction of new data can invalidate old frequent patterns and introduce new ones. Hence, while finding the association rules efficiently is an important problem, maintaining and updating them is also crucial. Several algorithms have been introduced to find the association rules efficiently. One of them is Apriori. There are also algorithms written to update or maintain the existing association rules. Update with early pruning (UWEP) is one such algorithm. In this paper, the authors propose that in certain conditions it is preferable to use an incremental algorithm as opposed to the classic Apriori algorithm. They also propose new implementation techniques and improvements to the original UWEP paper in an algorithm we call UWEP2. These include the use of memorization and lazy evaluation to reduce scans of the dataset.

Download Full-text

Combining search space partition and abstraction for LTL model checking

Science in China Series F Information Sciences ◽

10.1007/s11432-007-0073-y ◽

2007 ◽

Vol 50 (6) ◽

pp. 793-810 ◽

Cited By ~ 1

Author(s):

Fei Pu ◽

WenHui Zhang

Keyword(s):

Model Checking ◽

Search Space ◽

Space Partition ◽

Ltl Model Checking

Download Full-text

Data Mining Application using Association Rule Mining ECLAT Algorithm Based on SPMF

MATEC Web of Conferences ◽

10.1051/matecconf/201816401019 ◽

2018 ◽

Vol 164 ◽

pp. 01019

Author(s):

Jason Reynaldo ◽

David Boy Tonara

Keyword(s):

Data Mining ◽

Open Source ◽

Data Storage ◽

Association Rule ◽

Association Rule Mining ◽

Search Space ◽

Rule Mining ◽

Master Data ◽

Research Domain ◽

Data Mining Application

Data mining is an important research domain that currently focused on knowledge discovery database. Where data from the database are mined so that information can be generated and used effectively and efficiently by humans. Mining can be applied to the market analysis. Association Rule Mining (ARM) has become the core of data mining. The search space is exponential in the number of database attributes and with millions of database objects the problem of I/O minimization becomes paramount. To get the information and the data such as, observation of the master data storage systems and interviews were done. Then, ECLAT algorithm is applied to the open-source library SPMF. In this project, this application can perform data mining assisted by open source SPMF with determined writing format of transaction data. It successfully displayed data with 100 % success rate. The application can generate a new easier knowledge which can be used for marketing the product.

Download Full-text

Boosting association rule mining in large datasets via Gibbs sampling

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.1604553113 ◽

2016 ◽

Vol 113 (18) ◽

pp. 4958-4963 ◽

Cited By ~ 5

Author(s):

Guoqi Qian ◽

Calyampudi Radhakrishna Rao ◽

Xiaoying Sun ◽

Yuehua Wu

Keyword(s):

Gibbs Sampling ◽

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

General Rule ◽

Search Space ◽

Stochastic Search ◽

Importance Measure ◽

Rule Mining ◽

Ergodic Markov Chain

Current algorithms for association rule mining from transaction data are mostly deterministic and enumerative. They can be computationally intractable even for mining a dataset containing just a few hundred transaction items, if no action is taken to constrain the search space. In this paper, we develop a Gibbs-sampling–induced stochastic search procedure to randomly sample association rules from the itemset space, and perform rule mining from the reduced transaction dataset generated by the sample. Also a general rule importance measure is proposed to direct the stochastic search so that, as a result of the randomly generated association rules constituting an ergodic Markov chain, the overall most important rules in the itemset space can be uncovered from the reduced dataset with probability 1 in the limit. In the simulation study and a real genomic data example, we show how to boost association rule mining by an integrated use of the stochastic search and the Apriori algorithm.

Download Full-text

Search Space Partition and Case Basis Exploration for Reducing Model Checking Complexity

Automated Technology for Verification and Analysis - Lecture Notes in Computer Science ◽

10.1007/978-3-540-30476-0_8 ◽

2004 ◽

pp. 34-48 ◽

Cited By ~ 4

Author(s):

Bai Su ◽

Wenhui Zhang

Keyword(s):

Model Checking ◽

Search Space ◽

Space Partition ◽

Case Basis

Download Full-text

Mining Changes in Temporal Patterns in Latest Time Window for Knowledge Discovery

Journal of Information & Knowledge Management ◽

10.1142/s021964921950028x ◽

2019 ◽

Vol 18 (03) ◽

pp. 1950028

Author(s):

Sheel Shalini ◽

Kanhaiya Lal

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Time Window ◽

Search Space ◽

Temporal Association ◽

Incremental Mining ◽

Rule Mining ◽

Novel Approach ◽

Temporal Association Rule ◽

Mining Algorithms

Temporal Association Rule mining uncovers time integrated associations in a transactional database. However, in an environment where database is regularly updated, maintenance of rules is a challenging process. Earlier algorithms suggested for maintaining frequent patterns either suffered from the problem of repeated scanning or the problem of larger storage space. Therefore, this paper proposes an algorithm “Probabilistic Incremental Temporal Association Rule Mining (PITARM)” that uncovers the changed behaviour in an updated database to maintain the rules efficiently. The proposed algorithm defines two support measures to identify itemsets expected to be frequent in the successive segment in advance. It reduces unnecessary scanning of itemsets in the entire database through three-fold verification and avoids generating redundant supersets and power sets from infrequent itemsets. Implementation of pruning technique in incremental mining is a novel approach that makes it better than earlier incremental mining algorithms and consequently reduces search space to a great extent. It scans the entire database only once, thus reducing execution time. Experimental results confirm that it is an enhancement over earlier algorithms.

Download Full-text