scholarly journals A New Importance Measure of Association Rules Using Information Theory

2014 ◽  
Vol 3 (1) ◽  
pp. 37-42
Author(s):  
Chang-Hwan Lee ◽  
Joohyun Bae
2016 ◽  
Vol 113 (18) ◽  
pp. 4958-4963 ◽  
Author(s):  
Guoqi Qian ◽  
Calyampudi Radhakrishna Rao ◽  
Xiaoying Sun ◽  
Yuehua Wu

Current algorithms for association rule mining from transaction data are mostly deterministic and enumerative. They can be computationally intractable even for mining a dataset containing just a few hundred transaction items, if no action is taken to constrain the search space. In this paper, we develop a Gibbs-sampling–induced stochastic search procedure to randomly sample association rules from the itemset space, and perform rule mining from the reduced transaction dataset generated by the sample. Also a general rule importance measure is proposed to direct the stochastic search so that, as a result of the randomly generated association rules constituting an ergodic Markov chain, the overall most important rules in the itemset space can be uncovered from the reduced dataset with probability 1 in the limit. In the simulation study and a real genomic data example, we show how to boost association rule mining by an integrated use of the stochastic search and the Apriori algorithm.


Author(s):  
Gabriele Kern-Isberner

Knowledge discovery refers to the process of extracting new, interesting, and useful knowledge from data and presenting it in an intelligible way to the user. Roughly, knowledge discovery can be considered a three-step process: preprocessing data; data mining, in which the actual exploratory work is done; and interpreting the results to the user. Here, I focus on the data-mining step, assuming that a suitable set of data has been chosen properly. The patterns that we search for in the data are plausible relationships, which agents may use to establish cognitive links for reasoning. Such plausible relationships can be expressed via association rules. Usually, the criteria to judge the relevance of such rules are either frequency based (Bayardo & Agrawal, 1999) or causality based (for Bayesian networks, see Spirtes, Glymour, & Scheines, 1993). Here, I will pursue a different approach that aims at extracting what can be regarded as structures of knowledge — relationships that may support the inductive reasoning of agents and whose relevance is founded on information theory. The method that I will sketch in this article takes numerical relationships found in data and interprets these relationships as structural ones, using mostly algebraic techniques to elaborate structural information.


2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Sibing Sun

Data science has expanded at an exponential growth with the advancement of big data technology. The data analysis techniques need to delve deeper to find valuable information (Sarac 2017). The notion of edge computing is broadly acknowledged. Edge-enabled solutions provide computing, analysis, storage, and control nearer to the edge of the network, which support the efficient processing and decision-making. Machine learning has also attained significant attention in this context due to its flexibility and its ability to provide a variety of supervised, unsupervised, and semisupervised techniques. This research presents a specific model to evaluate the potential correlation of piano teaching using machine learning. The data analysis is performed at the edges of network for efficient results (Tan et al. 2017). The association rule mining technique of machine learning is utilized with the integration of improved T-test method. The improved T-test is performed for the measurement of association rules and proposed a new measure and influence degree of association rules. It is evident from the results that the use of the degree of influence as a measure of association rules to find the potential relevance of multimedia-assistant piano teaching evaluation data is extremely feasible. It overcomes shortcomings of existing measurement standards and reduces the generation of redundant rules. The existing literature highlights the concepts of evaluation of potential correlation and evaluates the advantages. However, there is a lack of an effective strategy for piano teaching. The proposed model performs efficient calculation and storage. The feasibility and effectiveness of the proposed framework are verified using the analysis of the actual dataset. The verification results show that it is feasible and valuable to find the potential relevance of multimedia-assisted piano teaching evaluation.


Sign in / Sign up

Export Citation Format

Share Document