Visual Data Mining for Discovering Association Rules

Author(s):  
Kesaraporn Techapichetvanich ◽  
Amitava Datta

Both visualization and data mining have become important tools in discovering hidden relationships in large data sets, and in extracting useful knowledge and information from large databases. Even though many algorithms for mining association rules have been researched extensively in the past decade, they do not incorporate users in the association-rule mining process. Most of these algorithms generate a large number of association rules, some of which are not practically interesting. This chapter presents a new technique that integrates visualization into the mining association rule process. Users can apply their knowledge and be involved in finding interesting association rules through interactive visualization, after obtaining visual feedback as the algorithm generates association rules. In addition, the users gain insight and deeper understanding of their data sets, as well as control over mining meaningful association rules.

2008 ◽  
pp. 2105-2120
Author(s):  
Kesaraporn Techapichetvanich ◽  
Amitava Datta

Both visualization and data mining have become important tools in discovering hidden relationships in large data sets, and in extracting useful knowledge and information from large databases. Even though many algorithms for mining association rules have been researched extensively in the past decade, they do not incorporate users in the association-rule mining process. Most of these algorithms generate a large number of association rules, some of which are not practically interesting. This chapter presents a new technique that integrates visualization into the mining association rule process. Users can apply their knowledge and be involved in finding interesting association rules through interactive visualization, after obtaining visual feedback as the algorithm generates association rules. In addition, the users gain insight and deeper understanding of their data sets, as well as control over mining meaningful association rules.


Author(s):  
Ana Cristina Bicharra Garcia ◽  
Inhauma Ferraz ◽  
Adriana S. Vivacqua

AbstractMost past approaches to data mining have been based on association rules. However, the simple application of association rules usually only changes the user's problem from dealing with millions of data points to dealing with thousands of rules. Although this may somewhat reduce the scale of the problem, it is not a completely satisfactory solution. This paper presents a new data mining technique, called knowledge cohesion (KC), which takes into account a domain ontology and the user's interest in exploring certain data sets to extract knowledge, in the form of semantic nets, from large data sets. The KC method has been successfully applied to mine causal relations from oil platform accident reports. In a comparison with association rule techniques for the same domain, KC has shown a significant improvement in the extraction of relevant knowledge, using processing complexity and knowledge manageability as the evaluation criteria.


Author(s):  
LAWRENCE MAZLACK

Determining causality has been a tantalizing goal throughout human history. Proper sacrifices to the gods were thought to bring rewards; failure to make suitable observations were thought to lead to disaster. Today, data mining holds the promise of extracting unsuspected information from very large databases. Methods have been developed to build association rules from large data sets. Association rules indicate the strength of association of two or more data attributes. In many ways, the interest in association rules is that they offer the promise (or illusion) of causal, or at least, predictive relationships. However, association rules only calculate a joint probability; they do not express a causal relationship. If causal relationships could be discovered, it would be very useful. Our goal is to explore causality in the data mining context.


Author(s):  
Suma B. ◽  
Shobha G.

<div>Association rule mining is a well-known data mining technique used for extracting hidden correlations between data items in large databases. In the majority of the situations, data mining results contain sensitive information about individuals and publishing such data will violate individual secrecy. The challenge of association rule mining is to preserve the confidentiality of sensitive rules when releasing the database to external parties. The association rule hiding technique conceals the knowledge extracted by the sensitive association rules by modifying the database. In this paper, we introduce a border-based algorithm for hiding sensitive association rules. The main purpose of this approach is to conceal the sensitive rule set while maintaining the utility of the database and association rule mining results at the highest level. The performance of the algorithm in terms of the side effects is demonstrated using experiments conducted on two real datasets. The results show that the information loss is minimized without sacrificing the accuracy. </div>


2014 ◽  
Vol 543-547 ◽  
pp. 3569-3572
Author(s):  
Tian Xiang Zhu ◽  
Xiao Lan Tian ◽  
Shu Hui Sun ◽  
Shu Jie Sun

Cloud computing is the latest trend in IT technical development, the importance of cloud databases has been widely acknowledged. There are numerous data in the cloud database and among these data, much potential and valuable knowledge are implicit. The key point is to discover and pick up the useful knowledge automatically. An association rule is one of the main models in mining out these data, and it mainly focuses on the relationship among different areas in the data. This paper puts forward the basic model of data mining based on association rules in cloud database and introduces corresponding mining algorithms.


Author(s):  
Mohamad Fauzy ◽  
Kemas Rahmat Saleh W ◽  
Ibnu Asror

[Id] Prakiraan cuaca saat ini telah menjadi satu hal yang dibutuhkan bagi banyak orang di dunia. Dalam memprediksi hujan pengolahan data cuaca merupakan hal yang penting. Namun permasalahannya, data cuaca yang semakin hari semakin bertambah menyebabkan penumpukan data sehingga pengolahan data tersebut perlu penanganan lebih lanjut. Oleh karena itu pemanfaatan data mining digunakan untuk menyelesaikan masalah ini. Association rule mining adalah salah satu metode data mining yang dapat mengidentifikasi hubungan kesamaan antar item. Penelitian ini dilakukan dengan tiga tahapan utama yaitu : 1) melakukan analisa pola frekuensi tinggi menggunakan algortima apriori; 2) pembentukan aturan asosiasi (association rule); 3) uji kekuatan rule yang terbentuk dengan menghitung lift ratio pada masing-masing rule. Dataset yang digunakan adalah data klimatologi yang diambil dari BMKG stasiun geofisika kelas 1 Bandung. Hasil akhir dari Penelitian ini berupa aturan-aturan asosiasi (association rules) dimana aturan-aturan ini dapat dijadikan sebagai acuan dalam memprediksi cuaca hujan atau tidak hujan untuk satu hari kedepan. Kata kunci : Data mining, association rule, apriori, prediksi hujan [En] Weather forecast today has become a necessary thing for many people in the world. In predicting rain weather data processing is essential. But the problem, weather data that is increasingly growing cause the accumulation of data so that the data processing needs further treatment. Therefore, the use of data mining is used to solve this problem. Association rule mining is one of data mining methods that can identify similarity relationships between items. This research is performed by three main stages, namely: 1) to analyze high frequency patterns using algorithms priori; 2) the establishment of an association rule (association rule); 3) test the strength of the rule which is formed by calculating the ratio elevator on each rule. The dataset used is the climatological data taken from BMKG station 1st class geophysical Bandung. The end result of this research in the form of rules of association (association rules) in which these rules can be used as a reference in predicting the weather is rain or not rain for the next day. Keywords : data mining, association rule, apriori, rain forecast


The main employment and resource of our country is agriculture. In the upcoming days agriculture is going to be one of the important field .Agriculture plays a vital role in economical development of india. Half of the Indian population is mainly depended on agriculture. It is the source of living it is important in everyday life. Comparing to previous years Now-aday's Agriculture is in poor condition. The most important reasons for this is there is no proper guidance for the farmers.Outstanding to these problems, farming affects the yield of Coriander and lack of knowledge about the Coriander cultivation methodologies. And also season to cultivate the coriander and choosing which soil is the best to cultivate the particular Coriander based on the weather condition and also when to harvest the Coriander for the best yield. If the farmer is aware about the Coriander cultivation methodologies and harvesting it will more helpful for the people in the real world and also to increase the Coriander productivity. Data mining is the process of finding new template from large data sets, this technology which is in use in inferring useful knowledge that can be put to use from a vast amount of data. Climate is one of the meteorological data that is well-to-do by important knowledge. This paper presents a brief comparative study of various different techniques used for yield of coriander. The data mining techniques that are in use for the coriander yield estimation are K-Means.


2008 ◽  
Vol 17 (06) ◽  
pp. 1109-1129 ◽  
Author(s):  
BASILIS BOUTSINAS ◽  
COSTAS SIOTOS ◽  
ANTONIS GEROLIMATOS

One of the most important data mining problems is learning association rules of the form "90% of the customers that purchase product x also purchase product y". Discovering association rules from huge volumes of data requires substantial processing power. In this paper we present an efficient distributed algorithm for mining association rules that reduces the time complexity in a magnitude that renders as suitable for scaling up to very large data sets. The proposed algorithm is based on partitioning the initial data set into subsets and processing each subset in parallel. The proposed algorithm can maintain the set of association rules that are extracted when applying an association rule mining algorithm to all the data, by reducing the support threshold during processing the subsets. The above are confirmed by empirical tests that we present and which also demonstrate the utility of the method.


2018 ◽  
Vol 7 (2) ◽  
pp. 100-105
Author(s):  
Simranjit Kaur ◽  
Seema Baghla

Online shopping has a shopping channel or purchasing various items through online medium. Data mining is defined as a process used to extract usable data from a larger set of any raw data. The data set extraction from the demographic profiles and Questionnaire to investigate the gathered based by association. The method for shopping was totally changed with the happening to internet Technology. Association rule mining is one of the important problems of data mining has been used here. The goal of the association rule mining is to detect relationships or associations between specific values of categorical variables in large data sets.


2021 ◽  
Vol 8 (3) ◽  
pp. 65-70
Author(s):  
Mohamad Mohamad Shamie ◽  
Muhammad Mazen Almustafa

Data mining is a process of knowledge discovery to extract the interesting, previously unknown, potentially useful, and nontrivial patterns from large data sets. Currently, there is an increasing interest in data mining in traffic accidents, which makes it a growing new research community. A large number of traffic accidents in recent years have generated large amounts of traffic accident data. The mining algorithms had a great role in determining the causes of these accidents, especially the association rule algorithms. One challenging problem in data mining is effective association rules mining with the huge transactional databases, many efforts have been made to propose and improve association rules mining methods. In the paper, we use the RapidMiner application to design a process that can generate association rules based on clustering algorithms.


Sign in / Sign up

Export Citation Format

Share Document