Implementation of K-Means Algorithm using Clustering Rules on Medical Data Sets

When mining frequent item sets with a low minimum support, generating candidate sets is a time-consuming and frequently repeated operation. The K-Means algorithm does not need to generate candidate sets: the database that supplies the frequent item sets is compressed into a frequent pattern tree (FP-tree), and the frequent item sets are mined from that tree. These algorithms are considered efficient because of their compact structure and because they generate fewer candidate item sets than Apriori and Apriori-like algorithms. This paper therefore presents the basic concepts of several algorithms (K-Means Algorithm, COFI-Tree, CT-PRO) built on FP-tree-like structures for mining frequent item sets, along with their capabilities and a comparison. The main concern of this study is applying data mining to spatial data to generate rules and patterns with the Frequent Pattern (FP)-Growth algorithm. The paper shows how data mining can be applied to spatial data.
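
The compression step this abstract describes, turning the transaction database into a frequent pattern tree, is the structure FP-Growth is built on. The following is a minimal sketch of that construction only, not the paper's implementation. The names `FPNode`, `build_fp_tree`, and `count_nodes`, the alphabetical tie-break, and the toy symptom transactions are all assumptions made for illustration.

```python
from collections import Counter


class FPNode:
    """One node of the prefix tree: an item, its count, and child nodes."""

    def __init__(self, item=None):
        self.item = item
        self.count = 0
        self.children = {}


def build_fp_tree(transactions, min_support):
    """Compress transactions into an FP-tree using two database scans."""
    # Scan 1: count each item once per transaction, keep the frequent ones.
    counts = Counter(item for t in transactions for item in set(t))
    frequent = {i: c for i, c in counts.items() if c >= min_support}

    root = FPNode()
    # Scan 2: insert each transaction with items ordered by descending count
    # (ties broken alphabetically) so shared prefixes collapse onto one path.
    for t in transactions:
        items = sorted((i for i in set(t) if i in frequent),
                       key=lambda i: (-frequent[i], i))
        node = root
        for item in items:
            if item not in node.children:
                node.children[item] = FPNode(item)
            node = node.children[item]
            node.count += 1
    return root, frequent


def count_nodes(node):
    """Number of item nodes below `node` (the root itself is not counted)."""
    return sum(1 + count_nodes(child) for child in node.children.values())


# Hypothetical toy data: each transaction is a list of recorded symptoms.
transactions = [
    ["fever", "cough", "fatigue"],
    ["fever", "cough"],
    ["fever", "headache"],
    ["cough", "fatigue"],
    ["fever", "cough", "fatigue"],
]
tree, frequent = build_fp_tree(transactions, min_support=2)
print(frequent)           # item counts that reach the minimum support
print(count_nodes(tree))  # 5 nodes hold the 11 frequent item occurrences
```

No candidate item sets are generated at any point: the two scans only count items and insert ordered transactions. Mining the tree for frequent item sets is a separate recursive step that this sketch leaves out.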

Extrication of Apriori Algorithm using Association Rules on Medical Data sets

International Journal of Scientific Research in Science Engineering and Technology ◽

10.32628/ijsrset19627 ◽

2019 ◽

pp. 107-112

Author(s):

Anusha Viswanadapalli ◽

Praveen Kumar Nelapati

Keyword(s):

Data Mining ◽

Research Study ◽

Medical Data ◽

Frequent Pattern ◽

Data Sets ◽

Apriori Algorithm ◽

Compact Structure ◽

Frequent Item ◽

Frequent Pattern Tree ◽

Frequent Item Sets

When mining frequent item sets with a low minimum support, generating candidate sets is a time-consuming and frequently repeated operation. The APRIORI-Growth algorithm does not need to generate candidate sets: the database that supplies the frequent item sets is compressed into a frequent pattern tree (APRIORI tree), and the frequent item sets are mined from that tree. These algorithms are considered efficient because of their compact structure and because they generate fewer candidate item sets than Apriori and Apriori-like algorithms. This paper therefore presents the basic concepts of several algorithms (APRIORI-Growth, COFI-Tree, CT-PRO) built on APRIORI-tree-like structures for mining frequent item sets, along with their capabilities and a comparison. The main concern of this study is applying data mining to medical data to generate rules and patterns with the Frequent Pattern (APRIORI)-Growth algorithm. The paper shows how data mining can be applied to medical data.

An Improvised Frequent Pattern Tree Based Association Rule Mining Technique with Mining Frequent Item Sets Algorithm and a Modified Header Table

International Journal of Data Mining & Knowledge Management Process ◽

10.5121/ijdkp.2015.5204 ◽

2015 ◽

Vol 5 (2) ◽

pp. 39-51

Author(s):

Vandit Agarwal ◽

Mandhani Kushal ◽

Preetham Kumar

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Frequent Pattern ◽

Rule Mining ◽

Mining Technique ◽

Frequent Item ◽

Frequent Pattern Tree ◽

Frequent Item Sets

The Integral of Spatial Data Mining in the Era of Big Data

Advances in Business Information Systems and Analytics - Handbook of Research on Advanced Data Mining Techniques and Applications for Business Intelligence ◽

10.4018/978-1-5225-2031-3.ch006 ◽

2017 ◽

pp. 90-126

Author(s):

Gebeyehu Belay Gebremeskel ◽

Chai Yi ◽

Zhongshi He

Keyword(s):

Data Mining ◽

Data Warehouse ◽

Spatial Data ◽

High Volume ◽

Spatial Data Mining ◽

Research Field ◽

Data Sets ◽

Data Types ◽

Basic Principles ◽

Gis Data

Data Mining (DM) is a rapidly expanding field in many disciplines, and it holds great promise for analyzing massive data of many types, including geospatial, image, and other forms of data sets. This fast-growing data, collected and generated from various sources, is characterized by high volume, velocity, variety, variability, and value, among other properties, and it is too large and complex to capture, store, and analyze with traditional tools. Spatial data mining (SDM) is therefore the process of searching for and discovering valuable information and knowledge in large volumes of spatial data, drawing basic principles from databases, machine learning, statistics, pattern recognition, and 'soft' computing. Using DM techniques enables more efficient use of the data warehouse. SDM is thus becoming an emerging research field in the geosciences because of the increasing amount of data, which leads to promising new applications. The integral SDM on which this chapter focuses is inference over geospatial and GIS data.

Security and Verification of Server Data Using Frequent Itemset Mining in Ecommerce

International Journal of Synthetic Emotions ◽

10.4018/ijse.2017010103 ◽

2017 ◽

Vol 8 (1) ◽

pp. 31-43

Author(s):

Zuber Shaikh ◽

Antara Mohadikar ◽

Rachana Nayak ◽

Rohith Padamadan

Keyword(s):

Data Mining ◽

Frequent Itemsets ◽

Frequent Itemset ◽

Graphical Password ◽

Itemset Mining ◽

Frequent Item ◽

Data Mining Algorithms ◽

Shoulder Surfing ◽

Mining Algorithms ◽

Frequent Item Sets

Frequent itemsets are sets of data values (e.g., product items) whose number of co-occurrences exceeds a given threshold. The challenge is that proofs and verification objects have to be customized for each data mining algorithm. The proposed method implements a basic completeness verification and authentication approach in which the client uses a set of frequent itemsets as evidence and checks whether the server has omitted any of them from its returned result. This helps the client detect an untrusted server and makes the system more efficient by reducing processing time. In the authentication process, CaRP serves as both a CAPTCHA and a graphical password scheme. CaRP addresses a number of security problems at once, such as online guessing attacks, relay attacks, and, if combined with dual-view technologies, shoulder-surfing attacks.
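
The abstract's two ideas, a support threshold and an evidence-based completeness check, can be illustrated with a small sketch. It is not the paper's protocol: the function names, the brute-force enumeration, the `>=` threshold convention, and the grocery data are assumptions chosen to keep the example short.

```python
from itertools import combinations


def support(itemset, transactions):
    """Number of transactions that contain every item of `itemset`."""
    return sum(1 for t in transactions if itemset <= t)


def frequent_itemsets(transactions, threshold):
    """Brute-force enumeration: fine for toy data, exponential in general."""
    items = sorted(set().union(*transactions))
    result = set()
    for k in range(1, len(items) + 1):
        for combo in combinations(items, k):
            candidate = frozenset(combo)
            # Assumed convention: frequent means support >= threshold.
            if support(candidate, transactions) >= threshold:
                result.add(candidate)
    return result


def missing_evidence(evidence, server_result):
    """Evidence itemsets that the server failed to return."""
    return evidence - server_result


# Hypothetical toy data.
transactions = [
    {"milk", "bread"},
    {"milk", "bread", "eggs"},
    {"bread", "eggs"},
    {"milk", "eggs"},
]

# What an honest server would return, and a result with one itemset dropped.
honest = frequent_itemsets(transactions, threshold=2)
cheating = honest - {frozenset({"milk", "bread"})}

# Itemsets the client already knows to be frequent.
evidence = {frozenset({"milk", "bread"}), frozenset({"eggs"})}

print(missing_evidence(evidence, honest))    # empty set: nothing is missing
print(missing_evidence(evidence, cheating))  # the omitted {milk, bread} itemset
```

The check can only catch omissions that overlap the client's evidence set, so how that evidence is chosen matters; the sketch does not address that choice.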

Visual Data Mining of Large Spatial Data Sets

Databases in Networked Information Systems - Lecture Notes in Computer Science ◽

10.1007/978-3-540-39845-5_17 ◽

2003 ◽

pp. 201-215 ◽

Cited By ~ 4

Author(s):

Daniel A. Keim ◽

Christian Panse ◽

Mike Sips

Keyword(s):

Data Mining ◽

Spatial Data ◽

Data Sets ◽

Visual Data ◽

Visual Data Mining ◽

Spatial Data Sets

High performance spatial data mining for very large data-sets

Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming - PPoPP '03 ◽

10.1145/781498.781509 ◽

2003 ◽

Author(s):

Baris Kazar

Keyword(s):

Data Mining ◽

Spatial Data ◽

High Performance ◽

Large Data ◽

Spatial Data Mining ◽

Large Data Sets ◽

Data Sets

Association Rule Integrasi Pendekatan Metode Custom Hashing dan Data Partitioning untuk Mempercepat Proses Pencarian Frekuensi Item-set pada Algoritma Apriori

Matrik Jurnal Manajemen Teknik Informatika dan Rekayasa Komputer ◽

10.30812/matrik.v20i1.833 ◽

2020 ◽

Vol 20 (1) ◽

pp. 149-158

Author(s):

Moch. Syahrir ◽

Fatimatuzzahra Fatimatuzzahra

Keyword(s):

Data Mining ◽

Association Rule ◽

Data Partitioning ◽

Frequent Pattern ◽

Frequent Pattern Tree

Data mining with association rules is widely used in business, and one of the algorithms most often used for association rules is Apriori. However, Apriori has a performance weakness, because it must scan the database each time it determines the frequent k-itemsets. This becomes a problem when the candidate k-itemsets have many dimensions: scanning a large database takes a long time and affects memory and processor usage. Apriori has been extended many times, and one popular development is Frequent Pattern growth (FP-Growth). Apriori and FP-Growth are both association rule algorithms, but FP-Growth takes a different approach, using a Frequent Pattern Tree (FP-tree). Although FP-Growth performs well when scanning the database, the rules it produces are not as good as those produced by Apriori. Another alternative is hashing, which can address the problem of searching for and determining frequent k-itemsets so that database scans are faster. The aim of this research is to improve Apriori's performance in counting itemset frequencies so that database scan time is reduced.
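
To make the hashing and partitioning idea concrete, here is a rough sketch in which k-itemsets are counted in a hash table, one partition at a time, and the per-partition tables are then merged. It is not the authors' custom hashing scheme, which the abstract does not specify. Python's built-in hashing via `Counter` stands in for it, and the function names and toy data are hypothetical.

```python
from collections import Counter
from itertools import combinations


def count_k_itemsets(partition, k):
    """Count every k-itemset in one pass over a partition, using a hash table."""
    counts = Counter()
    for transaction in partition:
        # Sorting gives each itemset a single canonical tuple to use as a key.
        for combo in combinations(sorted(set(transaction)), k):
            counts[combo] += 1
    return counts


def frequent_k_itemsets(transactions, k, min_support, n_partitions=2):
    """Split the data, count each partition separately, then merge the tables."""
    size = max(1, -(-len(transactions) // n_partitions))  # ceiling division
    total = Counter()
    for start in range(0, len(transactions), size):
        total.update(count_k_itemsets(transactions[start:start + size], k))
    return {itemset: c for itemset, c in total.items() if c >= min_support}


# Hypothetical toy data.
transactions = [
    ["a", "b", "c"],
    ["a", "b"],
    ["a", "c"],
    ["b", "d"],
    ["a", "b", "d"],
]
pairs = frequent_k_itemsets(transactions, k=2, min_support=2)
print(pairs)  # ('a', 'b') occurs 3 times; ('a', 'c') and ('b', 'd') twice each
```

Each transaction is read once per level instead of once per candidate. The sketch counts every k-subset and omits Apriori's pruning of candidates with infrequent subsets, so it trades memory for fewer scans.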

A Weighted Frequent Item-Set Mining using WD-FIM Algorithm

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.l3683.1081219 ◽

2019 ◽

Vol 8 (12) ◽

pp. 4792-4796

Keyword(s):

Data Mining ◽

Decision Making ◽

Research Area ◽

Data Sets ◽

Weight Factor ◽

Smart Systems ◽

Frequent Item ◽

Significant Research ◽

Downward Closure ◽

The One

Smart systems are among the most significant inventions of our time. These systems rely on powerful information mining techniques to achieve intelligence in decision making. Frequent item set mining (FIM) has become one of the most significant research areas in data mining. The information present in databases is generally ambiguous and uncertain. In such databases, one should consider weighted FIM to discover item sets that are significant from the end user's perspective. However, once a weight factor is introduced, weighted frequent item sets may no longer satisfy the downward closure property. Consequently, the search space of frequent item sets cannot be narrowed using the downward closure property, which leads to poor time efficiency. In this paper, we introduce two properties for FIM: the first is the weight judgment downward closure property (WD-FIM), which applies to weighted FIM, and the second is the existence property of its subsets. Based on these two properties, the WD-FIM algorithm is proposed to narrow the search space of weighted frequent item sets and improve time efficiency. In addition, the completeness and time efficiency of the WD-FIM algorithm are analyzed theoretically. Finally, the performance of the proposed WD-FIM algorithm is verified on both synthetic and real data sets.
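
The loss of downward closure under weighting can be seen in a tiny worked example. The abstract does not give WD-FIM's definition of weighted support, so this sketch assumes one common formulation: the mean weight of the items in the set multiplied by the fraction of transactions containing it. The weights, threshold, and data are hypothetical.

```python
def weighted_support(itemset, transactions, weights):
    """Mean item weight times the fraction of transactions containing the set."""
    fraction = sum(1 for t in transactions if itemset <= t) / len(transactions)
    mean_weight = sum(weights[i] for i in itemset) / len(itemset)
    return mean_weight * fraction


# Hypothetical toy data: "a" is common but unimportant, "b" is heavily weighted.
weights = {"a": 0.2, "b": 0.9}
transactions = [{"a", "b"}, {"a", "b"}, {"a"}, {"a", "b"}]
threshold = 0.3

ws_a = weighted_support(frozenset({"a"}), transactions, weights)
ws_ab = weighted_support(frozenset({"a", "b"}), transactions, weights)

# {a} falls below the threshold while its superset {a, b} passes it.
print(ws_a < threshold)    # True
print(ws_ab >= threshold)  # True
```

Here `{a}` is not weighted-frequent, yet `{a, b}` is, so pruning every superset of an infrequent set would wrongly discard `{a, b}`. This is the failure that weighted FIM methods have to work around when narrowing the search space.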

Bio-Inspired Algorithms for Medical Data Analysis

Handbook of Research on Biomimicry in Information Retrieval and Knowledge Management - Advances in Web Technologies and Engineering ◽

10.4018/978-1-5225-3004-6.ch014 ◽

2018 ◽

pp. 251-275 ◽

Cited By ~ 1

Author(s):

Hanane Menad ◽

Abdelmalek Amine

Keyword(s):

Data Mining ◽

Data Analysis ◽

Social Behavior ◽

Medical Data ◽

The Other ◽

Data Sets ◽

Classification Rules ◽

Medical Data Mining ◽

Good Efficiency

Medical data mining has great potential for exploring the hidden patterns in the data sets of the medical domain. These patterns can be utilized for clinical diagnosis. Bio-inspired algorithms are a new field of research. Its main advantage is knitting together subfields related to the topics of connectionism, social behavior, and emergence. Briefly put, it is the use of computers to model living phenomena and, simultaneously, the study of life to improve the usage of computers. In this chapter, the authors present an application of four bio-inspired algorithms and metaheuristics to the classification of seven different real medical data sets. Two of these algorithms are based on similarity calculation between training and test data, while the other two are based on random generation of a population to construct classification rules. The results showed very good efficiency of bio-inspired algorithms for supervised classification of medical data.
