Clustering Techniques in Data Mining: A Comparative Analysis

Decision tree induction and Clustering are two of the most prevalent data mining techniques used separately or together in many business applications. Most commercial data mining software tools provide these two techniques but few of them satisfy business needs. There are many criteria and factors to choose the most appropriate software for a particular organization. This paper aims to provide a comparative analysis for three popular data mining software tools, which are SAS® Enterprise Miner, SPSS Clementine, and IBM DB2® Intelligent Miner based on four main criteria, which are performance, functionality, usability, and auxiliary Task Support.

Download Full-text

Spam Mail Detection Using Data Mining: A Comparative Analysis

Smart Intelligent Computing and Applications - Smart Innovation, Systems and Technologies ◽

10.1007/978-981-13-1921-1_56 ◽

2018 ◽

pp. 571-580 ◽

Cited By ~ 1

Author(s):

Soumyabrata Saha ◽

Suparna DasGupta ◽

Suman Kumar Das

Keyword(s):

Data Mining ◽

Comparative Analysis ◽

Using Data

Download Full-text

Comparative Analysis of Data Mining Models for Crop Yield by Using Rainfall and Soil Attributes

2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT) ◽

10.1109/icicct.2018.8473074 ◽

2018 ◽

Author(s):

Kunal Teeda ◽

Nandini Vallabhaneni ◽

T. Sridevi

Keyword(s):

Data Mining ◽

Comparative Analysis ◽

Crop Yield ◽

Soil Attributes

Download Full-text

A Survey on Major Classification Algorithms and Comparative Analysis of Few Classification Algorithms on Contact Lenses Data Set Using Data Mining Tool

New Trends in Computational Vision and Bio-inspired Computing ◽

10.1007/978-3-030-41862-5_121 ◽

2020 ◽

pp. 1201-1209

Author(s):

Syed Nawaz Pasha ◽

D. Ramesh ◽

Mohammad Sallauddin

Keyword(s):

Data Mining ◽

Comparative Analysis ◽

Contact Lenses ◽

Classification Algorithms ◽

Data Set ◽

Data Mining Tool ◽

Mining Tool ◽

Using Data

Download Full-text

A comparative Analysis of Multiple Regression in Data Mining

Journal of Computer & Information Technology ◽

10.22147/jucit/080601 ◽

2017 ◽

Vol 08 (06) ◽

pp. 37-40

Author(s):

PRIYANKA VERMA ◽

◽

RAJNI KORI ◽

SHIV KUMAR ◽

◽

...

Keyword(s):

Data Mining ◽

Comparative Analysis ◽

Multiple Regression

Download Full-text

A Comparative Analysis of Data Mining Techniques on Breast Cancer Diagnosis Data using WEKA Toolbox

International Journal of Advanced Computer Science and Applications ◽

10.14569/ijacsa.2020.0110829 ◽

2020 ◽

Vol 11 (8) ◽

Cited By ~ 1

Author(s):

Majdah Alshammari ◽

Mohammad Mezher

Keyword(s):

Breast Cancer ◽

Data Mining ◽

Comparative Analysis ◽

Cancer Diagnosis ◽

Breast Cancer Diagnosis ◽

Data Mining Techniques

Download Full-text

Analysis of Malware Behaviour: Using Data Mining Clustering Techniques to Support Forensics Investigation

2014 Fifth Cybercrime and Trustworthy Computing Conference ◽

10.1109/ctc.2014.10 ◽

2014 ◽

Cited By ~ 5

Author(s):

Edem Inang Edem ◽

Chafika Benzaid ◽

Ameer Al-Nemrat ◽

Paul Watters

Keyword(s):

Data Mining ◽

Clustering Techniques ◽

Using Data

Download Full-text

Distance Based Pattern Driven Mining for Outlier Detection in High Dimensional Big Dataset

ACM Transactions on Management Information Systems ◽

10.1145/3469891 ◽

2022 ◽

Vol 13 (1) ◽

pp. 1-17

Author(s):

Ankit Kumar ◽

Abhishek Kumar ◽

Ali Kashif Bashir ◽

Mamoon Rashid ◽

V. D. Ambeth Kumar ◽

...

Keyword(s):

Data Mining ◽

Comparative Analysis ◽

Outlier Detection ◽

Credit Card ◽

High Dimensional ◽

Work Efficiency ◽

Average Value ◽

Novel Method ◽

Detection Of Outliers ◽

Better Than

Detection of outliers or anomalies is one of the vital issues in pattern-driven data mining. Outlier detection detects the inconsistent behavior of individual objects. It is an important sector in the data mining field with several different applications such as detecting credit card fraud, hacking discovery and discovering criminal activities. It is necessary to develop tools used to uncover the critical information established in the extensive data. This paper investigated a novel method for detecting cluster outliers in a multidimensional dataset, capable of identifying the clusters and outliers for datasets containing noise. The proposed method can detect the groups and outliers left by the clustering process, like instant irregular sets of clusters (C) and outliers (O), to boost the results. The results obtained after applying the algorithm to the dataset improved in terms of several parameters. For the comparative analysis, the accurate average value and the recall value parameters are computed. The accurate average value is 74.05% of the existing COID algorithm, and our proposed algorithm has 77.21%. The average recall value is 81.19% and 89.51% of the existing and proposed algorithm, which shows that the proposed work efficiency is better than the existing COID algorithm.

Download Full-text

Algorithms for Data Mining

Data Warehousing and Mining ◽

10.4018/978-1-59904-951-9.ch074 ◽

2008 ◽

pp. 1301-1319

Author(s):

Tadao Takaoka ◽

Nigel K.L. Pope ◽

Kevin E. Voges

Keyword(s):

Data Mining ◽

Association Rules ◽

Image Data ◽

Regression Trees ◽

Clustering Techniques ◽

Data Mining Algorithms ◽

Mining Algorithms

In this chapter, we present an overview of some common data mining algorithms. Two techniques are considered in detail. The first is association rules, a fundamental approach that is one of the oldest and most widely used techniques in data mining. It is used, for example, in supermarket basket analysis to identify relationships between purchased items. The second is the maximum sub-array problem, which is an emerging area that is yet to produce a textbook description. This area is becoming important as a new tool for data mining, particularly in the analysis of image data. For both of these techniques, algorithms are presented in pseudo-code to demonstrate the logic of the approaches. We also briefly consider decision and regression trees and clustering techniques.

Download Full-text

Visual Data Mining: A Comparative Analysis of Selected Datasets

Advances in Intelligent Systems and Computing - Intelligent Systems Design and Applications ◽

10.1007/978-3-030-71187-0_35 ◽

2021 ◽

pp. 377-391

Author(s):

Ujunwa Mgboh ◽

Blessing Ogbuokiri ◽

George Obaido ◽

Kehinde Aruleba

Keyword(s):

Data Mining ◽

Comparative Analysis ◽

Visual Data ◽

Visual Data Mining

Download Full-text