A non-parametric hierarchical clustering model

The purpose of this article is to weigh up the foremost imperative features of Chronic Kidney Disease (CKD). This study is based mostly on three cluster techniques like; K means, Fuzzy c-means and hierarchical clustering. The authors used evolutionary techniques like genetic algorithms (GA) to extend the performance of the clustering model. The performance of these three clusters: live parameter purity, entropy, and Adjusted Rand Index (ARI) have been contemplated. The best purity is obtained by the K-means clustering technique, 96.50%; whereas, Fuzzy C-means clustering received 93.50% and hierarchical clustering was the lowest at 92. 25%. After using evolutionary technique Genetic Algorithm as Feature selection technique, the best purity is obtained by hierarchical clustering, 97.50%, compared to K –means clustering, 96.75%, and Fuzzy C-means clustering at 94.00%.

Download Full-text

An efficient hierarchical clustering model for grouping web transactions

International Journal of Business Intelligence and Data Mining ◽

10.1504/ijbidm.2008.020516 ◽

2008 ◽

Vol 3 (2) ◽

pp. 147 ◽

Cited By ~ 1

Author(s):

Darenna Syahida Suib ◽

Mustafa Mat Deris

Keyword(s):

Hierarchical Clustering ◽

Clustering Model ◽

Web Transactions

Download Full-text

Analysis and Comparison of Clustering Techniques for Chronic Kidney Disease With Genetic Algorithm

International Journal of Computer Vision and Image Processing ◽

10.4018/ijcvip.2018100102 ◽

2018 ◽

Vol 8 (4) ◽

pp. 16-25

Author(s):

Sanat Kumar Sahu ◽

A. K. Shrivas

Keyword(s):

Genetic Algorithm ◽

Chronic Kidney Disease ◽

Kidney Disease ◽

Hierarchical Clustering ◽

Adjusted Rand Index ◽

Feature Selection Technique ◽

Fuzzy C Means ◽

Clustering Model ◽

Fuzzy C Means Clustering ◽

Evolutionary Technique

The purpose of this article is to weigh up the foremost imperative features of Chronic Kidney Disease (CKD). This study is based mostly on three cluster techniques like; K means, Fuzzy c-means and hierarchical clustering. The authors used evolutionary techniques like genetic algorithms (GA) to extend the performance of the clustering model. The performance of these three clusters: live parameter purity, entropy, and Adjusted Rand Index (ARI) have been contemplated. The best purity is obtained by the K-means clustering technique, 96.50%; whereas, Fuzzy C-means clustering received 93.50% and hierarchical clustering was the lowest at 92. 25%. After using evolutionary technique Genetic Algorithm as Feature selection technique, the best purity is obtained by hierarchical clustering, 97.50%, compared to K –means clustering, 96.75%, and Fuzzy C-means clustering at 94.00%.

Download Full-text

An Improved Pearson’s Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes

The Scientific World JOURNAL ◽

10.1155/2014/357873 ◽

2014 ◽

Vol 2014 ◽

pp. 1-10

Author(s):

P. M. Booma ◽

S. Prabhakaran ◽

R. Dhanalakshmi

Keyword(s):

Gene Expression ◽

Hierarchical Clustering ◽

Microarray Gene Expression ◽

Great Awareness ◽

Pearson’S Correlation ◽

Process Measures ◽

Significance Level ◽

Biological Association ◽

Clustering Model ◽

Pearson's Correlation

Microarray gene expression datasets has concerned great awareness among molecular biologist, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A search made with heuristic for standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rate of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using proximity measure of improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify association between the clusters. Moreover, a GL-PCPHC applies pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for GL-PCPHC model with standard benchmark gene expression datasets extracted from UCI repository and GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality.

Download Full-text