gene expression data clustering
Recently Published Documents


TOTAL DOCUMENTS

31
(FIVE YEARS 4)

H-INDEX

7
(FIVE YEARS 0)

2021 ◽  
Author(s):  
Meskat Jahan ◽  
Mahmudul Hasan

Abstract In the big data era, clustering is one of the most popular data mining method. The majority of clustering algorithms have complications like automatic cluster number determination, poor clustering precision, inconsistent clustering of various datasets and parameter-dependent etc. A new fuzzy autonomous solution for clustering named Meskat-Mahmudul (MM) clustering algorithm proposed to overcome the complexity of parameter–free automatic cluster number determination and clustering accuracy. MM clustering algorithm finds out the exact number of clusters based on Average Silhouette method in multivariate mixed attribute dataset, including real-time gene expression dataset and dealt missing values, noise and outliers. MM Extended K-Means (MMK) clustering algorithm is an enhancement of the K-Means algorithm, which serves the purpose for automatic cluster discovery and runtime cluster placement. Several validation methods used to evaluate cluster and certify optimum cluster partitioning and perfection. Some datasets used to assess the performance of the proposed algorithms to other algorithms in terms of time complexity and clustering efficiency. Finally, MM clustering and MMK clustering algorithms found superior over conventional algorithms.


2020 ◽  
Vol 8 (6) ◽  
pp. 5765-5767

Microarray innovation as of late has significant effects in numerous fields, for example, medical fields, bio-drug, describing different gene capacities, understanding diverse atomic bio-legitimate procedures, gene expression profiling and so on. In any case, microarray chips comprise of expression levels of an immense number of genes, thus produce huge measures of data to deal with. Because of its huge volume, the computational examination is basic for extricating information from microarray gene expression data. Clustering is one of the essential ways to deal with break down such a huge measure of data to find the gatherings of co-communicated genes. The issues tended to in hard clustering could be fathomed in a fuzzy clustering strategy. Among fuzzy based clustering, fuzzy c-means (FCM) is the most reasonable for microarray gene expression data. The issue related to fuzzy c-means is the number of clusters to be generated for the given dataset should be determined in earlier. The fundamental goal of this proposed Novel fuzzy cmeans (NFCM) strategy is to decide the exact number of clusters and decipher the equivalent effect.


Sign in / Sign up

Export Citation Format

Share Document