gene expression data clustering Latest Research Papers

Abstract In the big data era, clustering is one of the most popular data mining method. The majority of clustering algorithms have complications like automatic cluster number determination, poor clustering precision, inconsistent clustering of various datasets and parameter-dependent etc. A new fuzzy autonomous solution for clustering named Meskat-Mahmudul (MM) clustering algorithm proposed to overcome the complexity of parameter–free automatic cluster number determination and clustering accuracy. MM clustering algorithm finds out the exact number of clusters based on Average Silhouette method in multivariate mixed attribute dataset, including real-time gene expression dataset and dealt missing values, noise and outliers. MM Extended K-Means (MMK) clustering algorithm is an enhancement of the K-Means algorithm, which serves the purpose for automatic cluster discovery and runtime cluster placement. Several validation methods used to evaluate cluster and certify optimum cluster partitioning and perfection. Some datasets used to assess the performance of the proposed algorithms to other algorithms in terms of time complexity and clustering efficiency. Finally, MM clustering and MMK clustering algorithms found superior over conventional algorithms.

Download Full-text

Infinite Von Mises-Fisher Mixture Model and Its Application to Gene Expression Data Clustering

10.1145/3461353.3461364 ◽

2021 ◽

Author(s):

Zhu Jiaojiao ◽

Fan Wentao

Keyword(s):

Gene Expression ◽

Mixture Model ◽

Gene Expression Data ◽

Data Clustering ◽

Expression Data ◽

Von Mises ◽

Gene Expression Data Clustering

Download Full-text

Implementation of Novel Fuzzy C-Means Method in Gene Data

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.f7299.038620 ◽

2020 ◽

Vol 8 (6) ◽

pp. 5765-5767

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Microarray Gene Expression Data ◽

Expression Data ◽

Microarray Gene Expression ◽

Number Of Clusters ◽

Fuzzy C Means ◽

Equivalent Effect ◽

Microarray Gene ◽

Gene Expression Data Clustering

Microarray innovation as of late has significant effects in numerous fields, for example, medical fields, bio-drug, describing different gene capacities, understanding diverse atomic bio-legitimate procedures, gene expression profiling and so on. In any case, microarray chips comprise of expression levels of an immense number of genes, thus produce huge measures of data to deal with. Because of its huge volume, the computational examination is basic for extricating information from microarray gene expression data. Clustering is one of the essential ways to deal with break down such a huge measure of data to find the gatherings of co-communicated genes. The issues tended to in hard clustering could be fathomed in a fuzzy clustering strategy. Among fuzzy based clustering, fuzzy c-means (FCM) is the most reasonable for microarray gene expression data. The issue related to fuzzy c-means is the number of clusters to be generated for the given dataset should be determined in earlier. The fundamental goal of this proposed Novel fuzzy cmeans (NFCM) strategy is to decide the exact number of clusters and decipher the equivalent effect.

Download Full-text