Model-based clustering with sparse covariance matrices

Statistics and Computing ◽

10.1007/s11222-018-9838-y ◽

2018 ◽

Vol 29 (4) ◽

pp. 791-819 ◽

Author(s):

Michael Fop ◽

Thomas Brendan Murphy ◽

Luca Scrucca

Keyword(s):

Covariance Matrices ◽

Model Based Clustering ◽


Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables

Electronic Journal of Statistics ◽

10.1214/08-ejs194 ◽

2008 ◽

Vol 2 ◽

pp. 168-212 ◽

Author(s):

Benhuai Xie ◽

Wei Pan ◽

Xiaotong Shen

Keyword(s):

Covariance Matrices ◽

Model Based Clustering ◽

Model Based ◽

Grouped Variables


Robust Clustering Method in the Presence of Scattered Observations

Neural Computation ◽

10.1162/neco_a_00833 ◽

2016 ◽

Vol 28 (6) ◽

pp. 1141-1162

Author(s):

Akifumi Notsu ◽

Shinto Eguchi

Keyword(s):

Data Analysis ◽

Degrees Of Freedom ◽

Covariance Matrices ◽


Proper Solution ◽

Clustering Method ◽

Robust Clustering ◽

Standard Methods ◽

Model Based Clustering ◽

Contamination by scattered observations, which are either featureless or unlike the other observations, frequently degrades the performance of standard methods such as K-means and model-based clustering. In this letter, we propose Gamma-clust, a clustering method that remains robust in the presence of scattered observations. Gamma-clust is based on robust estimation of cluster centers using gamma-divergence. It provides a proper solution for clustering in which the distributions of the clustered data are nonnormal, such as t-distributions with different variance-covariance matrices and degrees of freedom. As demonstrated in a simulation study and data analysis, Gamma-clust is more flexible and provides superior results compared to robustified K-means and model-based clustering.
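The idea of robust center estimation can be illustrated with a minimal sketch. It is not the Gamma-clust algorithm from the paper: it assumes a single one-dimensional Gaussian cluster with known scale and uses a fixed-point iteration in which each point is weighted by its model density raised to the power gamma, as in gamma-divergence location estimation. The function name `gamma_center`, the data, and the choices `gamma=0.5` and `sigma=1.0` are hypothetical.

```python
import numpy as np

def gamma_center(x, gamma=0.5, sigma=1.0, n_iter=100):
    """Illustrative robust center for 1-D data (assumed Gaussian, known sigma).

    Each point gets weight proportional to its density to the power gamma,
    so points far from the current center contribute almost nothing.
    """
    mu = np.median(x)  # robust starting value
    for _ in range(n_iter):
        w = np.exp(-gamma * (x - mu) ** 2 / (2.0 * sigma ** 2))
        mu = np.sum(w * x) / np.sum(w)
    return mu

# Five points around 0 plus two scattered observations far away.
data = np.array([-1.0, -0.5, 0.0, 0.5, 1.0, 50.0, 60.0])
robust_center = gamma_center(data)
plain_mean = data.mean()
print(robust_center, plain_mean)
```

In this toy data set the plain mean is pulled toward the two scattered points, while the weighted iteration stays with the bulk of the observations.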


Model-Based Clustering with Measurement or Estimation Errors

Genes ◽

10.3390/genes11020185 ◽

2020 ◽

Vol 11 (2) ◽

pp. 185 ◽

Author(s):

Wanli Zhang ◽

Yanming Di

Keyword(s):

Finite Mixture Models ◽

Estimation Error ◽

Covariance Matrices ◽

Finite Mixture ◽

Estimation Errors ◽

Data Set ◽

Component Distribution ◽

Model Based Clustering ◽

Model Based ◽

Error Covariance

Model-based clustering with finite mixture models has become a widely used clustering method. One of the recent implementations is MCLUST. When objects to be clustered are summary statistics, such as regression coefficient estimates, they are naturally associated with estimation errors, whose covariance matrices can often be calculated exactly or approximated using asymptotic theory. This article proposes an extension to Gaussian finite mixture modeling, called MCLUST-ME, that properly accounts for the estimation errors. More specifically, we assume that the distribution of each observation consists of an underlying true component distribution and an independent measurement error distribution. Under this assumption, each unique value of estimation error covariance corresponds to its own classification boundary, which consequently results in a different grouping from MCLUST. Through simulation and application to an RNA-Seq data set, we discovered that, under certain circumstances, explicitly modeling estimation errors improves clustering performance or provides new insights into the data compared with simply ignoring the errors, and that the degree of improvement depends on factors such as the distribution of error covariance matrices.
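The effect of the error covariance on the classification boundary can be sketched in one dimension. This is an illustration under stated assumptions, not the MCLUST-ME implementation: if an observation is a draw from Gaussian component k plus independent Gaussian error with known variance, its marginal under component k is Gaussian with the two variances added. The function name `responsibilities` and all parameter values are hypothetical.

```python
import numpy as np

def responsibilities(x, err_var, weights, means, variances):
    """Posterior component probabilities for one 1-D observation.

    Assumes x = (draw from component k) + (independent Gaussian error),
    so the marginal of x under component k is N(mean_k, var_k + err_var).
    """
    total_var = variances + err_var
    log_dens = -0.5 * (np.log(2 * np.pi * total_var)
                       + (x - means) ** 2 / total_var)
    log_post = np.log(weights) + log_dens
    log_post -= log_post.max()  # stabilise before exponentiating
    post = np.exp(log_post)
    return post / post.sum()

# Hypothetical two-component mixture.
weights = np.array([0.8, 0.2])
means = np.array([0.0, 3.0])
variances = np.array([1.0, 1.0])

x = 2.4
exact = responsibilities(x, 0.0, weights, means, variances)  # no error
noisy = responsibilities(x, 2.0, weights, means, variances)  # error variance 2
print(exact.argmax(), noisy.argmax())
```

With these values the same observed value is assigned to the second component when measured without error and to the first component when its error variance is 2, which is the sense in which each error covariance has its own boundary.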


Penalized model-based clustering with unconstrained covariance matrices

Electronic Journal of Statistics ◽

10.1214/09-ejs487 ◽

2009 ◽

Vol 3 ◽

pp. 1473-1496 ◽

Author(s):

Hui Zhou ◽

Wei Pan ◽

Xiaotong Shen

Keyword(s):

Covariance Matrices ◽

Model Based Clustering ◽


Model-Based Clustering and Classification for Data Science

10.1017/9781108644181 ◽

2019 ◽

Author(s):

Charles Bouveyron ◽

Gilles Celeux ◽

T. Brendan Murphy ◽

Adrian E. Raftery

Keyword(s):

Data Science ◽

Model Based Clustering ◽

Model Based ◽

Clustering And Classification


Model-based Clustering and Prediction with Mixed Measurements involving Surrogate Classifiers

Statistics in Biopharmaceutical Research ◽

10.1080/19466315.2020.1863257 ◽

2020 ◽

pp. 1-30

Author(s):

Hua Shenam ◽

Alexander R. de Leon

Keyword(s):

Model Based Clustering ◽


Model-Based Clustering

Comprehensive Chemometrics ◽

10.1016/b978-0-12-409547-2.14649-9 ◽

2020 ◽

pp. 509-529

Author(s):

G.J. McLachlan ◽

S.I. Rathnayake ◽

S.X. Lee

Keyword(s):

Model Based Clustering ◽


Automated gating of flow cytometry data via robust model-based clustering

Cytometry Part A ◽

10.1002/cyto.a.20531 ◽

2008 ◽

Vol 73A (4) ◽

pp. 321-332 ◽

Author(s):

Kenneth Lo ◽

Ryan Remy Brinkman ◽

Raphael Gottardo

Keyword(s):

Flow Cytometry ◽

Model Based Clustering ◽

Flow Cytometry Data ◽

Model Based ◽


Model-based principal components of covariance matrices

British Journal of Mathematical and Statistical Psychology ◽

10.1348/000711009x428189 ◽

2010 ◽

Vol 63 (1) ◽

pp. 113-137 ◽

Author(s):

Robert J. Boik ◽

Kamolchanok Panishkan ◽

Scott K. Hyde

Keyword(s):

Principal Components ◽

Covariance Matrices ◽


Detecting experimental noises in protein-protein interactions with iterative sampling and model-based clustering

Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings. ◽

10.1109/bibe.2003.1188977 ◽

2003 ◽

Author(s):

H. Mamitsuka

Keyword(s):

Protein Interactions ◽

Protein Protein Interactions ◽

Model Based Clustering ◽

Model Based ◽

Iterative Sampling
