Data Assimilation with Gaussian Mixture Models Using the Dynamically Orthogonal Field Equations. Part I: Theory and Scheme

2013, Vol 141 (6), pp. 1737-1760
Author(s): Thomas Sondergaard, Pierre F. J. Lermusiaux

Abstract This work introduces and derives an efficient, data-driven assimilation scheme, focused on a time-dependent stochastic subspace that respects nonlinear dynamics and captures non-Gaussian statistics as they occur. The motivation is to obtain a filter that is applicable to realistic geophysical applications, but that also rigorously combines the governing dynamical equations with information theory and learning theory for efficient Bayesian data assimilation. Building on the foundations of classical filters, the underlying theory and algorithmic implementation of the new filter are developed and derived. The stochastic Dynamically Orthogonal (DO) field equations and their adaptive stochastic subspace are employed to predict prior probabilities for the full dynamical state, effectively approximating the Fokker–Planck equation. At assimilation times, the DO realizations are fit to semiparametric Gaussian Mixture Models (GMMs) using the Expectation-Maximization (EM) algorithm and the Bayesian Information Criterion (BIC). Bayes's law is then carried out efficiently and analytically within the evolving stochastic subspace. The resulting GMM-DO filter is illustrated in a very simple example. Variations of the GMM-DO filter are also provided, along with comparisons with related schemes.
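As a concrete illustration of the fitting step described above, the sketch below fits GMMs of increasing complexity to a synthetic ensemble (a stand-in for the DO realizations) and selects the number of components by BIC, using scikit-learn's EM-based GaussianMixture. The ensemble and all names are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch: fit GMMs with 1..5 components to an ensemble of subspace
# coefficients and keep the one preferred by the Bayesian Information
# Criterion. The bimodal 2D ensemble below is synthetic.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
# Stand-in for DO realizations: a bimodal ensemble in a 2D stochastic subspace.
coeffs = np.vstack([
    rng.normal(loc=-2.0, scale=0.5, size=(500, 2)),
    rng.normal(loc=+2.0, scale=0.5, size=(500, 2)),
])

best_gmm, best_bic = None, np.inf
for n_components in range(1, 6):
    gmm = GaussianMixture(n_components=n_components, random_state=0)
    gmm.fit(coeffs)                # EM algorithm
    bic = gmm.bic(coeffs)          # Bayesian Information Criterion
    if bic < best_bic:
        best_gmm, best_bic = gmm, bic

print(f"BIC selects {best_gmm.n_components} mixture components")
```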

2011, Vol 23 (6), pp. 1605-1622
Author(s): Lingyan Ruan, Ming Yuan, Hui Zou

Finite Gaussian mixture models are widely used in statistics thanks to their great flexibility. However, parameter estimation for Gaussian mixture models with high dimensionality can be challenging because of the large number of parameters that need to be estimated. In this letter, we propose a penalized likelihood estimator to address this difficulty. The ℓ1-type penalty we impose on the inverse covariance matrices encourages sparsity in their entries and therefore helps to reduce the effective dimensionality of the problem. We show that the proposed estimate can be computed efficiently using an expectation-maximization algorithm. To illustrate the practical merits of the proposed method, we consider its applications in model-based clustering and mixture discriminant analysis. Numerical experiments with both simulated and real data show that the new method is a valuable tool for high-dimensional data analysis.
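A minimal sketch of the penalized-EM idea, assuming a graphical-lasso-style ℓ1 penalty on each component's inverse covariance: the standard M-step covariance update is replaced by a graphical lasso fit, which sparsifies the precision matrices. The function and toy data below are an illustrative reconstruction, not the authors' reference code.

```python
import numpy as np
from scipy.stats import multivariate_normal
from sklearn.covariance import graphical_lasso

def penalized_em(X, k, alpha=0.1, n_iter=20, seed=0):
    rng = np.random.default_rng(seed)
    n, d = X.shape
    weights = np.full(k, 1.0 / k)
    means = X[rng.choice(n, size=k, replace=False)].astype(float)
    covs = np.array([np.cov(X.T) + 1e-3 * np.eye(d) for _ in range(k)])
    for _ in range(n_iter):
        # E-step: posterior responsibility of each component for each point.
        dens = np.column_stack([
            w * multivariate_normal.pdf(X, m, c)
            for w, m, c in zip(weights, means, covs)
        ])
        resp = dens / (dens.sum(axis=1, keepdims=True) + 1e-300)
        # M-step: weights and means as usual; each covariance update is
        # replaced by a graphical lasso fit, imposing the l1 penalty on the
        # entries of the inverse covariance.
        for j in range(k):
            r = resp[:, j]
            weights[j] = r.mean()
            means[j] = r @ X / r.sum()
            diff = X - means[j]
            emp_cov = (r[:, None] * diff).T @ diff / r.sum()
            covs[j], _ = graphical_lasso(emp_cov, alpha=alpha)
    return weights, means, covs

# Toy usage: two well-separated 5-dimensional clusters.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-2, 1, (200, 5)), rng.normal(2, 1, (200, 5))])
w, mu, S = penalized_em(X, k=2)
print("weights:", np.round(w, 2))
```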


2011, Vol 474-476, pp. 442-447
Author(s): Zhi Gao Zeng, Li Xin Ding, Sheng Qiu Yi, San You Zeng, Zi Hua Qiu

In order to improve the accuracy of image segmentation in video surveillance sequences and to overcome the limits of traditional clustering algorithms, which cannot accurately model image data sets that contain noise, the paper presents an automatic and accurate video image segmentation algorithm that uses Gaussian mixture models, informed by spatial properties, to segment the image. Because the expectation-maximization algorithm is very sensitive to initial values and easily falls into local optima, the paper presents a differential evolution-based parameter estimation for Gaussian mixture models. The experimental results show that segmentation accuracy is greatly improved over traditional segmentation algorithms.
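The sketch below illustrates the general idea of estimating GMM parameters with differential evolution instead of EM, as the abstract proposes. The 1D toy problem, parameter encoding (means, log standard deviations, weight logits), and bounds are all assumptions for illustration.

```python
# Maximize the GMM log-likelihood directly with differential evolution,
# which is less sensitive to initialization than EM.
import numpy as np
from scipy.optimize import differential_evolution
from scipy.stats import norm

rng = np.random.default_rng(1)
data = np.concatenate([rng.normal(0, 1, 300), rng.normal(5, 0.7, 300)])
K = 2

def neg_log_likelihood(theta):
    # theta = [mu_1..mu_K, log_sigma_1..log_sigma_K, weight logits].
    mu = theta[:K]
    sigma = np.exp(theta[K:2 * K])
    w = np.exp(theta[2 * K:])
    w /= w.sum()                               # softmax-style mixture weights
    pdf = sum(w[j] * norm.pdf(data, mu[j], sigma[j]) for j in range(K))
    return -np.sum(np.log(pdf + 1e-300))

bounds = [(-10, 10)] * K + [(-3, 3)] * K + [(-5, 5)] * K
result = differential_evolution(neg_log_likelihood, bounds, seed=0)
print("estimated means:", np.sort(result.x[:K]))
```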


2013, Vol 141 (6), pp. 1761-1785
Author(s): Thomas Sondergaard, Pierre F. J. Lermusiaux

Abstract The properties and capabilities of the Gaussian Mixture Model–Dynamically Orthogonal (GMM-DO) filter are assessed and exemplified by applications to two dynamical systems: 1) the double-well diffusion and 2) sudden expansion flows, both of which admit far-from-Gaussian statistics. The former test case, a twin experiment, validates the use of the Expectation-Maximization (EM) algorithm and Bayesian Information Criterion with GMMs in a filtering context; the latter further exemplifies the filter's ability to efficiently handle state vectors of nontrivial dimensionality and dynamics with jets and eddies. For each test case, qualitative and quantitative comparisons are made with contemporary filters. The sensitivity to input parameters is illustrated and discussed. Properties of the filter are examined and its estimates are described, including the equation-based and adaptive prediction of the probability densities; the evolution of the mean field, stochastic subspace modes, and stochastic coefficients; the fitting of GMMs; and the efficient and analytical Bayesian updates at assimilation times and the corresponding data impacts. The advantages of respecting nonlinear dynamics and preserving non-Gaussian statistics are brought to light. For realistic test cases admitting complex distributions and with sparse or noisy measurements, the GMM-DO filter is shown to fundamentally improve the filtering skill, outperforming simpler schemes invoking the Gaussian parametric distribution.
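For readers unfamiliar with the first test case, the following sketch simulates a generic double-well diffusion ensemble with an Euler-Maruyama step. The drift, noise amplitude, and discretization are illustrative assumptions rather than the paper's exact configuration, but they reproduce the bimodal, far-from-Gaussian statistics that motivate a GMM prior.

```python
# An ensemble of scalar trajectories in a double-well potential; the
# long-run statistics are bimodal, hence far from Gaussian.
import numpy as np

rng = np.random.default_rng(2)
n_ens, n_steps, dt, kappa = 1000, 5000, 1e-3, 0.4
x = rng.normal(0.0, 0.1, size=n_ens)        # ensemble starts near the barrier

for _ in range(n_steps):
    drift = 4.0 * x - 4.0 * x ** 3           # gradient of -(x**2 - 1)**2
    x += drift * dt + kappa * np.sqrt(dt) * rng.standard_normal(n_ens)

# The ensemble settles into two modes near x = -1 and x = +1, which is why a
# single Gaussian prior is a poor fit and a two-component GMM is natural.
print("fraction in right well:", np.mean(x > 0))
```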


2019, Vol 19 (11), pp. 2050204
Author(s): Sara Shirinkam, Adel Alaeddini, Elizabeth Gross

Using Gaussian mixture models for clustering is a statistically mature method in data science, with numerous successful applications in science and engineering. The parameters of a Gaussian mixture model (GMM) are typically estimated from training data using the iterative expectation-maximization algorithm, which requires the number of Gaussian components a priori. In this study, we propose two algorithms rooted in numerical algebraic geometry (NAG), namely an area-based algorithm and a local maxima algorithm, to identify the optimal number of components. The area-based algorithm transforms several GMMs with varying numbers of components into sets of equivalent polynomial regression splines. Next, it uses homotopy continuation methods to evaluate the resulting splines and identify the number of components most compatible with the gradient data. The local maxima algorithm forms a set of polynomials by fitting a smoothing spline to a dataset. Next, it uses NAG to solve the system of first derivatives and find the local maxima of the resulting smoothing spline, which represent the mixture components; the local maxima algorithm thereby also identifies the locations of the centers of the Gaussian components. Using a real-world case study in automotive manufacturing and extensive simulations, we demonstrate that the performance of the proposed algorithms is comparable with that of the Akaike information criterion (AIC) and the Bayesian information criterion (BIC), which are popular methods in the literature. We also show that the proposed algorithms are more robust than AIC and BIC when the Gaussian assumption is violated.
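The following is a simplified numerical stand-in for the local maxima algorithm: fit a smoothing spline to a density estimate, solve for the critical points of its first derivative, and keep those with negative curvature. The paper solves the resulting polynomial system with homotopy continuation (NAG software); ordinary spline root-finding is used here purely for illustration, and the data and smoothing parameters are assumptions.

```python
# Count local maxima of a smoothing spline fit to a histogram density:
# each maximum stands in for one mixture component and its center.
import numpy as np
from scipy.interpolate import UnivariateSpline

rng = np.random.default_rng(3)
data = np.concatenate([rng.normal(-2, 0.6, 400), rng.normal(2, 0.6, 400)])
density, edges = np.histogram(data, bins=60, density=True)
centers = 0.5 * (edges[:-1] + edges[1:])

spline = UnivariateSpline(centers, density, k=4, s=0.01)
d1 = spline.derivative(1)
crit = d1.roots()                             # critical points of the spline
maxima = [t for t in crit if spline.derivative(2)(t) < 0]
print("estimated number of components:", len(maxima))
print("estimated centers:", np.round(maxima, 2))
```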


Author(s): Ching-Hua Chuan

This paper presents an audio classification and retrieval system that uses wavelets to extract low-level acoustic features. The author performs multiple-level decomposition with the discrete wavelet transform to extract acoustic features from audio recordings at different scales and times. The extracted features are then translated into a compact vector representation. Gaussian mixture models, fit with the expectation-maximization algorithm, are used to build models for audio classes and individual audio examples. The system is evaluated on three audio classification tasks: speech/music, male/female speech, and music genre. The author also shows how wavelets and Gaussian mixture models can be used for class-based audio retrieval in two approaches: indexing using only wavelets versus indexing by Gaussian components. By evaluating the system through 10-fold cross-validation, the author demonstrates the promising capability of wavelets and Gaussian mixture models for audio classification and retrieval, and compares how parameters, including frame size, wavelet level, number of Gaussian components, and sampling size, affect the performance of the Gaussian models.
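A hedged sketch of the described pipeline: multi-level discrete wavelet decomposition of audio frames, a compact per-frame statistics vector, and one GMM per class scored by log-likelihood. The wavelet family, decomposition level, feature statistics, and synthetic audio below are assumptions, not the paper's settings.

```python
import numpy as np
import pywt
from sklearn.mixture import GaussianMixture

def wavelet_features(frame, wavelet="db4", level=4):
    # Multi-level DWT; compact vector = (mean |coef|, std) per subband.
    coeffs = pywt.wavedec(frame, wavelet, level=level)
    return np.array([s for c in coeffs
                     for s in (np.mean(np.abs(c)), np.std(c))])

rng = np.random.default_rng(4)
def make_frames(freq, n=200, size=1024):
    # Synthetic "audio": noisy sinusoids standing in for real recordings.
    t = np.arange(size) / 8000.0
    return [np.sin(2 * np.pi * freq * t) + 0.3 * rng.standard_normal(size)
            for _ in range(n)]

classes = {"low": make_frames(200.0), "high": make_frames(1200.0)}
models = {}
for label, frames in classes.items():
    X = np.array([wavelet_features(f) for f in frames])
    # Diagonal covariances keep the per-class GMMs well conditioned here.
    models[label] = GaussianMixture(n_components=4, covariance_type="diag",
                                    random_state=0).fit(X)

# Classify a new frame by the class model with the highest log-likelihood.
test = wavelet_features(make_frames(1200.0, n=1)[0]).reshape(1, -1)
print(max(models, key=lambda lab: models[lab].score(test)))
```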


Filomat, 2019, Vol 33 (15), pp. 4753-4767
Author(s): Khalil Masmoudi, Afif Masmoudi

In this paper, we introduce finite mixture models with singular multivariate normal components. These models are useful when the observed data involve collinearities, that is, when the covariance matrices are singular. They are also useful when the covariance matrices are ill-conditioned: in the latter case, classical approaches may lead to numerical instabilities and inaccurate estimates. Hence, an extension of the Expectation-Maximization algorithm, with a complete proof, is proposed to derive the maximum likelihood estimators and cluster the data instances for mixtures of singular multivariate normal distributions. The accuracy of the proposed algorithm is then demonstrated through several numerical experiments. Finally, we discuss the application of the proposed distribution to modeling financial asset returns and portfolio selection.
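To make the singular-covariance setting concrete, the sketch below evaluates the log-density of a singular multivariate normal on its support via the eigendecomposition, using the pseudo-inverse quadratic form and pseudo-determinant. This illustrates only the density-evaluation ingredient such an EM extension needs, not the authors' full estimator; the tolerance and test data are assumptions.

```python
import numpy as np

def singular_normal_logpdf(x, mean, cov, tol=1e-10):
    # Density of N(mean, cov) restricted to the support of a rank-deficient
    # covariance: keep only the nonzero eigenvalues.
    diff = x - mean
    eigval, eigvec = np.linalg.eigh(cov)
    support = eigval > tol
    rank = support.sum()
    pseudo_det = np.prod(eigval[support])     # product of nonzero eigenvalues
    # Project onto the support and use the pseudo-inverse quadratic form.
    z = eigvec[:, support].T @ diff
    quad = np.sum(z ** 2 / eigval[support])
    return -0.5 * (rank * np.log(2 * np.pi) + np.log(pseudo_det) + quad)

# Rank-1 covariance in 2D: all probability mass lives on the line y = x.
cov = np.array([[1.0, 1.0], [1.0, 1.0]])
print(singular_normal_logpdf(np.array([0.5, 0.5]), np.zeros(2), cov))
```

For comparison, scipy.stats.multivariate_normal accepts allow_singular=True and handles this case the same way internally.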

