Improved Initialization of the EM Algorithm for Mixture Model Parameter Estimation

Mathematics ◽  
2020 ◽  
Vol 8 (3) ◽  
pp. 373
Author(s):  
Branislav Panić ◽  
Jernej Klemenc ◽  
Marko Nagode

A commonly used tool for estimating the parameters of a mixture model is the Expectation–Maximization (EM) algorithm, an iterative procedure that can serve as a maximum-likelihood estimator. The EM algorithm has well-documented drawbacks, such as the need for good initial values and the possibility of being trapped in local optima. Nevertheless, because of its appealing properties, EM plays an important role in estimating the parameters of mixture models. To overcome these initialization problems, in this paper we propose the Rough-Enhanced-Bayes mixture estimation (REBMIX) algorithm as a more effective initialization algorithm for EM. Three different strategies are derived for dealing with the unknown number of components in the mixture model. These strategies are thoroughly tested on artificial datasets, density-estimation datasets, and image-segmentation problems, and compared with state-of-the-art initialization methods for EM. Our proposal shows promising results in terms of clustering and density-estimation performance as well as computational efficiency. All the improvements are implemented in the rebmix R package.
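To make the initialization issue concrete, below is a minimal sketch contrasting two standard EM starting strategies on synthetic data with scikit-learn's GaussianMixture. REBMIX itself is distributed as the rebmix R package; this snippet only illustrates why the starting point matters, not the REBMIX procedure.

```python
# Minimal sketch: EM for a GMM can converge to different solutions
# depending on how it is initialized. Dataset and seeds are illustrative.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
# Synthetic 3-component data in 2-D.
X = np.vstack([rng.normal(loc=m, scale=0.5, size=(200, 2))
               for m in (-3.0, 0.0, 3.0)])

for init in ("random", "kmeans"):
    gmm = GaussianMixture(n_components=3, init_params=init,
                          n_init=1, random_state=0).fit(X)
    # lower_bound_ is the per-sample log-likelihood bound at convergence.
    print(init, gmm.lower_bound_)
```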

Author(s):  
Asger Hobolth ◽  
Jens Ledet Jensen

We describe statistical inference in continuous-time Markov processes of DNA sequences related by a phylogenetic tree. The maximum-likelihood estimator can be found by the expectation-maximization (EM) algorithm, and an expression for the information matrix is also derived. We provide explicit analytical solutions for the EM algorithm and the information matrix.
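As a minimal illustration of likelihood-based inference for substitution models, the sketch below computes the closed-form maximum-likelihood distance between two aligned sequences under the Jukes-Cantor (JC69) model. On a general phylogenetic tree no such closed form exists, which is where the paper's EM algorithm comes in.

```python
# JC69 maximum-likelihood distance between two aligned sequences:
# d = -(3/4) * ln(1 - (4/3) * p), valid for mismatch fraction p < 3/4.
import math

def jc69_ml_distance(seq_a: str, seq_b: str) -> float:
    assert len(seq_a) == len(seq_b), "sequences must be aligned"
    p = sum(a != b for a, b in zip(seq_a, seq_b)) / len(seq_a)
    return -0.75 * math.log(1.0 - (4.0 / 3.0) * p)

# Two mismatches out of ten sites -> d is approximately 0.233.
print(jc69_ml_distance("ACGTACGTAC", "ACGTACGAAC"))
```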


2013 ◽  
Vol 12 (03) ◽  
pp. 1350012 ◽  
Author(s):  
OSONDE OSOBA ◽  
SANYA MITAIM ◽  
BART KOSKO

We present a noise-injected version of the expectation–maximization (EM) algorithm: the noisy expectation–maximization (NEM) algorithm. The NEM algorithm uses noise to speed up the convergence of the EM algorithm. The NEM theorem shows that additive noise speeds up the average convergence of the EM algorithm to a local maximum of the likelihood surface if a positivity condition holds. Corollary results give special cases when noise improves the EM algorithm. We demonstrate these noise benefits on EM algorithms for three data models: the Gaussian mixture model (GMM), the Cauchy mixture model (CMM), and the censored log-convex gamma model. The NEM positivity condition simplifies to a quadratic inequality in the GMM and CMM cases. A final theorem shows that the noise benefit for independent identically distributed additive noise decreases with sample size in mixture models. This theorem implies that the noise benefit is most pronounced if the data is sparse.
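A compact sketch of the NEM idea on a 1-D, two-component GMM follows. The annealed Gaussian noise schedule and all parameter values are illustrative assumptions, not the paper's exact positivity condition; as the noise variance decays, later iterations reduce to standard EM.

```python
# Noise-injected EM (NEM) sketch for a 1-D, 2-component GMM.
import numpy as np

rng = np.random.default_rng(1)
x = np.concatenate([rng.normal(-2, 1, 300), rng.normal(2, 1, 300)])

mu = np.array([-0.5, 0.5])        # deliberately poor initial means
var = np.array([1.0, 1.0])
w = np.array([0.5, 0.5])

for t in range(1, 51):
    # Annealed additive noise: standard deviation shrinks as 1/t.
    xn = x + rng.normal(0.0, 0.5 / t, size=x.shape)
    # E-step on the noisy data.
    d = xn[:, None] - mu[None, :]
    resp = w * np.exp(-0.5 * d**2 / var) / np.sqrt(2 * np.pi * var)
    resp /= resp.sum(axis=1, keepdims=True)
    # M-step on the noisy data as well; the noise anneals away.
    nk = resp.sum(axis=0)
    mu = (resp * xn[:, None]).sum(axis=0) / nk
    var = (resp * (xn[:, None] - mu) ** 2).sum(axis=0) / nk
    w = nk / len(xn)

print(mu, var, w)  # means should approach -2 and +2
```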


Sensors ◽  
2021 ◽  
Vol 21 (16) ◽  
pp. 5549
Author(s):  
Ossi Kaltiokallio ◽  
Roland Hostettler ◽  
Hüseyin Yiğitler ◽  
Mikko Valkama

Received signal strength (RSS) changes of static wireless nodes can be used for device-free localization and tracking (DFLT). Most RSS-based DFLT systems require access to calibration data, either RSS measurements from a time period when the area was not occupied by people, or measurements while a person stands in known locations. Such calibration periods can be very expensive in terms of time and effort, making system deployment and maintenance challenging. This paper develops an Expectation-Maximization (EM) algorithm based on Gaussian smoothing for estimating the unknown RSS model parameters, liberating the system from supervised training and calibration periods. To fully use the EM algorithm’s potential, a novel localization-and-tracking system is presented to estimate a target’s arbitrary trajectory. To demonstrate the effectiveness of the proposed approach, it is shown that: (i) the system requires no calibration period; (ii) the EM algorithm improves the accuracy of existing DFLT methods; (iii) it is computationally very efficient; and (iv) the system outperforms a state-of-the-art adaptive DFLT system in terms of tracking accuracy.
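The sketch below shows the structure of smoothing-based EM on a toy 1-D linear-Gaussian state-space model: the E-step runs a Kalman filter followed by a Rauch-Tung-Striebel smoother, and the M-step re-estimates the measurement-noise variance in closed form. The paper's RSS measurement model is nonlinear, so this only mirrors the skeleton of the approach, with illustrative parameters.

```python
# EM with Gaussian smoothing on a toy 1-D random-walk model.
import numpy as np

rng = np.random.default_rng(2)
T, Q, R_true = 200, 0.1, 0.5
x = np.cumsum(rng.normal(0.0, np.sqrt(Q), T))   # random-walk state
y = x + rng.normal(0.0, np.sqrt(R_true), T)     # noisy observations

R = 5.0                                         # poor initial guess for R
for _ in range(30):
    # E-step, forward pass: Kalman filter.
    m_f, P_f = np.zeros(T), np.zeros(T)
    m, P = 0.0, 1.0
    for k in range(T):
        P = P + Q                               # predict
        K = P / (P + R)                         # Kalman gain
        m = m + K * (y[k] - m)                  # update
        P = (1.0 - K) * P
        m_f[k], P_f[k] = m, P
    # E-step, backward pass: Rauch-Tung-Striebel smoother.
    m_s, P_s = m_f.copy(), P_f.copy()
    for k in range(T - 2, -1, -1):
        G = P_f[k] / (P_f[k] + Q)
        m_s[k] = m_f[k] + G * (m_s[k + 1] - m_f[k])
        P_s[k] = P_f[k] + G**2 * (P_s[k + 1] - (P_f[k] + Q))
    # M-step: closed-form update of the measurement-noise variance.
    R = float(np.mean((y - m_s) ** 2 + P_s))

print("estimated R:", R, "true R:", R_true)
```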


2014 ◽  
Vol 2014 ◽  
pp. 1-10 ◽  
Author(s):  
Xianghui Yuan ◽  
Feng Lian ◽  
Chongzhao Han

Tracking a target with coordinated-turn (CT) motion is highly dependent on the chosen models and algorithms. First, this paper compares the widely used models: the coordinated-turn (CT) model with known turn rate, the augmented coordinated-turn (ACT) model with Cartesian velocity, the ACT model with polar velocity, the CT model using a kinematic constraint, and the maneuver-centered circular-motion model. Then, in the single-model tracking framework, the tracking algorithms for the last four models are compared, and suggestions are given on the choice of model for different practical target-tracking problems. Finally, in the multiple-model (MM) framework, an algorithm based on the expectation-maximization (EM) algorithm is derived, in both batch and recursive forms. Compared with the widely used interacting multiple model (IMM) algorithm, the EM-based algorithm demonstrates its effectiveness.
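For reference, a short sketch of the first model in the comparison, the CT model with known turn rate, is given below; the state layout [x, vx, y, vy] and all values are illustrative.

```python
# Coordinated-turn (CT) transition matrix for a known turn rate omega
# and sampling interval dt, acting on the state [x, vx, y, vy].
import numpy as np

def ct_transition(omega: float, dt: float) -> np.ndarray:
    s, c = np.sin(omega * dt), np.cos(omega * dt)
    return np.array([
        [1, s / omega,       0, -(1 - c) / omega],
        [0, c,               0, -s],
        [0, (1 - c) / omega, 1,  s / omega],
        [0, s,               0,  c],
    ])

x = np.array([0.0, 10.0, 0.0, 0.0])      # heading east at 10 m/s
for _ in range(10):
    x = ct_transition(omega=np.deg2rad(9), dt=1.0) @ x
print(x)  # after 10 s the velocity vector has rotated 90 degrees
```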


2014 ◽  
Vol 2014 ◽  
pp. 1-7 ◽  
Author(s):  
Natee Thong-un ◽  
Minoru K. Kurosawa

Overlapping signals are a significant problem in multiple-object localization. Doppler velocity is sensitive to the echo shape and can be related to the physical properties of moving objects, especially for a pulse-compression ultrasonic signal. The expectation-maximization (EM) algorithm is capable of signal separation, so applying it to overlapping pulse-compression signals is of interest. This paper describes a proposed method, based on the EM algorithm, for Doppler velocity estimation of overlapping linear-period-modulated (LPM) ultrasonic signals. Simulations are used to validate the proposed method.
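The sketch below generates an LPM pulse, whose instantaneous period grows linearly in time, together with a Doppler-scaled echo, and applies pulse compression by cross-correlation. Parameter values are illustrative, and the EM-based separation of overlapping echoes is not reproduced here.

```python
# LPM pulse: instantaneous period T(t) = T0 + b*t, so the phase is
# (2*pi/b) * ln(1 + b*t/T0). A Doppler factor scales time, which for
# LPM appears mainly as a shift of the compressed peak.
import numpy as np

fs, dur = 1e6, 2e-3                      # sample rate [Hz], pulse length [s]
t = np.arange(0.0, dur, 1.0 / fs)
T0, b = 2.5e-5, 0.0125                   # 40 kHz start; period doubles

def lpm(t):
    return np.cos(2.0 * np.pi / b * np.log1p(b * t / T0))

tx = lpm(t)                              # transmitted pulse
alpha = 1.001                            # Doppler time-scale factor
rx = lpm(alpha * t)                      # echo from a moving reflector

# Pulse compression: cross-correlate the echo with the transmitted pulse.
cc = np.correlate(rx, tx, mode="full")
lag = int(np.argmax(np.abs(cc))) - (len(tx) - 1)
print("compressed-peak lag (samples):", lag)
```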


2019 ◽  
Vol 2019 ◽  
pp. 1-10 ◽  
Author(s):  
Yupeng Li ◽  
Jianhua Zhang ◽  
Ruisi He ◽  
Lei Tian ◽  
Hewen Wei

In this paper, the Gaussian mixture model (GMM) is introduced for channel multipath clustering. In the GMM setting, the expectation-maximization (EM) algorithm is usually used to estimate the model parameters. However, EM frequently converges to a local optimum. To address this issue, a hybrid differential-evolution (DE) and EM algorithm (DE-EM) is proposed in this paper. Specifically, DE is employed to initialize the GMM parameters, and the parameters are then estimated with the EM algorithm. Thanks to the global searching ability of DE, the proposed hybrid DE-EM algorithm is more likely to find the global optimum. Simulations demonstrate that the proposed DE-EM clustering algorithm significantly improves clustering performance.
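A minimal sketch of the DE-then-EM structure on 1-D synthetic data follows, using SciPy's differential_evolution to search globally for good component means and scikit-learn's GaussianMixture for the EM refinement. The paper applies the hybrid to channel multipath data, which this toy example does not attempt.

```python
# DE finds good initial GMM means; EM then refines the full model.
import numpy as np
from scipy.optimize import differential_evolution
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(3)
X = np.concatenate([rng.normal(-4, 1, 300), rng.normal(0, 1, 300),
                    rng.normal(4, 1, 300)]).reshape(-1, 1)

def neg_loglik(means):
    # Equal-weight, unit-variance GMM log-likelihood as a function of means.
    d = X - means[None, :]                       # shape (n_samples, k)
    logp = -0.5 * d**2 - 0.5 * np.log(2.0 * np.pi)
    return -np.logaddexp.reduce(logp + np.log(1.0 / len(means)), axis=1).sum()

# Global search for the means, then a standard EM refinement.
res = differential_evolution(neg_loglik, bounds=[(-8, 8)] * 3, seed=0)
gmm = GaussianMixture(n_components=3, random_state=0,
                      means_init=np.sort(res.x).reshape(-1, 1)).fit(X)
print(np.sort(res.x), gmm.means_.ravel())
```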


Author(s):  
Chandan K. Reddy ◽  
Bala Rajaratnam

In the field of statistical data mining, the Expectation Maximization (EM) algorithm is one of the most popular methods used for solving parameter estimation problems in the maximum likelihood (ML) framework. Compared to traditional methods such as steepest descent, conjugate gradient, or Newton-Raphson, which are often too complicated to use for these problems, EM has become popular because it takes advantage of problem-specific properties (Xu et al., 1996). The EM algorithm converges to a local maximum of the log-likelihood function under very general conditions (Dempster et al., 1977; Redner et al., 1984). Efficiently maximizing the likelihood by augmenting it with latent variables, together with guaranteed convergence, are among the important hallmarks of the EM algorithm. EM-based methods have been applied successfully to a wide range of problems in pattern recognition, clustering, information retrieval, computer vision, and bioinformatics (Reddy et al., 2006; Carson et al., 2002; Nigam et al., 2000). Given an initial set of parameters, the EM algorithm can be implemented to compute parameter estimates that locally maximize the likelihood function of the data. In spite of its strong theoretical foundations and wide applicability to real-world problems, the standard EM algorithm suffers from certain fundamental drawbacks in practical settings. Some of the main difficulties of using the EM algorithm on a general log-likelihood surface are as follows (Reddy et al., 2008):
• The EM algorithm for mixture modeling converges to a local maximum of the log-likelihood function very quickly.
• Many other promising locally optimal solutions lie in the close vicinity of the solutions obtained from methods that provide good initial guesses.
• Model-selection criteria usually assume that the globally optimal solution of the log-likelihood function can be obtained, but achieving this is computationally intractable.
• Some regions of the search space do not contain any promising solutions; because promising and non-promising regions coexist, it is challenging to avoid wasting computational resources searching the non-promising ones.
Of these concerns, the fact that the local maxima are not distributed uniformly makes it important to develop algorithms that not only avoid inefficient search over low-likelihood regions but also explore promising subspaces more thoroughly (Zhang et al., 2004). Such a subspace search also makes the solution less sensitive to the initial set of parameters. In this chapter, we discuss the theoretical aspects of the EM algorithm and demonstrate its use in obtaining optimal estimates of the parameters of mixture models. We also discuss some practical concerns of using the EM algorithm and present a few results on the performance of various algorithms that address these problems.
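The local-maxima concern above is easy to reproduce: the short sketch below (with illustrative synthetic data) runs EM from several random starts and prints the spread of converged log-likelihood bounds, which is why the best of many restarts is typically kept.

```python
# Repeated EM runs from random starts converge to different local maxima.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(4)
X = np.vstack([rng.normal(m, 0.4, (150, 2)) for m in (-2.0, 0.0, 2.0)])

scores = [GaussianMixture(n_components=3, init_params="random",
                          random_state=seed).fit(X).lower_bound_
          for seed in range(10)]
print(sorted(scores))  # a spread of values indicates distinct local maxima
```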

