Application of l1 Estimation of Gaussian Mixture Model Parameters for Language Identification

Author(s):  
Danila Doroshin ◽  
Maxim Tkachenko ◽  
Nikolay Lubimov ◽  
Mikhail Kotov


2021 ◽  
Author(s):  
Hua Yuan

The objective of this thesis is to acquire abstract image features through statistical modelling in the wavelet domain and then, based on the extracted image features, to develop an effective content-based image retrieval (CBIR) system and a fragile watermarking scheme. In this thesis, we first present a statistical modelling of images in the wavelet domain through a Gaussian mixture model (GMM) and a generalized Gaussian mixture model (GGMM). An Expectation-Maximization (EM) algorithm is developed to estimate the model parameters. A novel similarity measure based on the Kullback-Leibler divergence is also developed to calculate the distance between two distinct model distributions. We then apply the statistical modelling to two application areas: image retrieval and fragile watermarking. In image retrieval, the model parameters are employed as image features to compose the indexing feature space, while the feature distance between two compared images is computed using the novel similarity measure. The new image retrieval method achieves better retrieval performance than most conventional methods. In fragile watermarking, the model parameters are utilized for watermark embedding. The new watermarking scheme achieves virtually imperceptible embedding of watermarks because it modifies only a small amount of image data and embeds watermarks at image texture edges. A multiscale embedding of fragile watermarks is introduced to enhance the embedding rate and, on the other hand, to constitute a semi-fragile approach.
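The thesis pairs per-subband mixture models with a Kullback-Leibler-based distance. The KL divergence between two full mixtures has no closed form, but the single-Gaussian-per-subband special case does, and it illustrates how such a feature distance can be assembled. The sketch below is a minimal illustration, not the thesis implementation; the function names and the symmetrized-sum combination are assumptions.

```python
import math

def kl_gaussian(mu1, sigma1, mu2, sigma2):
    """Closed-form KL divergence KL(N(mu1, sigma1^2) || N(mu2, sigma2^2))."""
    return (math.log(sigma2 / sigma1)
            + (sigma1 ** 2 + (mu1 - mu2) ** 2) / (2 * sigma2 ** 2)
            - 0.5)

def subband_distance(params_a, params_b):
    """Symmetrized KL distance summed over wavelet subbands.

    Each params list holds one (mu, sigma) pair per subband, playing the
    role of the per-subband model parameters used as image features.
    """
    total = 0.0
    for (m1, s1), (m2, s2) in zip(params_a, params_b):
        total += kl_gaussian(m1, s1, m2, s2) + kl_gaussian(m2, s2, m1, s1)
    return total
```

Symmetrizing the divergence makes the distance order-independent, which is desirable when ranking database images against a query.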


2014 ◽  
Vol 599-601 ◽  
pp. 814-818 ◽  
Author(s):  
Xue Yuan Chen ◽  
Xia Fu Lv ◽  
Jie Liu

The Gaussian mixture model is a popular method for detecting moving targets with static cameras. Since the traditional Gaussian mixture model adapts poorly to illumination changes in the scene and uses a fixed learning rate, this paper describes a method that detects illumination variation and updates the learning rate adaptively. It proposes an approach that uses a color histogram matching algorithm and adjusts the learning rate automatically after introducing an illumination variation factor into the model parameters. Furthermore, the proposed method selects the number of model components adaptively, which reduces the computational complexity and improves real-time performance. The experimental results indicate that the detection system achieves better robustness, adaptability, and stability.
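The abstract does not give the exact update rule, but the core idea of tying the learning rate to an illumination-change factor can be sketched as follows. The factor formula, the scaling constant, and the single-Gaussian background simplification are all assumptions for illustration only.

```python
import numpy as np

def adaptive_learning_rate(frame, background, alpha_base=0.01,
                           alpha_min=0.001, alpha_max=0.2):
    """Scale the background-model learning rate by a global illumination factor.

    The factor compares mean gray levels of the current frame and the
    background model; a large jump (e.g., lights switched on) raises the
    learning rate so the model re-adapts quickly. The 10x scaling is ad hoc.
    """
    illum_factor = abs(frame.mean() - background.mean()) / 255.0
    alpha = alpha_base * (1.0 + 10.0 * illum_factor)
    return float(np.clip(alpha, alpha_min, alpha_max))

def update_background(frame, background, alpha):
    """Running-average background update (single-Gaussian simplification)."""
    return (1.0 - alpha) * background + alpha * frame
```

In a full implementation, the illumination factor would come from the paper's color-histogram matching step rather than a plain mean-brightness difference.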


2019 ◽  
Vol 2019 ◽  
pp. 1-10 ◽  
Author(s):  
Yupeng Li ◽  
Jianhua Zhang ◽  
Ruisi He ◽  
Lei Tian ◽  
Hewen Wei

In this paper, the Gaussian mixture model (GMM) is introduced for channel multipath clustering. In the GMM field, the expectation-maximization (EM) algorithm is usually utilized to estimate the model parameters. However, EM frequently converges to a local optimum. To address this issue, a hybrid differential evolution and EM (DE-EM) algorithm is proposed in this paper. To be specific, DE is employed to initialize the GMM parameters; the parameters are then refined with the EM algorithm. Thanks to the global searching ability of DE, the proposed hybrid DE-EM algorithm is more likely to reach the global optimum. Simulations demonstrate that our proposed DE-EM clustering algorithm can significantly improve the clustering performance.
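The DE-then-EM pipeline can be sketched in one dimension: DE searches globally over the component means (holding variances and weights fixed), and EM then refines all parameters from that starting point. This is a toy sketch of the idea, not the authors' algorithm; the DE variant, population sizes, and the decision to optimize only the means are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def gmm_loglik(data, means, sigmas, weights):
    """Log-likelihood of 1-D data under a Gaussian mixture."""
    d = data[:, None]
    comp = weights * np.exp(-0.5 * ((d - means) / sigmas) ** 2) / (sigmas * np.sqrt(2 * np.pi))
    return np.log(comp.sum(axis=1) + 1e-300).sum()

def de_init_means(data, k, pop=20, gens=40, f=0.8, cr=0.9):
    """Differential evolution over the k component means (sigmas/weights fixed)."""
    lo, hi = data.min(), data.max()
    sig = np.full(k, data.std() / k)
    w = np.full(k, 1.0 / k)
    popu = rng.uniform(lo, hi, size=(pop, k))
    fit = np.array([gmm_loglik(data, m, sig, w) for m in popu])
    for _ in range(gens):
        for i in range(pop):
            # mutate from three distinct individuals, binomial crossover
            a, b, c = popu[rng.choice([j for j in range(pop) if j != i], 3, replace=False)]
            trial = np.where(rng.random(k) < cr, a + f * (b - c), popu[i])
            fl = gmm_loglik(data, trial, sig, w)
            if fl > fit[i]:           # greedy selection
                popu[i], fit[i] = trial, fl
    return popu[fit.argmax()]

def em(data, means, iters=50):
    """Standard EM refinement starting from the DE-supplied means."""
    k = len(means)
    sig = np.full(k, data.std())
    w = np.full(k, 1.0 / k)
    for _ in range(iters):
        d = data[:, None]
        resp = w * np.exp(-0.5 * ((d - means) / sig) ** 2) / sig   # E-step
        resp /= resp.sum(axis=1, keepdims=True)
        nk = resp.sum(axis=0)                                      # M-step
        means = (resp * d).sum(axis=0) / nk
        sig = np.sqrt((resp * (d - means) ** 2).sum(axis=0) / nk) + 1e-6
        w = nk / len(data)
    return means, sig, w
```

Because DE's greedy selection never discards a fitter individual, the best initialization can only improve across generations, which is what reduces EM's exposure to poor local optima.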


Most of the existing LID systems are based on the Gaussian mixture model (GMM). The main drawback of a GMM-based LID system is that it requires a large amount of speech data to train the GMM. Most Indian languages are similar because they are derived from Devanagari. Even though common phonemes exist in the phoneme sets across Indian languages, each language has its own unique phonotactic constraints. Any modeling technique capable of capturing all these slight variations imposed by a language provides an important language identification cue. A GMM-based LID system that captures these variations requires a large number of mixture components, and modeling a large number of mixture components with a GMM in turn requires a large amount of training data for each language class, which is very difficult to obtain for Indian languages. The main advantage of a GMM-UBM-based LID system is that it requires less training data to train (model) the system. In this paper, the importance of GMM-UBM modeling for the language identification (LID) task for Indian languages is explored using a new set of feature vectors. In the GMM-UBM LID system based on the new feature vectors, the phonotactic variations imparted by different Indian languages are modeled using the Gaussian mixture model and universal background model (GMM-UBM) technique. In this type of modeling, some amount of data from each language class is pooled to create a universal background model, and each class model is then adapted from this UBM. In this study, it is found that the performance of the new-feature-vector GMM-UBM-based LID system is superior to that of the conventional GMM-based LID system using the same feature vectors.
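The adaptation step described above is commonly realized as MAP adaptation of the UBM means (Reynolds-style): each language model keeps the UBM's structure and shifts component means toward the language's data in proportion to the soft count of frames that component explains. The sketch below shows this for 1-D features; the relevance factor value and the means-only adaptation are conventional choices, not details taken from this paper.

```python
import numpy as np

def map_adapt_means(features, ubm_means, ubm_sigmas, ubm_weights, r=16.0):
    """MAP-adapt UBM component means to language-specific data.

    new_mean_k = (n_k * E_k + r * ubm_mean_k) / (n_k + r),
    where n_k is the soft frame count for component k, E_k the
    posterior-weighted data mean, and r the relevance factor.
    """
    d = features[:, None]                                 # (n_frames, 1)
    lik = ubm_weights * np.exp(-0.5 * ((d - ubm_means) / ubm_sigmas) ** 2) / ubm_sigmas
    post = lik / lik.sum(axis=1, keepdims=True)           # responsibilities (n, k)
    n_k = post.sum(axis=0)                                # soft counts per component
    e_k = (post * d).sum(axis=0) / np.maximum(n_k, 1e-10)
    alpha = n_k / (n_k + r)                               # data-vs-prior balance
    return alpha * e_k + (1.0 - alpha) * ubm_means
```

Components that see little language-specific data keep their UBM means, which is exactly why this scheme needs far less per-language training data than training a large GMM from scratch.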


SPE Journal ◽  
2021 ◽  
pp. 1-20
Author(s):  
Guohua Gao ◽  
Jeroen Vink ◽  
Fredrik Saaf ◽  
Terence Wells

Summary When formulating history matching within the Bayesian framework, we may quantify the uncertainty of model parameters and production forecasts using conditional realizations sampled from the posterior probability density function (PDF). It is quite challenging to sample such a posterior PDF. Some methods [e.g., Markov chain Monte Carlo (MCMC)] are very expensive, whereas other methods are cheaper but may generate biased samples. In this paper, we propose an unconstrained Gaussian mixture model (GMM) fitting method to approximate the posterior PDF and investigate new strategies to further enhance its performance. To reduce the central processing unit (CPU) time of handling bound constraints, we reformulate the GMM fitting formulation such that an unconstrained optimization algorithm can be applied to find the optimal solution of unknown GMM parameters. To obtain a sufficiently accurate GMM approximation with the lowest number of Gaussian components, we generate random initial guesses, remove components with very small or very large mixture weights after each GMM fitting iteration, and prevent their reappearance using a dedicated filter. To prevent overfitting, we add a new Gaussian component only if the quality of the GMM approximation on a (large) set of blind-test data sufficiently improves. The unconstrained GMM fitting method with the new strategies proposed in this paper is validated using nonlinear toy problems and then applied to a synthetic history-matching example. It can construct a GMM approximation of the posterior PDF that is comparable to the MCMC method, and it is significantly more efficient than the constrained GMM fitting formulation (e.g., reducing the CPU time by a factor of 800 to 7,300 for problems we tested), which makes it quite attractive for large-scale history-matching problems. NOTE: This paper is published as part of the 2021 SPE Reservoir Simulation Special Issue.
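The abstract does not state which reparameterization turns the bound-constrained GMM fitting into an unconstrained problem, but a common device for this purpose is the logistic (sigmoid/logit) change of variables: optimize over an unconstrained y and map it into the box (a, b). The sketch below illustrates that device only; it is an assumption, not the transform used in the paper.

```python
import math

def to_bounded(y, a, b):
    """Map an unconstrained variable y in R into the open interval (a, b)."""
    return a + (b - a) / (1.0 + math.exp(-y))

def to_unbounded(x, a, b):
    """Inverse (logit) map, so the optimizer can work without bound handling."""
    t = (x - a) / (b - a)
    return math.log(t / (1.0 - t))
```

Any unconstrained optimizer (e.g., a quasi-Newton method) can then search over y freely, and every iterate maps back to a feasible x, which removes the per-iteration cost of enforcing bounds.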

