Automatic language identification based on Gaussian mixture model and universal background model

Author(s):  
Dan Qu ◽  
Bingxi Wang ◽  
Xin Wei

Most existing language identification (LID) systems are based on the Gaussian mixture model (GMM). The main drawback of a GMM-based LID system is that it requires a large amount of speech data to train the model. Most Indian languages are similar because they are derived from Devanagari. Even though common phonemes exist across the phoneme sets of the Indian languages, each language has its own phonotactic constraints. The ability of a modeling technique to capture these slight language-specific variations is an important cue for language identification. Capturing them with a GMM-based LID system requires a large number of mixture components, which in turn requires a large amount of training data for each language class, and such data is very difficult to obtain for Indian languages. The main advantage of a GMM-UBM based LID system is that it requires less training data. In this paper, the importance of GMM-UBM modeling for the LID task for Indian languages is explored using a new set of feature vectors. In the GMM-UBM LID system based on the new feature vectors, the phonotactic variations of the different Indian languages are modeled using the Gaussian mixture model and universal background model (GMM-UBM) technique. In this type of modeling, some data from each language class is pooled to create a universal background model (UBM). Each language model is then adapted from this UBM. The study finds that the GMM-UBM based LID system using the new feature vectors outperforms the conventional GMM LID system using the same feature vectors.
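The pooling and adaptation steps described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how the language models are adapted, so the sketch assumes mean-only MAP adaptation with a relevance factor of 16, diagonal covariances, and synthetic 2-D features. All function names and the `lang_a`/`lang_b` labels are hypothetical.

```python
import numpy as np

def log_gauss_diag(X, means, variances):
    """Log density of each frame under each diagonal Gaussian, shape (T, K)."""
    diff = X[:, None, :] - means[None, :, :]
    return -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                   + np.sum(np.log(2 * np.pi * variances), axis=1))

def posteriors(X, gmm):
    """Component responsibilities (T, K) and per-frame log-likelihoods (T,)."""
    w, means, variances = gmm
    log_p = np.log(w) + log_gauss_diag(X, means, variances)
    log_norm = np.logaddexp.reduce(log_p, axis=1, keepdims=True)
    return np.exp(log_p - log_norm), log_norm.ravel()

def train_ubm(X, k, iters=20, seed=0):
    """Fit a k-component diagonal GMM to the pooled data with plain EM."""
    rng = np.random.default_rng(seed)
    means = X[rng.choice(len(X), size=k, replace=False)].copy()
    variances = np.tile(X.var(axis=0), (k, 1))
    w = np.full(k, 1.0 / k)
    for _ in range(iters):
        resp = posteriors(X, (w, means, variances))[0]
        n = resp.sum(axis=0) + 1e-10
        w = n / n.sum()
        means = (resp.T @ X) / n[:, None]
        variances = np.maximum((resp.T @ X ** 2) / n[:, None] - means ** 2, 1e-3)
    return w, means, variances

def map_adapt_means(X, ubm, relevance=16.0):
    """Assumed adaptation rule: shift only the UBM means toward the language data."""
    w, means, variances = ubm
    resp = posteriors(X, ubm)[0]
    n = resp.sum(axis=0)
    data_means = (resp.T @ X) / np.maximum(n, 1e-10)[:, None]
    alpha = (n / (n + relevance))[:, None]
    return w, alpha * data_means + (1 - alpha) * means, variances

def identify(X, models):
    """Pick the language whose adapted model gives the highest mean log-likelihood."""
    return max(models, key=lambda lang: posteriors(X, models[lang])[1].mean())

def toy_language(y_shift, n, seed):
    """Synthetic frames: two shared clusters along x, language-specific shift in y."""
    rng = np.random.default_rng(seed)
    centers = np.array([[-3.0, y_shift], [3.0, y_shift]])
    return centers[rng.integers(0, 2, size=n)] + rng.normal(size=(n, 2))

train = {"lang_a": toy_language(+1.0, 400, seed=1),
         "lang_b": toy_language(-1.0, 400, seed=2)}
# Pool part of each language's data to train the UBM, then adapt per language.
ubm = train_ubm(np.vstack([X[:200] for X in train.values()]), k=2)
models = {lang: map_adapt_means(X, ubm) for lang, X in train.items()}
test_a = toy_language(+1.0, 200, seed=3)
test_b = toy_language(-1.0, 200, seed=4)
print(identify(test_a, models), identify(test_b, models))
```

Because every language model starts from the same UBM and only moves the means, each language needs enough data to shift a shared set of components rather than to estimate a full mixture from scratch, which is the data-efficiency argument the abstract makes.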


2016 ◽  
Vol 9 (1) ◽  
pp. 36-40
Author(s):  
Renu Singh ◽  
Arvind Singh ◽  
Utpal Bhattacharjee

This paper presents a review of various speaker verification approaches under realistic conditions and explores two combined approaches: Gaussian mixture model (GMM) with support vector machine (SVM), and GMM with universal background model (UBM).


2015 ◽  
Vol 734 ◽  
pp. 463-467 ◽  
Author(s):  
Pan Pan Zhang ◽  
Chun Yang Mu ◽  
Xing Ma ◽  
Fu Lu Xu

Detection of moving objects is a hot topic in computer vision. Traditionally, a Gaussian mixture background model is evaluated for every pixel in the whole image, which wastes time and memory. To improve computational efficiency, an improved Gaussian mixture model based on a Region of Interest is proposed. First, the method finds the region where the target is most likely to appear. Then a Gaussian mixture background model is built only in this area. Finally, a morphological filter is used to improve the integrity of the detected targets. Results show that the improved method detects targets more completely without taking more time than the typical method.
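The three steps above (locate a region of interest, model the background only inside it, clean the mask morphologically) can be sketched as follows. The abstract gives no implementation details, so everything here is an assumption made for illustration: the region is found by frame differencing, a single running Gaussian per pixel stands in for the full per-pixel mixture, and the morphological filter is a 3x3 opening. Function names, thresholds, and the learning rate are hypothetical.

```python
import numpy as np

def find_roi(prev, curr, thresh=25.0, margin=2):
    """Bounding box (r0, r1, c0, c1) around pixels that changed between frames."""
    moving = np.abs(curr.astype(float) - prev.astype(float)) > thresh
    if not moving.any():
        return None
    rows = np.flatnonzero(moving.any(axis=1))
    cols = np.flatnonzero(moving.any(axis=0))
    h, w = curr.shape
    return (max(rows[0] - margin, 0), min(rows[-1] + margin + 1, h),
            max(cols[0] - margin, 0), min(cols[-1] + margin + 1, w))

def _neighbourhoods(mask):
    """Stack of the nine 3x3-neighbourhood shifts of a boolean mask."""
    p = np.pad(mask, 1, constant_values=False)
    h, w = mask.shape
    return np.stack([p[i:i + h, j:j + w] for i in range(3) for j in range(3)])

def opening(mask):
    """3x3 erosion followed by 3x3 dilation; removes isolated foreground pixels."""
    eroded = _neighbourhoods(mask).all(axis=0)
    return _neighbourhoods(eroded).any(axis=0)

def detect(bg_mean, bg_var, prev, curr, k=2.5, lr=0.05):
    """Foreground mask for curr; background statistics are updated in place,
    and only inside the region of interest."""
    mask = np.zeros(curr.shape, dtype=bool)
    roi = find_roi(prev, curr)
    if roi is None:
        return mask
    r0, r1, c0, c1 = roi
    x = curr[r0:r1, c0:c1].astype(float)
    mu = bg_mean[r0:r1, c0:c1]      # views into the full-frame model
    var = bg_var[r0:r1, c0:c1]
    fg = np.abs(x - mu) > k * np.sqrt(var)
    bg = ~fg                        # update the model only where the pixel matched it
    mu[bg] += lr * (x[bg] - mu[bg])
    var[bg] += lr * ((x[bg] - mu[bg]) ** 2 - var[bg])
    mask[r0:r1, c0:c1] = fg
    return opening(mask)

# Toy frames: a flat background, a 4x4 moving block, and one noisy pixel.
prev = np.full((20, 20), 10, dtype=np.uint8)
curr = prev.copy()
curr[5:9, 6:10] = 200
curr[15, 15] = 200
bg_mean = prev.astype(float)
bg_var = np.full(prev.shape, 4.0)
mask = detect(bg_mean, bg_var, prev, curr)
print(int(mask.sum()))
```

The per-pixel work is limited to the slice returned by `find_roi`, which is where the claimed saving over a whole-image model would come from.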


This paper proposes a novel approach that combines the power of generative Gaussian mixture models (GMM) and discriminative support vector machines (SVM). The main objective of this paper is to use GMM supervectors with an SVM classifier for the language identification (LID) task. A GMM-based LID system requires a large amount of training data to capture all the variations in the phonotactic constraints imposed by each language. Gaussian mixture model (GMM) and universal background model (UBM) modeling requires less training data. In the GMM-UBM LID system, a language model is created by maximum a posteriori (MAP) adaptation of the means of the UBM. The GMM supervectors are created by concatenating the means of the mixture components adapted from the UBM. These supervectors are then passed to an SVM for classification. In this paper, the performance of the SVM-based GMM-UBM LID system is compared with that of the conventional GMM LID system. The performance analysis shows that the SVM-based GMM-UBM LID system performs better than the GMM-based LID system.
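For reference, the mean adaptation and supervector construction described above are commonly written as follows. The abstract does not give these formulas; this is the standard relevance-MAP form, and the relevance factor $r$, the frame index $t$, and the kernel $\kappa$ are assumptions introduced here for illustration.

```latex
% UBM: weights w_k, means \mu_k, covariances \Sigma_k, k = 1..K; frames x_1..x_T in R^D
\begin{align}
\gamma_k(x_t) &= \frac{w_k\,\mathcal{N}(x_t \mid \mu_k, \Sigma_k)}
                      {\sum_{j=1}^{K} w_j\,\mathcal{N}(x_t \mid \mu_j, \Sigma_j)}
  && \text{(posterior of component } k\text{)} \\
n_k &= \sum_{t=1}^{T} \gamma_k(x_t), \qquad
E_k(x) = \frac{1}{n_k} \sum_{t=1}^{T} \gamma_k(x_t)\, x_t
  && \text{(sufficient statistics)} \\
\alpha_k &= \frac{n_k}{n_k + r}
  && \text{(data-dependent mixing, assumed relevance factor } r\text{)} \\
\hat{\mu}_k &= \alpha_k\, E_k(x) + (1 - \alpha_k)\, \mu_k
  && \text{(MAP-adapted mean)} \\
m &= \begin{bmatrix} \hat{\mu}_1^{\top} & \hat{\mu}_2^{\top} & \cdots & \hat{\mu}_K^{\top} \end{bmatrix}^{\top}
     \in \mathbb{R}^{KD}
  && \text{(GMM supervector)} \\
f(m) &= \sum_{i} a_i\, y_i\, \kappa(m, m_i) + b
  && \text{(SVM decision function, kernel } \kappa \text{ unspecified in the abstract)}
\end{align}
```

Components that see little data ($n_k \ll r$) stay close to the UBM mean, so the supervector has a fixed length $KD$ regardless of utterance duration, which is what lets it be fed to an SVM.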

