Outlier Detection in Large-Scale Traffic Data by Naïve Bayes Method and Gaussian Mixture Model Method

High-dimensional data recognition problem based on the Gaussian Mixture model has useful applications in many area, such as audio signal recognition, image analysis, and biological evolution. The expectation-maximization algorithm is a popular approach to the derivation of the maximum likelihood estimators of the Gaussian mixture model (GMM). An alternative solution is to adopt a generalized Bayes estimator for parameter estimation. In this study, an estimator based on the generalized Bayes approach is established. A simulation study shows that the proposed approach has a performance competitive to that of the conventional method in high-dimensional Gaussian mixture model recognition. We use a musical data example to illustrate this recognition problem. Suppose that we have audio data of a piece of music and know that the music is from one of four compositions, but we do not know exactly which composition it comes from. The generalized Bayes method shows a higher average recognition rate than the conventional method. This result shows that the generalized Bayes method is a competitor to the conventional method in this real application.

Download Full-text

A Gaussian Mixture Model Method for Eigenvalue-Based Spectrum Sensing with Uncalibrated Multiple Antennas

Signal Processing ◽

10.1016/j.sigpro.2021.108404 ◽

2021 ◽

pp. 108404

Author(s):

Saikat Majumder

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

Spectrum Sensing ◽

Multiple Antennas ◽

Gaussian Mixture ◽

Model Method

Download Full-text

Reduced Degrees of Freedom Gaussian Mixture Model Fitting for Large Scale History Matching Problems

10.2118/193916-ms ◽

2019 ◽

Author(s):

Guohua Gao ◽

Hao Jiang ◽

Chaohui Chen ◽

Jeroen C. Vink ◽

Yaakoub El Khamra ◽

...

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

History Matching ◽

Degrees Of Freedom ◽

Large Scale ◽

Model Fitting ◽

Gaussian Mixture ◽

Matching Problems

Download Full-text

Outlier Detection Algorithm Based on Gaussian Mixture Model

2019 IEEE International Conference on Power, Intelligent Computing and Systems (ICPICS) ◽

10.1109/icpics47731.2019.8942474 ◽

2019 ◽

Author(s):

Wenbo Liu ◽

Delong Cui ◽

Zhiping Peng ◽

Jihai Zhong

Keyword(s):

Outlier Detection ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Detection Algorithm

Download Full-text

Panoramic Gaussian Mixture Model and large-scale range background substraction method for PTZ camera-based surveillance systems

Machine Vision and Applications ◽

10.1007/s00138-012-0426-4 ◽

2012 ◽

Vol 24 (3) ◽

pp. 477-492 ◽

Cited By ~ 25

Author(s):

Kang Xue ◽

Yue Liu ◽

Gbolabo Ogunmakin ◽

Jing Chen ◽

Jiangen Zhang

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

Large Scale ◽

Gaussian Mixture ◽

Surveillance Systems ◽

Ptz Camera ◽

Scale Range

Download Full-text

PoSTcode: Probabilistic image-based spatial transcriptomics decoder

10.1101/2021.10.12.464086 ◽

2021 ◽

Author(s):

Milana Gataric ◽

Jun Sung Park ◽

Tong Li ◽

Vasy Vaskivskyi ◽

Jessica Svedlund ◽

...

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

Large Scale ◽

Probabilistic Method ◽

Gaussian Mixture ◽

Full Potential ◽

Correlated Noise ◽

New Approach ◽

Tuning Parameters

Realising the full potential of novel image-based spatial transcriptomic (IST) technologies requires robust and accurate algorithms for decoding the hundreds of thousand fluorescent signals each derived from single molecules of mRNA. In this paper, we introduce PoSTcode, a probabilistic method for transcript decoding from cyclic multi-channel images, whose effectiveness is demonstrated on multiple large-scale datasets generated using different versions of the in situ sequencing protocols. PoSTcode is based on a re-parametrised matrix-variate Gaussian mixture model designed to account for correlated noise across fluorescence channels and imaging cycles. PoSTcode is shown to recover up to 50% more confidently decoded molecules while simultaneously decreasing transcript mislabeling when compared to existing decoding techniques. In addition, we demonstrate its increased stability to various types of noise and tuning parameters, which makes this new approach reliable and easy to use in practice. Lastly, we show that PoSTcode produces fewer doublet signals compared to a pixel-based decoding algorithm.

Download Full-text

Outlier Detection in Energy Disaggregation Using Subspace Learning and Gaussian Mixture Model

International Journal of Control and Automation ◽

10.14257/ijca.2015.8.8.17 ◽

2015 ◽

Vol 8 (8) ◽

pp. 161-170 ◽

Cited By ~ 3

Author(s):

Xiu-ming Tang ◽

Rong-xiang Yuan ◽

Jun Chen

Keyword(s):

Outlier Detection ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Gaussian Mixture ◽

Subspace Learning ◽

Energy Disaggregation

Download Full-text

Estimating hotspots using a Gaussian mixture model from large-scale taxi GPS trace data

Transportation Safety and Environment ◽

10.1093/tse/tdz006 ◽

2019 ◽

Vol 1 (2) ◽

pp. 145-153

Author(s):

Jin-jun Tang ◽

Jin Hu ◽

Yi-wei Wang ◽

He-lai Huang ◽

Yin-hai Wang

Keyword(s):

Gaussian Mixture Model ◽

Mixture Model ◽

Large Scale ◽

Spatial Information ◽

Spatial Clustering ◽

Gaussian Mixture ◽

Traffic Information ◽

Trace Data ◽

Real Trace ◽

Two Parameters

Abstract The data collected from taxi vehicles using the global positioning system (GPS) traces provides abundant temporal-spatial information, as well as information on the activity of drivers. Using taxi vehicles as mobile sensors in road networks to collect traffic information is an important emerging approach in efforts to relieve congestion. In this paper, we present a hybrid model for estimating driving paths using a density-based spatial clustering of applications with noise (DBSCAN) algorithm and a Gaussian mixture model (GMM). The first step in our approach is to extract the locations from pick-up and drop-off records (PDR) in taxi GPS equipment. Second, the locations are classified into different clusters using DBSCAN. Two parameters (density threshold and radius) are optimized using real trace data recorded from 1100 drivers. A GMM is also utilized to estimate a significant number of locations; the parameters of the GMM are optimized using an expectation-maximum (EM) likelihood algorithm. Finally, applications are used to test the effectiveness of the proposed model. In these applications, locations distributed in two regions (a residential district and a railway station) are clustered and estimated automatically.

Download Full-text