3D Object Classification Using a Two-Dimensional Hidden Markov Model

2013 ◽  
Vol 411-414 ◽  
pp. 2041-2046 ◽  
Author(s):  
Jing Guo ◽  
Ming Quan Zhou ◽  
Chao Li ◽  
Zhe Shi

In this paper, we develop a novel method of 3D object classification based on a Two-Dimensional Hidden Markov Model (2D HMM). Hidden Markov Models are a widely used methodology for sequential data modeling, of growing importance in the last years. In the proposed approach, each object is decomposed by a spiderweb model and a shape function D2 is computed for each bin. These feature vectors are then arranged in a sequential fashion to compose a sequence vector, which is used to train HMMs. In 2D HMM, we assume that feature vectors are statistically dependent on an underlying state process which has transition probabilities conditioning the states of two neighboring bins. Thus the dependency of two dimensions is reflected simultaneously. To classify an object, the maximized posteriori probability is calculated by a given model and the observed sequence of an unknown object. Comparing with 1D HMM, the 2D HMM gets more information from the neighboring bins. Analysis and experimental results show that the proposed approach performs better than existing ones in database.

2014 ◽  
Vol 2014 ◽  
pp. 1-7 ◽  
Author(s):  
Md. Rabiul Islam ◽  
Md. Abdus Sobhan

The aim of the paper is to propose a feature fusion based Audio-Visual Speaker Identification (AVSI) system with varied conditions of illumination environments. Among the different fusion strategies, feature level fusion has been used for the proposed AVSI system where Hidden Markov Model (HMM) is used for learning and classification. Since the feature set contains richer information about the raw biometric data than any other levels, integration at feature level is expected to provide better authentication results. In this paper, both Mel Frequency Cepstral Coefficients (MFCCs) and Linear Prediction Cepstral Coefficients (LPCCs) are combined to get the audio feature vectors and Active Shape Model (ASM) based appearance and shape facial features are concatenated to take the visual feature vectors. These combined audio and visual features are used for the feature-fusion. To reduce the dimension of the audio and visual feature vectors, Principal Component Analysis (PCA) method is used. The VALID audio-visual database is used to measure the performance of the proposed system where four different illumination levels of lighting conditions are considered. Experimental results focus on the significance of the proposed audio-visual speaker identification system with various combinations of audio and visual features.


2017 ◽  
Vol 31 (4) ◽  
pp. 1543-1550 ◽  
Author(s):  
Lin Li ◽  
Tingfeng Ming ◽  
Shuyong Liu ◽  
Shuai Zhang

2016 ◽  
Vol 89 ◽  
pp. 435-446 ◽  
Author(s):  
Guo-gang Wang ◽  
Gui-jin Tang ◽  
Zong-liang Gan ◽  
Zi-guan Cui ◽  
Xiu-chang Zhu

2021 ◽  
Vol 17 (8) ◽  
pp. e1009280
Author(s):  
Indie C. Garwood ◽  
Sourish Chakravarty ◽  
Jacob Donoghue ◽  
Meredith Mahnke ◽  
Pegah Kahali ◽  
...  

Ketamine is an NMDA receptor antagonist commonly used to maintain general anesthesia. At anesthetic doses, ketamine causes high power gamma (25-50 Hz) oscillations alternating with slow-delta (0.1-4 Hz) oscillations. These dynamics are readily observed in local field potentials (LFPs) of non-human primates (NHPs) and electroencephalogram (EEG) recordings from human subjects. However, a detailed statistical analysis of these dynamics has not been reported. We characterize ketamine’s neural dynamics using a hidden Markov model (HMM). The HMM observations are sequences of spectral power in seven canonical frequency bands between 0 to 50 Hz, where power is averaged within each band and scaled between 0 and 1. We model the observations as realizations of multivariate beta probability distributions that depend on a discrete-valued latent state process whose state transitions obey Markov dynamics. Using an expectation-maximization algorithm, we fit this beta-HMM to LFP recordings from 2 NHPs, and separately, to EEG recordings from 9 human subjects who received anesthetic doses of ketamine. Our beta-HMM framework provides a useful tool for experimental data analysis. Together, the estimated beta-HMM parameters and optimal state trajectory revealed an alternating pattern of states characterized primarily by gamma and slow-delta activities. The mean duration of the gamma activity was 2.2s([1.7,2.8]s) and 1.2s([0.9,1.5]s) for the two NHPs, and 2.5s([1.7,3.6]s) for the human subjects. The mean duration of the slow-delta activity was 1.6s([1.2,2.0]s) and 1.0s([0.8,1.2]s) for the two NHPs, and 1.8s([1.3,2.4]s) for the human subjects. Our characterizations of the alternating gamma slow-delta activities revealed five sub-states that show regular sequential transitions. These quantitative insights can inform the development of rhythm-generating neuronal circuit models that give mechanistic insights into this phenomenon and how ketamine produces altered states of arousal.


Sign in / Sign up

Export Citation Format

Share Document