Speech emotion recognition based on Gaussian Mixture Models and Deep Neural Networks

Metagenomics has become an integral part of defining microbial diversity in various environments. Many ecosystems have characteristically low biomass and few cultured representatives. Linking potential metabolisms to phylogeny in environmental microorganisms is important for interpreting microbial community functions and the impacts these communities have on geochemical cycles. However, metagenomic studies face the computational hurdle of ‘binning’ contigs into phylogenetically related units or putative genomes. Binning methods have been implemented with varying approaches such as k-means clustering, Gaussian mixture models, hierarchical clustering, neural networks, and two-way clustering; however, many of these suffer from biases against low-coverage or low-abundance organisms and closely related taxa or strains. We introduce a new binning method, BinSanity, that uses the affinity propagation (AP) clustering algorithm to cluster assemblies by coverage, followed by composition-based refinement (tetranucleotide frequency and percent GC content) to optimize bins containing multiple source organisms. Separating composition-based and coverage-based clustering reduces bias for closely related taxa. BinSanity was developed and tested on artificial metagenomes varying in size and complexity. Results indicate that BinSanity has higher precision, recall, and Adjusted Rand Index than five commonly implemented methods. When tested on a previously published environmental metagenome, BinSanity generated high-completion, low-redundancy bins corresponding to the published metagenome-assembled genomes.
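The two compositional features named in the abstract, tetranucleotide frequency and percent GC content, can be computed from a contig sequence along the following lines. This is a minimal sketch and not BinSanity's own code: the function names `gc_percent` and `tetranucleotide_frequencies` are hypothetical, and the choices to skip windows containing ambiguous bases and to leave reverse complements unmerged are assumptions made for illustration.

```python
from collections import Counter
from itertools import product


def gc_percent(seq: str) -> float:
    """Percent of G and C among the unambiguous bases (A, C, G, T) of seq."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        return 0.0  # assumption: report 0 for sequences with no usable bases
    return 100.0 * (seq.count("G") + seq.count("C")) / acgt


def tetranucleotide_frequencies(seq: str) -> dict[str, float]:
    """Relative frequency of each of the 256 possible 4-mers in seq.

    Overlapping windows are counted; windows containing characters other
    than A, C, G, T are ignored (an assumption of this sketch).
    """
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=4)]
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    total = sum(counts[k] for k in kmers)
    return {k: (counts[k] / total if total else 0.0) for k in kmers}


if __name__ == "__main__":
    contig = "ACGTACGTGGCC"  # toy sequence, not real data
    print(gc_percent(contig))
    print(tetranucleotide_frequencies(contig)["ACGT"])
```

Each contig then yields a 256-dimensional frequency vector plus one GC value, which is the kind of feature vector a composition-based refinement step could cluster.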

f-Similarity Preservation Loss for Soft Labels: A Demonstration on Cross-Corpus Speech Emotion Recognition

Proceedings of the AAAI Conference on Artificial Intelligence, 2019, Vol 33, pp. 5725-5732, DOI: 10.1609/aaai.v33i01.33015725

Author(s): Biqiao Zhang, Yuqing Kong, Georg Essl, Emily Mower Provost

Keyword(s): Neural Networks, Emotion Recognition, Loss Function, Deep Neural Networks, Metric Learning, Loss Functions, Speech Emotion Recognition, Subjective Data, Dual Form, Deep Metric Learning

In this paper, we propose a Deep Metric Learning (DML) approach that supports soft labels. DML seeks to learn representations that encode the similarity between examples through deep neural networks. DML generally presupposes that data can be divided into discrete classes using hard labels. However, some tasks, such as our exemplary domain of speech emotion recognition (SER), work with inherently subjective data for which it may not be possible to identify a single hard label. We propose a family of loss functions, f-Similarity Preservation Loss (f-SPL), based on the dual form of f-divergence for DML with soft labels. We show that the minimizer of f-SPL preserves the pairwise label similarities in the learned feature embeddings. We demonstrate the efficacy of the proposed loss function on the task of cross-corpus SER with soft labels. Our approach, which combines f-SPL and classification loss, significantly outperforms a baseline SER system with the same structure but trained with only classification loss in most experiments. We show that the presented techniques are more robust to over-training and can learn an embedding space in which the similarity between examples is meaningful.
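For readers unfamiliar with the "dual form of f-divergence" mentioned above, the standard definitions are sketched below. These are the textbook f-divergence and its variational (dual) representation, not the f-SPL loss itself, whose exact form the abstract does not give. The symbols $P$, $Q$, $p$, $q$, $T$, and $f^{*}$ are notation introduced here for illustration.

```latex
% f-divergence between distributions P and Q with densities p and q,
% for a convex function f with f(1) = 0:
D_f(P \,\|\, Q) = \int q(x)\, f\!\left(\frac{p(x)}{q(x)}\right) dx

% Dual (variational) form, where f^* is the convex conjugate of f
% and the supremum ranges over measurable functions T:
D_f(P \,\|\, Q) = \sup_{T} \Big( \mathbb{E}_{x \sim P}\big[T(x)\big]
                 - \mathbb{E}_{x \sim Q}\big[f^{*}(T(x))\big] \Big),
\qquad f^{*}(t) = \sup_{u} \big( u\,t - f(u) \big)
```

Restricting $T$ to a parametric family such as a neural network turns the right-hand side into a lower bound that can be estimated from samples, which is what makes the dual form convenient as a training objective.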

Speech Emotion Recognition using Deep Neural Networks

International Journal for Research in Applied Science and Engineering Technology, 2020, Vol 8 (6), pp. 2460-2465, DOI: 10.22214/ijraset.2020.6395

Author(s): Balaji Dharamsoth

Keyword(s): Neural Networks, Emotion Recognition, Deep Neural Networks, Speech Emotion Recognition

Speech emotion recognition based on Gaussian Mixture Models and Deep Neural Networks

Variational Gaussian Mixture Models for Speech Emotion Recognition

Gaussian mixture models with class-dependent features for speech emotion recognition

Deep neural networks with auxiliary Gaussian mixture models for real-time speech recognition

Speech Emotion Recognition Using Deep Neural Networks on Multilingual Databases

Towards real-time Speech Emotion Recognition using deep neural networks

Speech emotion recognition on mobile devices based on modulation spectral feature pooling and deep neural networks

Emotion recognition from speech VIA boosted Gaussian mixture models

BinSanity: unsupervised clustering of environmental microbial assemblies using coverage and affinity propagation

f-Similarity Preservation Loss for Soft Labels: A Demonstration on Cross-Corpus Speech Emotion Recognition

Speech Emotion Recognition using Deep Neural Networks
