critical band Latest Research Papers

Emotion Recognition From Speech Using Perceptual Filter and Neural Network

10.4018/978-1-6684-2408-7.ch054 ◽

2022 ◽

pp. 1146-1156

Author(s):

Revathi A. ◽

Sasikaladevi N.

Keyword(s):

Neural Network ◽

Emotion Recognition ◽

Vector Quantization ◽

Group Performance ◽

Back Propagation ◽

Critical Band ◽

Emotion Classification ◽

Back Propagation Algorithm ◽

Propagation Algorithm ◽

Speaker Independent

This chapter on multi speaker independent emotion recognition encompasses the use of perceptual features with filters spaced in Equivalent rectangular bandwidth (ERB) and BARK scale and vector quantization (VQ) classifier for classifying groups and artificial neural network with back propagation algorithm for emotion classification in a group. Performance can be improved by using the large amount of data in a pertinent emotion to adequately train the system. With the limited set of data, this proposed system has provided consistently better accuracy for the perceptual feature with critical band analysis done in ERB scale.

Download Full-text

Evaluating Tenney's critical band using a computational model of the human cochlea

10.18061/fdmc.2021.0048 ◽

2021 ◽

Author(s):

Ashkan Fakhrtabatabaie ◽

Skyler G. Jennings

Keyword(s):

Computational Model ◽

Critical Band ◽

Human Cochlea

Download Full-text

The Role of the Bandwidth-Duration Product WT in the Detectability of Diotic Signals

10.26686/wgtn.16934959.v1 ◽

2021 ◽

Author(s):

◽

Judi Lapsley Miller

Keyword(s):

Gaussian Noise ◽

Critical Band ◽

Ideal Observer ◽

Fundamental Parameter ◽

Band Pass Filter ◽

Individual Values ◽

Pass Filter ◽

Time Frequency ◽

Linear Detector ◽

Temporal Integrator

<p>The bandwidth-duration product, WT , is a fundamental parameter in most theories of aural amplitude discrimination of Gaussian noise. These theories predict that detectability is dependent on WT , but not on the individual values of bandwidth and duration. Due to the acoustical uncertainty principle, it is impossible to completely specify an acoustic waveform with both finite duration and finite bandwidth. An observer must decide how best to trade-off information in the time domain with information in the frequency domain. As Licklider (1963) states, "The nature of [the ear's] solution to the time-frequency problem is, in fact, one of the central problems in the psychology of hearing."This problem is still unresolved, primarily due to observer inconsistency in experiments, which degrades performance making it difficult to compare models. The aim was to compare human observers' ability to trade bandwidth and duration, with simulated and theoretical observers. Human observers participated in a parametric study where the bandwidth and duration of 500 Hz noise waveforms was systematically varied for the same bandwidth-duration products (WT = 1, 2, and 4, where W varied over 2.5-160 Hz, and T varied over 400-6.25 ms, in octave steps). If observers can trade bandwidth and duration, detectability should be constant for the same WT . The observers replicated the experiments six times so that group operating characteristic (GOC) analysis could be used to reduce the effects of their inconsistent decision making. Asymptotic errorless performance was estimated by extrapolating results from the GOC analysis, as a function of replications added. Three simulated ideal observers: the energy, envelope, and full-linear (band-pass filter, full-wave rectifier, and true integrator) detectors were compared with each other, with mathematical theory and with human observers. Asymptotic detectability relative to the full-linear detector indicates that human observers best detect signals with a bandwidth of 40-80 Hz and a duration of 50-100 ms, and that other values are traded off in approximately concentric ellipses of equal detectability. Human detectability of Gaussian noise was best modelled by the full-linear detector using a non-optimal filter. Comparing psychometric functions for this detector with human data shows many striking similarities, indicating that human observers can sometimes perform as well as an ideal observer, once their inconsistency is minimised. These results indicate that the human hearing system can trade bandwidth and duration of signals, but not optimally. This accounts for many of the disparate estimates of the critical band, rectifier, and temporal integrator, found in the literature, because (a) the critical band is adjustable, but has a minimum of 40-50 Hz, (b) the rectifier is linear, rather than square-law, and (c) the temporal integrator is either true or leaky with a very long time constant.</p>

Download Full-text

The Role of the Bandwidth-Duration Product WT in the Detectability of Diotic Signals

10.26686/wgtn.16934959 ◽

2021 ◽

Author(s):

◽

Judi Lapsley Miller

Keyword(s):

Gaussian Noise ◽

Critical Band ◽

Ideal Observer ◽

Fundamental Parameter ◽

Band Pass Filter ◽

Individual Values ◽

Pass Filter ◽

Time Frequency ◽

Linear Detector ◽

Temporal Integrator

<p>The bandwidth-duration product, WT , is a fundamental parameter in most theories of aural amplitude discrimination of Gaussian noise. These theories predict that detectability is dependent on WT , but not on the individual values of bandwidth and duration. Due to the acoustical uncertainty principle, it is impossible to completely specify an acoustic waveform with both finite duration and finite bandwidth. An observer must decide how best to trade-off information in the time domain with information in the frequency domain. As Licklider (1963) states, "The nature of [the ear's] solution to the time-frequency problem is, in fact, one of the central problems in the psychology of hearing."This problem is still unresolved, primarily due to observer inconsistency in experiments, which degrades performance making it difficult to compare models. The aim was to compare human observers' ability to trade bandwidth and duration, with simulated and theoretical observers. Human observers participated in a parametric study where the bandwidth and duration of 500 Hz noise waveforms was systematically varied for the same bandwidth-duration products (WT = 1, 2, and 4, where W varied over 2.5-160 Hz, and T varied over 400-6.25 ms, in octave steps). If observers can trade bandwidth and duration, detectability should be constant for the same WT . The observers replicated the experiments six times so that group operating characteristic (GOC) analysis could be used to reduce the effects of their inconsistent decision making. Asymptotic errorless performance was estimated by extrapolating results from the GOC analysis, as a function of replications added. Three simulated ideal observers: the energy, envelope, and full-linear (band-pass filter, full-wave rectifier, and true integrator) detectors were compared with each other, with mathematical theory and with human observers. Asymptotic detectability relative to the full-linear detector indicates that human observers best detect signals with a bandwidth of 40-80 Hz and a duration of 50-100 ms, and that other values are traded off in approximately concentric ellipses of equal detectability. Human detectability of Gaussian noise was best modelled by the full-linear detector using a non-optimal filter. Comparing psychometric functions for this detector with human data shows many striking similarities, indicating that human observers can sometimes perform as well as an ideal observer, once their inconsistency is minimised. These results indicate that the human hearing system can trade bandwidth and duration of signals, but not optimally. This accounts for many of the disparate estimates of the critical band, rectifier, and temporal integrator, found in the literature, because (a) the critical band is adjustable, but has a minimum of 40-50 Hz, (b) the rectifier is linear, rather than square-law, and (c) the temporal integrator is either true or leaky with a very long time constant.</p>

Download Full-text

Emotion Recognition From Speech Using Perceptual Filter and Neural Network

Advances in Computer and Electrical Engineering - Neural Networks for Natural Language Processing ◽

10.4018/978-1-7998-1159-6.ch004 ◽

2020 ◽

pp. 78-91 ◽

Cited By ~ 2

Author(s):

Revathi A. ◽

Sasikaladevi N.

Keyword(s):

Neural Network ◽

Emotion Recognition ◽

Vector Quantization ◽

Group Performance ◽

Back Propagation ◽

Critical Band ◽

Emotion Classification ◽

Back Propagation Algorithm ◽

Propagation Algorithm ◽

Speaker Independent

This chapter on multi speaker independent emotion recognition encompasses the use of perceptual features with filters spaced in Equivalent rectangular bandwidth (ERB) and BARK scale and vector quantization (VQ) classifier for classifying groups and artificial neural network with back propagation algorithm for emotion classification in a group. Performance can be improved by using the large amount of data in a pertinent emotion to adequately train the system. With the limited set of data, this proposed system has provided consistently better accuracy for the perceptual feature with critical band analysis done in ERB scale.

Download Full-text

Rendering a virtual light source to seem like a realistic light source in an electronic display: A critical band of luminance gradients for the perception of self-luminosity

Displays ◽

10.1016/j.displa.2019.07.001 ◽

2019 ◽

Vol 59 ◽

pp. 44-52

Author(s):

Hui-Ning Wu ◽

Xue-Min Wang ◽

Li-Kun Yu ◽

Tian Yuan ◽

Shu-Guang Kuai

Keyword(s):

Light Source ◽

Critical Band ◽

Electronic Display ◽

Perception Of Self

Download Full-text

Sound‐quality diagnosis method of permanent magnet synchronous motor for electric vehicles based on critical band analysis

IET Electric Power Applications ◽

10.1049/iet-epa.2019.0088 ◽

2019 ◽

Vol 13 (10) ◽

pp. 1613-1621

Author(s):

Conggan Ma ◽

Yuansheng An ◽

Lantao Liu ◽

Michele Degano ◽

Xingjiang Ning ◽

...

Keyword(s):

Permanent Magnet ◽

Electric Vehicles ◽

Permanent Magnet Synchronous Motor ◽

Sound Quality ◽

Synchronous Motor ◽

Critical Band ◽

Diagnosis Method ◽

Quality Diagnosis

Download Full-text

Susceptibilities and the critical band of crossover region in the QCD phase diagram

The European Physical Journal C ◽

10.1140/epjc/s10052-019-6915-0 ◽

2019 ◽

Vol 79 (5) ◽

Cited By ~ 3

Author(s):

Shu-Sheng Xu ◽

Pei-Lin Yin ◽

Hong-Shi Zong

Keyword(s):

Phase Diagram ◽

Critical Band ◽

Crossover Region ◽

Qcd Phase Diagram

Download Full-text

CNN and RNN mixed model for image classification

MATEC Web of Conferences ◽

10.1051/matecconf/201927702001 ◽

2019 ◽

Vol 277 ◽

pp. 02001 ◽

Cited By ~ 1

Author(s):

Qiwei Yin ◽

Ruixun Zhang ◽

XiuLi Shao

Keyword(s):

Neural Networks ◽

Image Classification ◽

Recurrent Neural Networks ◽

Mixed Model ◽

Input Sequence ◽

Image Data ◽

Critical Band ◽

Model Image ◽

Classification Prediction ◽

The Fourier Transform

In this paper, we propose a CNN(Convolutional neural networks) and RNN(recurrent neural networks) mixed model for image classification, the proposed network, called CNN-RNN model. Image data can be viewed as two-dimensional wave data, and convolution calculation is a filtering process. It can filter non-critical band information in an image, leaving behind important features of image information. The CNN-RNN model can use the RNN to Calculate the Dependency and Continuity Features of the Intermediate Layer Output of the CNN Model, connect the characteristics of these middle tiers to the final full-connection network for classification prediction, which will result in better classification accuracy. At the same time, in order to satisfy the restriction of the length of the input sequence by the RNN model and prevent the gradient explosion or gradient disappearing in the network, this paper combines the wavelet transform (WT) method in the Fourier transform to filter the input data. We will test the proposed CNN-RNN model on a widely-used datasets CIFAR-10. The results prove the proposed method has a better classification effect than the original CNN network, and that further investigation is needed.

Download Full-text

Long-Term Critical Band Energy-Based Feature Set for Dialect Identification Using a Neuro-Fuzzy Approach

IEEE Intelligent Systems ◽

10.1109/mis.2018.111144010 ◽

2018 ◽

Vol 33 (1) ◽

pp. 40-52 ◽

Cited By ~ 2

Author(s):

Mousmita Sarma ◽

Kandarpa Kumar Sarma

Keyword(s):

Critical Band ◽

Fuzzy Approach ◽

Band Energy ◽

Neuro Fuzzy

Download Full-text

critical band
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Emotion Recognition From Speech Using Perceptual Filter and Neural Network

Evaluating Tenney's critical band using a computational model of the human cochlea

The Role of the Bandwidth-Duration Product WT in the Detectability of Diotic Signals

The Role of the Bandwidth-Duration Product WT in the Detectability of Diotic Signals

Emotion Recognition From Speech Using Perceptual Filter and Neural Network

Rendering a virtual light source to seem like a realistic light source in an electronic display: A critical band of luminance gradients for the perception of self-luminosity

Sound‐quality diagnosis method of permanent magnet synchronous motor for electric vehicles based on critical band analysis

Susceptibilities and the critical band of crossover region in the QCD phase diagram

CNN and RNN mixed model for image classification

Long-Term Critical Band Energy-Based Feature Set for Dialect Identification Using a Neuro-Fuzzy Approach

Export Citation Format

critical bandRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Emotion Recognition From Speech Using Perceptual Filter and Neural Network

Evaluating Tenney's critical band using a computational model of the human cochlea

The Role of the Bandwidth-Duration Product WT in the Detectability of Diotic Signals

The Role of the Bandwidth-Duration Product WT in the Detectability of Diotic Signals

Emotion Recognition From Speech Using Perceptual Filter and Neural Network

Rendering a virtual light source to seem like a realistic light source in an electronic display: A critical band of luminance gradients for the perception of self-luminosity

Sound‐quality diagnosis method of permanent magnet synchronous motor for electric vehicles based on critical band analysis

Susceptibilities and the critical band of crossover region in the QCD phase diagram

CNN and RNN mixed model for image classification

Long-Term Critical Band Energy-Based Feature Set for Dialect Identification Using a Neuro-Fuzzy Approach

critical band
Recently Published Documents