time varying pitch Latest Research Papers

AbstractThe frequency-following response (FFR) is a scalp-recorded electrophysiological potential that closely follows the periodicity of complex sounds such as speech. It has been suggested that FFRs reflect the linear superposition of responses that are triggered by the glottal pulse in each cycle of the fundamental frequency (F0 responses) and sequentially propagate through auditory processing stages in brainstem, midbrain, and cortex. However, this conceptualization of the FFR is debated, and it remains unclear if and how well a simple linear superposition can capture the spectro-temporal complexity of FFRs that are generated within the highly recurrent and non-linear auditory system. To address this question, we used a deconvolution approach to compute the hypothetical F0 responses that best explain the FFRs in rhesus monkeys to human speech and click trains with time-varying pitch patterns. The linear superposition of F0 responses explained well over 90% of the variance of click train steady state FFRs and well over 80% of mandarin tone steady state FFRs. The F0 responses could be measured with high signal-to-noise ratio and featured several spectro-temporally and topographically distinct components that likely reflect the activation of brainstem (<5ms; 200-1000 Hz), midbrain (5-15 ms; 100-250 Hz) and cortex (15-35 ms; ~90 Hz). In summary, our results in the monkey support the notion that FFRs arise as the superposition of F0 responses by showing for the first time that they can capture the bulk of the variance and spectro-temporal complexity of FFRs to human speech with time-varying pitch. These findings identify F0 responses as a potential diagnostic tool that may be useful to reliably link altered FFRs in speech and language disorders to altered F0 responses and thus to specific latencies, frequency bands and ultimately processing stages.

Download Full-text

Deconstructing the Cortical Sources of Frequency Following Responses to Speech: A Cross-species Approach

10.1101/2021.05.17.444462 ◽

2021 ◽

Author(s):

G. Nike Gnanteja ◽

Kyle Rupp ◽

Fernando Llanos ◽

Madison Remick ◽

Marianny Pernia ◽

...

Keyword(s):

Animal Models ◽

Auditory Cortex ◽

Human Subjects ◽

Auditory Pathway ◽

Time Varying ◽

Volume Conduction ◽

Cortical Sources ◽

Neural Ensembles ◽

Processing Of Speech Signals ◽

Time Varying Pitch

Time-varying pitch is a vital cue in the processing of speech signals. Neural processing of time-varying pitch cues in speech has been extensively assayed using scalp-recorded frequency-following responses (FFRs), which are thought to reflect integrated phase-locked activity from neural ensembles exclusively along the subcortical auditory pathway. Emerging evidence however suggests that the auditory cortex contributes to the FFRs as well. However, the response properties and the relative cortical contribution to the scalp-recorded FFR are only beginning to be explored. Here we used direct intracortical recordings from human subjects and animal models (macaque, guinea pig) to deconstruct the cortical sources of FFRs and leveraged representational similarity analysis as a translational bridge to characterize similarities between the human and animal models. We found robust FFRs in the auditory cortex that emerged from the thalamorecepient layers of the auditory cortex and contributed to the scalp-recorded FFRs via volume conduction.

Download Full-text

Influence of time-varying pitch on timbre: “Coherence and incoherence” based on spectral centroid

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2015.7178770 ◽

2015 ◽

Cited By ~ 1

Author(s):

S. Arthi ◽

T. V. Sreenivas

Keyword(s):

Time Varying ◽

Spectral Centroid ◽

Time Varying Pitch

Download Full-text

A Shift-Invariant Latent Variable Model for Automatic Music Transcription

Computer Music Journal ◽

10.1162/comj_a_00146 ◽

2012 ◽

Vol 36 (4) ◽

pp. 81-94 ◽

Cited By ~ 36

Author(s):

Emmanouil Benetos ◽

Simon Dixon

Keyword(s):

Latent Variable ◽

Markov Models ◽

Variable Model ◽

Data Set ◽

Music Transcription ◽

Transcription System ◽

Automatic Music Transcription ◽

Error Metrics ◽

Frequency Modulations ◽

Time Varying Pitch

In this work, a probabilistic model for multiple-instrument automatic music transcription is proposed. The model extends the shift-invariant probabilistic latent component analysis method, which is used for spectrogram factorization. Proposed extensions support the use of multiple spectral templates per pitch and per instrument source, as well as a time-varying pitch contribution for each source. Thus, this method can effectively be used for multiple-instrument automatic transcription. In addition, the shift-invariant aspect of the method can be exploited for detecting tuning changes and frequency modulations, as well as for visualizing pitch content. For note tracking and smoothing, pitch-wise hidden Markov models are used. For training, pitch templates from eight orchestral instruments were extracted, covering their complete note range. The transcription system was tested on multiple-instrument polyphonic recordings from the RWC database, a Disklavier data set, and the MIREX 2007 multi-F0 data set. Results demonstrate that the proposed method outperforms leading approaches from the transcription literature, using several error metrics.

Download Full-text

Detection of time varying pitch in tonal languages: an approach based on ensemble empirical mode decomposition

Journal of Zhejiang University SCIENCE C ◽

10.1631/jzus.c1100092 ◽

2012 ◽

Vol 13 (2) ◽

pp. 139-145 ◽

Cited By ~ 1

Author(s):

Hong Hong ◽

Xiao-hua Zhu ◽

Wei-min Su ◽

Run-tong Geng ◽

Xin-long Wang

Keyword(s):

Empirical Mode Decomposition ◽

Ensemble Empirical Mode Decomposition ◽

Time Varying ◽

Mode Decomposition ◽

Time Varying Pitch

Download Full-text

time varying pitch
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Linear superposition of responses evoked by individual glottal pulses explain over 80% of the frequency following response to human speech in the macaque monkey

Deconstructing the Cortical Sources of Frequency Following Responses to Speech: A Cross-species Approach

Influence of time-varying pitch on timbre: “Coherence and incoherence” based on spectral centroid

A Shift-Invariant Latent Variable Model for Automatic Music Transcription

Detection of time varying pitch in tonal languages: an approach based on ensemble empirical mode decomposition

Export Citation Format

time varying pitchRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Linear superposition of responses evoked by individual glottal pulses explain over 80% of the frequency following response to human speech in the macaque monkey

Deconstructing the Cortical Sources of Frequency Following Responses to Speech: A Cross-species Approach

Influence of time-varying pitch on timbre: “Coherence and incoherence” based on spectral centroid

A Shift-Invariant Latent Variable Model for Automatic Music Transcription

Detection of time varying pitch in tonal languages: an approach based on ensemble empirical mode decomposition

time varying pitch
Recently Published Documents