time varying pitch
Recently Published Documents


TOTAL DOCUMENTS

5
(FIVE YEARS 2)

H-INDEX

1
(FIVE YEARS 0)

2021 ◽  
Author(s):  
Tobias Teichert ◽  
G. Nike Gnanateja ◽  
Srivatsun Sadagopan ◽  
Bharath Chandrasekaran

AbstractThe frequency-following response (FFR) is a scalp-recorded electrophysiological potential that closely follows the periodicity of complex sounds such as speech. It has been suggested that FFRs reflect the linear superposition of responses that are triggered by the glottal pulse in each cycle of the fundamental frequency (F0 responses) and sequentially propagate through auditory processing stages in brainstem, midbrain, and cortex. However, this conceptualization of the FFR is debated, and it remains unclear if and how well a simple linear superposition can capture the spectro-temporal complexity of FFRs that are generated within the highly recurrent and non-linear auditory system. To address this question, we used a deconvolution approach to compute the hypothetical F0 responses that best explain the FFRs in rhesus monkeys to human speech and click trains with time-varying pitch patterns. The linear superposition of F0 responses explained well over 90% of the variance of click train steady state FFRs and well over 80% of mandarin tone steady state FFRs. The F0 responses could be measured with high signal-to-noise ratio and featured several spectro-temporally and topographically distinct components that likely reflect the activation of brainstem (<5ms; 200-1000 Hz), midbrain (5-15 ms; 100-250 Hz) and cortex (15-35 ms; ~90 Hz). In summary, our results in the monkey support the notion that FFRs arise as the superposition of F0 responses by showing for the first time that they can capture the bulk of the variance and spectro-temporal complexity of FFRs to human speech with time-varying pitch. These findings identify F0 responses as a potential diagnostic tool that may be useful to reliably link altered FFRs in speech and language disorders to altered F0 responses and thus to specific latencies, frequency bands and ultimately processing stages.


2021 ◽  
Author(s):  
G. Nike Gnanteja ◽  
Kyle Rupp ◽  
Fernando Llanos ◽  
Madison Remick ◽  
Marianny Pernia ◽  
...  

Time-varying pitch is a vital cue in the processing of speech signals. Neural processing of time-varying pitch cues in speech has been extensively assayed using scalp-recorded frequency-following responses (FFRs), which are thought to reflect integrated phase-locked activity from neural ensembles exclusively along the subcortical auditory pathway. Emerging evidence however suggests that the auditory cortex contributes to the FFRs as well. However, the response properties and the relative cortical contribution to the scalp-recorded FFR are only beginning to be explored. Here we used direct intracortical recordings from human subjects and animal models (macaque, guinea pig) to deconstruct the cortical sources of FFRs and leveraged representational similarity analysis as a translational bridge to characterize similarities between the human and animal models. We found robust FFRs in the auditory cortex that emerged from the thalamorecepient layers of the auditory cortex and contributed to the scalp-recorded FFRs via volume conduction.


2012 ◽  
Vol 36 (4) ◽  
pp. 81-94 ◽  
Author(s):  
Emmanouil Benetos ◽  
Simon Dixon

In this work, a probabilistic model for multiple-instrument automatic music transcription is proposed. The model extends the shift-invariant probabilistic latent component analysis method, which is used for spectrogram factorization. Proposed extensions support the use of multiple spectral templates per pitch and per instrument source, as well as a time-varying pitch contribution for each source. Thus, this method can effectively be used for multiple-instrument automatic transcription. In addition, the shift-invariant aspect of the method can be exploited for detecting tuning changes and frequency modulations, as well as for visualizing pitch content. For note tracking and smoothing, pitch-wise hidden Markov models are used. For training, pitch templates from eight orchestral instruments were extracted, covering their complete note range. The transcription system was tested on multiple-instrument polyphonic recordings from the RWC database, a Disklavier data set, and the MIREX 2007 multi-F0 data set. Results demonstrate that the proposed method outperforms leading approaches from the transcription literature, using several error metrics.


Sign in / Sign up

Export Citation Format

Share Document