envelope processing
Recently Published Documents



2021 ◽  
Vol 11 (22) ◽  
pp. 10788
Author(s):  
Ali Fallah ◽  
Steven van de Par

Speech intelligibility in public places can be degraded by environmental noise and reverberation. In this study, a new near-end listening enhancement (NELE) approach is proposed in which a time-varying filter jointly enhances the onsets and reduces the overlap masking. The optimization requires some look-ahead in the clean speech and prior knowledge of the room impulse response (RIR). By optimizing a defined cost function, the spectro-temporal envelope of the reverberant speech is brought as close as possible to that of the clean speech; in this cost function, speech onsets are given increased weight. This approach differs from the overlap-masking ratio (OMR) and onset enhancement (OE) approaches (Grosse, van de Par, 2017, J. Audio Eng. Soc., Vol. 65(1/2), pp. 31–41), which only consider previous frames in each time slot when determining the time-variant filtering. Speech reception threshold (SRT) measurements show that the new optimization framework improves speech intelligibility by up to 2 dB more than OE.


2021 ◽  
Author(s):  
Dhatri S. Devaraju ◽  
Amy Kemp ◽  
David A. Eddins ◽  
Rahul Shrivastav ◽  
Bharath Chandrasekaran ◽  
...  

Purpose: Listeners shift their listening strategies to prioritize lower-level acoustic information and higher-level semantic information in challenging listening conditions. However, the neural mechanisms underlying different strategies are unclear. The current study examined the extent to which encoding of lower-level acoustic cues is modulated by task demand and how it relates to higher-level semantic processing.

Method: Electroencephalography (EEG) was acquired while participants listened to sentences in noise that contained either higher or lower probability final words. Task difficulty was modulated by the time available to process responses. Cortical tracking of speech, a neural correlate of acoustic temporal envelope processing, was estimated using temporal response functions (TRFs).

Results: Task difficulty did not affect cortical tracking of the temporal envelope of speech under challenging listening conditions. No correlations were observed between cortical tracking of the temporal envelope of speech and semantic processes, even after controlling for the effect of individualized signal-to-noise ratios.

Conclusions: Cortical tracking of the temporal envelope of speech and semantic processing are differentially influenced by task difficulty. While increased task demands modulated higher-level semantic processing, cortical tracking of the temporal envelope of speech may be influenced by task difficulty primarily when the demand is manipulated in terms of acoustic properties of the stimulus, consistent with an emerging perspective in speech perception.


2020 ◽  
Vol 12 ◽  
Author(s):  
Ehsan Darestani Farahani ◽  
Jan Wouters ◽  
Astrid van Wieringen

Speech understanding problems are highly prevalent in the aging population, even when hearing sensitivity is clinically normal. These difficulties are attributed to changes in central temporal processing with age and can potentially be captured by age-related changes in neural generators. The aim of this study is to investigate age-related changes in a wide range of neural generators during temporal processing in middle-aged and older persons with normal audiometric thresholds. A minimum-norm imaging technique is employed to reconstruct cortical and subcortical neural generators of temporal processing for different acoustic modulations. The results indicate that for relatively slow modulations (<50 Hz), the response strength of neural sources is higher in older adults than in younger ones, while the phase-locking does not change. For faster modulations (80 Hz), both the response strength and the phase-locking of neural sources are reduced in older adults compared to younger ones. These age-related changes in temporal envelope processing of slow and fast acoustic modulations are possibly due to the loss of functional inhibition that accompanies aging. Both cortical (primary and non-primary) and subcortical neural generators demonstrate similar age-related changes in response strength and phase-locking. Hemispheric asymmetry is also altered in older adults compared to younger ones, and the alterations depend on the modulation frequency and side of stimulation. The current findings at source level could have important implications for the understanding of age-related changes in auditory temporal processing and for developing advanced rehabilitation strategies to address speech understanding difficulties in the aging population.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Robin Gransier ◽  
Robert P. Carlyon ◽  
Jan Wouters

Cochlear-implant (CI) users rely on temporal envelope modulations (TEMs) to understand speech, and clinical outcomes depend on the accuracy with which these TEMs are encoded by the electrically stimulated neural ensembles. Non-invasive EEG measures of this encoding could help clinicians identify and disable electrodes that evoke poor neural responses so as to improve CI outcomes. However, recording EEG during CI stimulation reveals huge stimulation artifacts that are up to orders of magnitude larger than the neural response. Here we used a custom-built EEG system with an exceptionally high sample rate to accurately measure the artifact, which we then removed using linear interpolation so as to reveal the neural response during continuous electrical stimulation. In ten adult CI users, we measured the 40-Hz electrically evoked auditory steady-state response (eASSR) and the electrically evoked auditory change complex (eACC) to amplitude-modulated 900-pulses-per-second pulse trains, stimulated in monopolar mode (i.e., the clinical default) and at different modulation depths. We successfully measured artifact-free 40-Hz eASSRs and eACCs. Moreover, we found that the 40-Hz eASSR, in contrast to the eACC, showed substantial responses even at shallow modulation depths. We argue that the 40-Hz eASSR is a clinically feasible objective measure to assess TEM encoding in CI users.
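The artifact-removal step described above (blank the samples contaminated by each stimulation pulse and bridge the gap by linear interpolation) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' pipeline: the function name `interpolate_artifacts`, the fixed window `width`, and the assumption that pulse onsets are already known as sample indices are all hypothetical.

```python
def interpolate_artifacts(signal, pulse_starts, width):
    """Replace `width` samples after each pulse onset with a straight line.

    signal       -- list of EEG samples (hypothetical single channel)
    pulse_starts -- sample indices where an artifact begins (assumed known)
    width        -- number of contaminated samples per pulse (assumed fixed)
    """
    cleaned = list(signal)
    n = len(cleaned)
    for start in pulse_starts:
        left = start - 1        # last clean sample before the artifact
        right = start + width   # first clean sample after the artifact
        if left < 0 or right >= n:
            continue            # no clean sample on one side: leave untouched
        y0, y1 = cleaned[left], cleaned[right]
        span = right - left
        for i in range(start, right):
            # Point on the line joining (left, y0) and (right, y1)
            cleaned[i] = y0 + (y1 - y0) * (i - left) / span
    return cleaned


# Usage: a slow ramp with a two-sample artifact at indices 3 and 4
ramp = [0.0, 1.0, 2.0, 100.0, -100.0, 5.0, 6.0]
print(interpolate_artifacts(ramp, pulse_starts=[3], width=2))
```

Interpolation of this kind only works when the sample rate is high enough that clean samples exist between consecutive pulses, which is consistent with the abstract's emphasis on an exceptionally high sample rate.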


2020 ◽  
Author(s):  
Chhayakanta Patro ◽  
Heather A. Kreft ◽  
Magdalena Wojtczak

Older adults often experience difficulties understanding speech in adverse listening conditions. These difficulties are partially attributed to auditory temporal-processing deficits associated with aging even in the absence of hearing loss. The aim of this study was to assess effects of age and hearing loss on temporal envelope processing and speech-on-speech masking. Listeners with normal and near-normal hearing across a wide age range (20 to 66 years) were tested using a series of psychophysical (amplitude-modulation detection, gap detection, and interaural-envelope-phase discrimination), physiological (electroencephalographic envelope-following responses), speech perception (spatial release from masking), and cognitive (processing speed) measures. Results showed that: (i) psychophysical measures of monaural and binaural envelope processing and neural measures of envelope processing are not affected by aging after accounting for audiometric hearing loss, (ii) behavioral gap-detection thresholds decline with age, (iii) aging results in a reduction of spatial release from masking, even when speech intensity is amplified in the region of hearing loss, and (iv) aging is associated with poorer measures of cognitive function. Although age significantly contributed to a decline in spatial release from speech-on-speech masking, individual differences in envelope processing and in scores from the nonauditory cognitive tests used in this study were not significant predictors of speech performance.

Highlights:
Age per se does not affect psychophysical and physiological measures of monaural amplitude-modulation processing.
Age does not affect the ability to detect interaural disparities in envelope timing between the ears.
Gap-detection thresholds degrade with age even after hearing thresholds are statistically accounted for.
Age, independent of hearing thresholds, can substantially reduce spatial release from masking.
Cognitive ability declines with age. However, such declines do not necessarily cause deficits in spatial release from masking.


2019 ◽  
Vol 122 (4) ◽  
pp. 1685-1696 ◽  
Author(s):  
Carolyn M. McClaskey ◽  
James W. Dias ◽  
Kelly C. Harris

Temporal modulations are an important part of speech signals. An accurate perception of these time-varying qualities of sound is necessary for successful communication. The current study investigates the relationship between sustained envelope encoding and speech-in-noise perception in a cohort of normal-hearing younger (ages 18–30 yr, n = 22) and older adults (ages 55–90+ yr, n = 35) using the subcortical auditory steady-state response (ASSR). ASSRs were measured in response to the envelope of 400-ms amplitude-modulated (AM) tones with 3,000-Hz carrier frequencies and 80-Hz modulation frequencies. AM tones had modulation depths of 0, −4, and −8 dB relative to m = 1 (m = 1, 0.631, and 0.398, respectively). The robustness, strength at modulation frequency, and synchrony of subcortical envelope encoding were quantified via time-domain correlations, spectral amplitude, and phase-locking value, respectively. Speech-in-noise ability was quantified via the QuickSIN test in the 0- and 5-dB signal-to-noise ratio (SNR) conditions. All ASSR metrics increased with increasing modulation depth and there were no effects of age group. ASSR metrics in response to shallow modulation depths predicted 0-dB speech scores. Results demonstrate that sustained amplitude envelope processing in the brainstem relates to speech-in-noise abilities, but primarily in difficult listening conditions at low SNRs. These findings furthermore highlight the utility of shallow modulation depths for studying temporal processing. The absence of age effects in these data demonstrates that individual differences in the robustness, strength, and specificity of subcortical envelope processing, and not age, predict speech-in-noise performance in the most difficult listening conditions.

NEW & NOTEWORTHY: Failure to correctly understand speech in the presence of background noise is a significant problem for many normal-hearing adults and may impede healthy communication. The relationship between sustained envelope encoding in the brainstem and speech-in-noise perception remains to be clarified. The present study demonstrates that the strength, specificity, and robustness of the brainstem’s representations of sustained stimulus periodicity relate to speech-in-noise perception in older and younger normal-hearing adults, but only in highly challenging listening environments.
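The modulation depths quoted in this abstract (0, −4, and −8 dB corresponding to m = 1, 0.631, and 0.398) follow from the usual relation depth_dB = 20·log10(m). The short sketch below reproduces that conversion and builds an AM tone with the stated 3,000-Hz carrier, 80-Hz modulator, and 400-ms duration. The function names, the 48-kHz sample rate, and the cosine phase of the modulator are assumptions for illustration and are not taken from the paper.

```python
import math


def depth_db_to_m(depth_db):
    """Convert modulation depth in dB (re m = 1) to the linear index m."""
    return 10 ** (depth_db / 20)


def am_tone(m, fc=3000.0, fm=80.0, dur=0.4, fs=48000):
    """Sinusoidally amplitude-modulated tone as a list of samples.

    fc, fm, dur follow the abstract; fs and the modulator phase are assumed.
    """
    n = int(round(dur * fs))
    return [
        (1 + m * math.cos(2 * math.pi * fm * k / fs))
        * math.sin(2 * math.pi * fc * k / fs)
        for k in range(n)
    ]


for depth_db in (0, -4, -8):
    print(depth_db, round(depth_db_to_m(depth_db), 3))
```

The envelope of such a tone swings between 1 − m and 1 + m, so shallower depths (smaller m) leave less envelope fluctuation for the auditory system to follow.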


Sensors ◽  
2019 ◽  
Vol 19 (12) ◽  
pp. 2666 ◽  
Author(s):  
Qiuming Nan ◽  
Sheng Li ◽  
Yiqiang Yao ◽  
Zhengying Li ◽  
Honghai Wang ◽  
...  

Tracking operating trains and identifying illegal intruders are two important and critical issues in subway safety management. One challenge is to find a reliable methodology that would enable these two needs to be addressed with high sensitivity and spatial resolution over a long-distance range. This paper proposes a novel monitoring approach based on distributed vibration, which is suitable for both train tracking and incursion detection. For an actual subway system, ultra-weak fiber Bragg grating (FBG) sensing technology was applied to collect the distributed vibration responses from moving trains and intruders. The monitoring data from the subway operation stage were directly utilized to evaluate the feasibility of the proposed method for tracking trains. Moreover, a field simulation experiment was performed to validate the possibility of detecting human intrusion. The results showed that the diagonal signal pattern in the distributed vibration response can be used to reveal the location and speed of the moving loads (e.g., trains and intruders). Other train parameters, such as length and the number of compartments, can also be obtained from the vibration responses through cross-correlation and envelope processing. Experimental results in the time and frequency domains within the selected intrusion range indicated that the proposed method can distinguish designed intrusion cases in terms of strength and mode.
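As a rough illustration of how cross-correlation and envelope processing can yield the speed of a moving load from two sensing points, consider the sketch below. It is not the paper's algorithm: the rectify-and-average envelope, the brute-force lag search, and all names (`envelope`, `best_lag`, `speed_from_lag`) and parameter values are assumptions chosen to keep the example self-contained.

```python
def envelope(x, win):
    """Crude envelope: rectify, then trailing moving average of `win` samples."""
    rect = [abs(v) for v in x]
    out, acc = [], 0.0
    for i, v in enumerate(rect):
        acc += v
        if i >= win:
            acc -= rect[i - win]          # drop the sample leaving the window
        out.append(acc / min(i + 1, win))
    return out


def best_lag(a, b, max_lag):
    """Lag (in samples) by which `b` trails `a`, maximizing sum a[i]*b[i+lag]."""
    best, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(len(a)):
            j = i + lag
            if 0 <= j < len(b):
                score += a[i] * b[j]
        if score > best_score:
            best, best_score = lag, score
    return best


def speed_from_lag(lag, fs, spacing_m):
    """Speed in m/s for a load covering `spacing_m` in `lag` samples at rate fs."""
    return spacing_m * fs / lag


# Usage: the same vibration burst reaches the second sensor 15 samples later
a = [0.0] * 100
b = [0.0] * 100
for k in range(20, 30):
    a[k] = (-1.0) ** k
    b[k + 15] = (-1.0) ** k
lag = best_lag(envelope(a, 5), envelope(b, 5), max_lag=30)
print(lag, speed_from_lag(lag, fs=1000, spacing_m=3.0))
```

Correlating envelopes rather than raw vibration traces makes the lag estimate less sensitive to the phase of the oscillation at each sensing point, which is one plausible reason to combine the two operations.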


Author(s):  
Luz Garcia ◽  
Isaac Alvarez ◽  
Manuel Titos ◽  
Alejandro Diaz-Moreno ◽  
M. Carmen Benitez ◽  
...  
