Creating Clarity in Noisy Environments by Using Deep Learning in Hearing Aids

AbstractHearing aids continue to acquire increasingly sophisticated sound-processing features beyond basic amplification. On the one hand, these have the potential to add user benefit and allow for personalization. On the other hand, if such features are to benefit according to their potential, they require clinicians to be acquainted with both the underlying technologies and the specific fitting handles made available by the individual hearing aid manufacturers. Ensuring benefit from hearing aids in typical daily listening environments requires that the hearing aids handle sounds that interfere with communication, generically referred to as “noise.” With this aim, considerable efforts from both academia and industry have led to increasingly advanced algorithms that handle noise, typically using the principles of directional processing and postfiltering. This article provides an overview of the techniques used for noise reduction in modern hearing aids. First, classical techniques are covered as they are used in modern hearing aids. The discussion then shifts to how deep learning, a subfield of artificial intelligence, provides a radically different way of solving the noise problem. Finally, the results of several experiments are used to showcase the benefits of recent algorithmic advances in terms of signal-to-noise ratio, speech intelligibility, selective attention, and listening effort.

Download Full-text

Effect of Signal-to-Noise Ratio on Directional Microphone Benefit and Preference

Journal of the American Academy of Audiology ◽

10.3766/jaaa.16.9.4 ◽

2005 ◽

Vol 16 (09) ◽

pp. 662-676 ◽

Cited By ~ 17

Author(s):

Brian E. Walden ◽

Rauna K. Surr ◽

Kenneth W. Grant ◽

W. Van Summers ◽

Mary T. Cord ◽

...

Keyword(s):

Hearing Aids ◽

Speech Intelligibility ◽

Signal To Noise Ratio ◽

Hearing Aid ◽

Directional Hearing ◽

Signal To Noise ◽

Directional Microphones ◽

Directional Microphone ◽

The Individual ◽

Directional Hearing Aids

This study examined speech intelligibility and preferences for omnidirectional and directional microphone hearing aid processing across a range of signal-to-noise ratios (SNRs). A primary motivation for the study was to determine whether SNR might be used to represent distance between talker and listener in automatic directionality algorithms based on scene analysis. Participants were current hearing aid users who either had experience with omnidirectional microphone hearing aids only or with manually switchable omnidirectional/directional hearing aids. Using IEEE/Harvard sentences from a front loudspeaker and speech-shaped noise from three loudspeakers located behind and to the sides of the listener, the directional advantage (DA) was obtained at 11 SNRs ranging from -15 dB to +15 dB in 3 dB steps. Preferences for the two microphone modes at each of the 11 SNRs were also obtained using concatenated IEEE sentences presented in the speech-shaped noise. Results revealed that a DA was observed across a broad range of SNRs, although directional processing provided the greatest benefit within a narrower range of SNRs. Mean data suggested that microphone preferences were determined largely by the DA, such that the greater the benefit to speech intelligibility provided by the directional microphones, the more likely the listeners were to prefer that processing mode. However, inspection of the individual data revealed that highly predictive relationships did not exist for most individual participants. Few preferences for omnidirectional processing were observed. Overall, the results did not support the use of SNR to estimate the effects of distance between talker and listener in automatic directionality algorithms.

Download Full-text

Evaluation of the Efficacy of a Dual Variable Speed Compressor over a Single Fixed Speed Compressor

Journal of the American Academy of Audiology ◽

10.3766/jaaa.17127 ◽

2019 ◽

Vol 30 (07) ◽

pp. 590-606 ◽

Cited By ~ 2

Author(s):

Francis Kuk ◽

Chris Slugocki ◽

Petri Korhonen ◽

Eric Seper ◽

Ole Hau

Keyword(s):

Hearing Aids ◽

Repeated Measures ◽

Speech Intelligibility ◽

Signal To Noise Ratio ◽

Recall Performance ◽

Variable Speed ◽

Listening Effort ◽

Subjective Report ◽

Repeated Measures Design ◽

Compression Speed

AbstractIt has been suggested that hearing-impaired listeners with a good working memory (WM) should be fitted with a compression system using short time constants (i.e., fast-acting compression [FAC]), whereas those with a poorer WM should be fitted with a longer time constant (i.e., slow-acting compression [SAC]). However, commercial hearing aids (HAs) seldom use a fixed speed of compression.The performance of a variable speed compression (VSC) system relative to a fixed speed compressor (FAC and SAC) on measures of speech intelligibility, recall, and subjective report of listening effort and tolerable time was evaluated. The potential interaction with the listeners’ WM capacity (WMC) was also examined.A double-blinded, repeated measures design.Seventeen HA wearers (16 with greater than one year HA experience) with a bilaterally symmetrical, mild to moderately severe sensorineural hearing loss participated in the study.Participants wore the study HAs at three compression speeds (FAC, SAC, and VSC). Each listener was evaluated on the Office of Research in Clinical Amplification-nonsense syllable test (NST) at 50 dB SPL (signal-to-noise ratio [SNR] = +15 dB), 65 dB SPL (SNR = +5 dB), 80 dB SPL (SNR = 0 dB), and a split (80 dB SPL–50 dB SPL) condition. Listeners were also evaluated on a Repeat Recall Test (RRT), where they had to repeat six short sentences (both high- and low-context sentences) after each was presented. Listeners recalled target words in all six sentences after they were presented. They also rated their listening effort and the amount of time they would tolerate listening under the specific condition. RRT sentences were presented at 75 dB SPL in quiet, as well as SNR = 0, 5, 10, and 15 dB. A Reading Span Test (RST) was also administered to assess listeners’ WMC. Analysis of variance using RST scores as a covariate was used to examine differences in listener performance among compressor speeds.Listener performance on the NST was similar among all three compression speeds at 50, 65, and 80 dB SPL. Performance with FAC was significantly better than SAC for the split condition; however, performance did not differ between FAC and VSC or between SAC and VSC. Performance on the NST was not affected by listeners’ RST scores. On the RRT, there was no effect of compressor speed on measures of repeat, recall, listening effort, and tolerable time. However, VSC resulted in significantly lower (better) speech reception threshold at the 85% correct recognition criterion (SRT85) than FAC and SAC. Listener RST scores significantly affected recall performance on the RRT but did not affect SRT85, repeat, listening effort, or tolerable time.These results suggest that the VSC, FAC, and SAC yield similar performance in most but not all test conditions. FAC outperforms SAC, where the stimulus levels change abruptly (i.e., split condition). The VSC yields a lower SRT85 than a fixed compression speed at a moderately high level with a favorable SNR. There is no interaction between compression speed and the participants’ WMC.

Download Full-text

Can Dual Compression Offer Better Mandarin Speech Intelligibility and Sound Quality Than Fast-Acting Compression?

Trends in Hearing ◽

10.1177/2331216521997610 ◽

2021 ◽

Vol 25 ◽

pp. 233121652199761

Author(s):

Yuan Chen ◽

Lena L. N. Wong ◽

Volker Kuehnel ◽

Jinyu Qian ◽

Solveig Christina Voss ◽

...

Keyword(s):

Hearing Aids ◽

Speech Intelligibility ◽

Signal To Noise Ratio ◽

Hearing Aid ◽

Sound Quality ◽

Listening Effort ◽

Theoretical Understanding ◽

Quality Ratings ◽

Speech Reception Thresholds ◽

Fast Acting

The aim of this study was to evaluate the efficacy of dual compression for Mandarin-speaking hearing aid users. Dual compression combines fast and slow compressors operating simultaneously across all frequency channels. The study participants were 31 hearing aid users with symmetrical moderate-to-severe hearing loss, with a mean age of 67 years. A new pair of 20-channel behind-the-ear hearing aids (i.e., Phonak Bolero B90-P) was used during the testing. The results revealed a significant improvement in speech reception thresholds in noise when switching from fast-acting compression to dual compression. The sound quality ratings revealed that most listeners preferred dual compression to fast-acting compression for listening effort, listening comfort, speech clarity, and overall sound quality at +4 dB signal-to-noise ratio. These results are consistent with predictions based on the theoretical understanding of dual and fast-acting compression. However, whether these results can be generalized to other languages or other dual compression systems should be verified by future studies.

Download Full-text

Measuring the Influence of Noise Reduction on Listening Effort in Hearing-Impaired Listeners Using Response Times to an Arithmetic Task in Noise

Trends in Hearing ◽

10.1177/23312165211014437 ◽

2021 ◽

Vol 25 ◽

pp. 233121652110144

Author(s):

Ilja Reinten ◽

Inge De Ronde-Brons ◽

Rolph Houben ◽

Wouter Dreschler

Keyword(s):

Noise Reduction ◽

Hearing Aids ◽

Speech Intelligibility ◽

Response Times ◽

Hearing Impaired ◽

Listening Effort ◽

Arithmetic Task ◽

Test Retest Reliability ◽

Influence Of Noise ◽

Measured Response

Single microphone noise reduction (NR) in hearing aids can provide a subjective benefit even when there is no objective improvement in speech intelligibility. A possible explanation lies in a reduction of listening effort. Previously, we showed that response times (a proxy for listening effort) to an auditory-only dual-task were reduced by NR in normal-hearing (NH) listeners. In this study, we investigate if the results from NH listeners extend to the hearing-impaired (HI), the target group for hearing aids. In addition, we assess the relevance of the outcome measure for studying and understanding listening effort. Twelve HI subjects were asked to sum two digits of a digit triplet in noise. We measured response times to this task, as well as subjective listening effort and speech intelligibility. Stimuli were presented at three signal-to-noise ratios (SNR; –5, 0, +5 dB) and in quiet. Stimuli were processed with ideal or nonideal NR, or unprocessed. The effect of NR on response times in HI listeners was significant only in conditions where speech intelligibility was also affected (–5 dB SNR). This is in contrast to the previous results with NH listeners. There was a significant effect of SNR on response times for HI listeners. The response time measure was reasonably correlated ( R142 = 0.54) to subjective listening effort and showed a sufficient test–retest reliability. This study thus presents an objective, valid, and reliable measure for evaluating an aspect of listening effort of HI listeners.

Download Full-text

Review of Recent Research on the Selection of Frequency-Gain Characteristics for Hearing Aids

Annals of Otology Rhinology & Laryngology ◽

10.1177/00034894800890s522 ◽

1980 ◽

Vol 89 (5_suppl) ◽

pp. 79-83

Author(s):

Richard Lippmann

Keyword(s):

Hearing Aids ◽

Speech Intelligibility ◽

Signal To Noise Ratio ◽

Hearing Aid ◽

Linear Amplification ◽

Percentage Points ◽

Gain Characteristic ◽

The Relationship ◽

Fitting Hearing Aids ◽

Selection Of

Following the Harvard master hearing aid study in 1947 there was little research on linear amplification. Recently, however, there have been a number of studies designed to determine the relationship between the frequency-gain characteristic of a hearing aid and speech intelligibility for persons with sensorineural hearing loss. These studies have demonstrated that a frequency-gain characteristic that rises at a rate of 6 dB/octave, as suggested by the Harvard study, is not optimal. They have also demonstrated that high-frequency emphasis of 10–40 dB above 500–1000 Hz is beneficial. Most importantly, they have demonstrated that hearing aids as they are presently being fit do not provide maximum speech intelligibility. Percent word correct scores obtained with the best frequency-gain characteristics tested in various studies have been found to be 9 to 19 percentage points higher than scores obtained with commercial aids owned by subjects. This increase in scores is equivalent to an increase in signal-to-noise ratio of 10 to 20 dB. This is a significant increase which could allow impaired listeners to communicate in many situations where they presently cannot. These results demonstrate the need for further research on linear amplification aimed at developing practical suggestions for fitting hearing aids.

Download Full-text

Energetic and Informational Components of Speech-on-Speech Masking in Binaural Speech Intelligibility and Perceived Listening Effort

Trends in Hearing ◽

10.1177/2331216519854597 ◽

2019 ◽

Vol 23 ◽

pp. 233121651985459 ◽

Cited By ~ 8

Author(s):

Jan Rennies ◽

Virginia Best ◽

Elin Roverud ◽

Gerald Kidd

Keyword(s):

Speech Intelligibility ◽

Signal To Noise Ratio ◽

Spatial Separation ◽

Signal To Noise ◽

Listening Effort ◽

Complex Sound ◽

Time Frequency ◽

Sound Fields ◽

Energetic Masking

Speech perception in complex sound fields can greatly benefit from different unmasking cues to segregate the target from interfering voices. This study investigated the role of three unmasking cues (spatial separation, gender differences, and masker time reversal) on speech intelligibility and perceived listening effort in normal-hearing listeners. Speech intelligibility and categorically scaled listening effort were measured for a female target talker masked by two competing talkers with no unmasking cues or one to three unmasking cues. In addition to natural stimuli, all measurements were also conducted with glimpsed speech—which was created by removing the time–frequency tiles of the speech mixture in which the maskers dominated the mixture—to estimate the relative amounts of informational and energetic masking as well as the effort associated with source segregation. The results showed that all unmasking cues as well as glimpsing improved intelligibility and reduced listening effort and that providing more than one cue was beneficial in overcoming informational masking. The reduction in listening effort due to glimpsing corresponded to increases in signal-to-noise ratio of 8 to 18 dB, indicating that a significant amount of listening effort was devoted to segregating the target from the maskers. Furthermore, the benefit in listening effort for all unmasking cues extended well into the range of positive signal-to-noise ratios at which speech intelligibility was at ceiling, suggesting that listening effort is a useful tool for evaluating speech-on-speech masking conditions at typical conversational levels.

Download Full-text

Effects of a Transient Noise Reduction Algorithm on Speech Understanding, Subjective Preference, and Preferred Gain

Journal of the American Academy of Audiology ◽

10.3766/jaaa.24.9.8 ◽

2013 ◽

Vol 24 (09) ◽

pp. 845-858 ◽

Cited By ~ 3

Author(s):

Petri Korhonen ◽

Francis Kuk ◽

Chi Lau ◽

Denise Keenan ◽

Jennifer Schumacher ◽

...

Keyword(s):

Noise Reduction ◽

Hearing Aids ◽

Repeated Measures ◽

Speech Intelligibility ◽

Repeated Measures Design ◽

Subjective Preference ◽

Listening Environments ◽

Speech Identification ◽

Noise Reduction Algorithm ◽

Phoneme Identification

Background: Today's compression hearing aids with noise reduction systems may not manage transient noises effectively because of the short duration of these sounds compared to the onset times of the compressors and/or noise reduction algorithms. Purpose: The current study was designed to evaluate the effect of a transient noise reduction (TNR) algorithm on listening comfort, speech intelligibility in quiet, and preferred wearer gain in the presence of transients. Research Design: A single-blinded, repeated-measures design was used. Study Sample: Thirteen experienced hearing aid users with bilaterally symmetrical (≤7.5 dB) sensorineural hearing loss participated in the study. Results: Speech identification in quiet (no transient noise) was identical between the TNR On and the TNR Off conditions. The participants showed subjective preference for the TNR algorithm when “comfortable listening” was used as the criterion. Participants preferred less gain than the default prescription in the presence of transient noise sounds. However, the preferred gain was 2.9 dB higher when the TNR was activated than when it was deactivated. This translated to 12.1% improvement in phoneme identification over the TNR Off condition for soft speech. Conclusions: This study demonstrated that the use of the TNR algorithm would not negatively affect speech identification. The results also suggested that this algorithm may improve listening comfort in the presence of transient noise sounds and ensure consistent use of prescribed gain. Such an algorithm may ensure more consistent audibility across listening environments.

Download Full-text

Enhancement of Speech Intelligibility using Binary Mask Based on Noise Constraints

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c5260.098319 ◽

2019 ◽

Vol 8 (3) ◽

pp. 3509-3516

Keyword(s):

Hearing Aids ◽

Speech Intelligibility ◽

Signal To Noise Ratio ◽

Random Noise ◽

Wiener Filter ◽

Binary Mask ◽

Gain Function ◽

Magnitude Spectrum ◽

Objective Tests ◽

Babble Noise

The primary aim of this paper is to examine the application of binary mask to improve intelligibility in most unfavorable conditions where hearing impaired/normal listeners find it difficult to understand what is being told. Most of the existing noise reduction algorithms are known to improve the speech quality but they hardly improve speech intelligibility. The paper proposed by Gibak Kim and Philipos C. Loizou uses the Weiner gain function for improving speech intelligibility. Here, in this paper we have proposed to apply the same approach in magnitude spectrum using the parametric wiener filter in order to study its effects on overall speech intelligibility. Subjective and objective tests were conducted to evaluate the performance of the enhanced speech for various types of noises. The results clearly indicate that there is an improvement in average segmental signal-to-noise ratio for the speech corrupted at -5dB, 0dB, 5dB and 10dB SNR values for random noise, babble noise, car noise and helicopter noise. This technique can be used in real time applications, such as mobile, hearing aids and speech–activated machines

Download Full-text

Interactions Between Digital Noise Reduction and Reverberation: Acoustic and Behavioral Effects

Journal of the American Academy of Audiology ◽

10.3766/jaaa.18048 ◽

2020 ◽

Vol 31 (01) ◽

pp. 017-029

Author(s):

Paul Reinhart ◽

Pavel Zahorik ◽

Pamela Souza

Keyword(s):

Noise Reduction ◽

Hearing Impairment ◽

Hearing Aids ◽

Background Noise ◽

Speech Intelligibility ◽

Spectral Subtraction ◽

Listening Effort ◽

Speech In Noise ◽

Reverberant Environments ◽

Speech Naturalness

AbstractDigital noise reduction (DNR) processing is used in hearing aids to enhance perception in noise by classifying and suppressing the noise acoustics. However, the efficacy of DNR processing is not known under reverberant conditions where the speech-in-noise acoustics are further degraded by reverberation.The purpose of this study was to investigate acoustic and perceptual effects of DNR processing across a range of reverberant conditions for individuals with hearing impairment.This study used an experimental design to investigate the effects of varying reverberation on speech-in-noise processed with DNR.Twenty-six listeners with mild-to-moderate sensorineural hearing impairment participated in the study.Speech stimuli were combined with unmodulated broadband noise at several signal-to-noise ratios (SNRs). A range of reverberant conditions with realistic parameters were simulated, as well as an anechoic control condition without reverberation. Reverberant speech-in-noise signals were processed using a spectral subtraction DNR simulation. Signals were acoustically analyzed using a phase inversion technique to quantify improvement in SNR as a result of DNR processing. Sentence intelligibility and subjective ratings of listening effort, speech naturalness, and background noise comfort were examined with and without DNR processing across the conditions.Improvement in SNR was greatest in the anechoic control condition and decreased as the ratio of direct to reverberant energy decreased. There was no significant effect of DNR processing on speech intelligibility in the anechoic control condition, but there was a significant decrease in speech intelligibility with DNR processing in all of the reverberant conditions. Subjectively, listeners reported greater listening effort and lower speech naturalness with DNR processing in some of the reverberant conditions. Listeners reported higher background noise comfort with DNR processing only in the anechoic control condition.Results suggest that reverberation affects DNR processing using a spectral subtraction algorithm in such a way that decreases the ability of DNR to reduce noise without distorting the speech acoustics. Overall, DNR processing may be most beneficial in environments with little reverberation and that the use of DNR processing in highly reverberant environments may actually produce adverse perceptual effects. Further research is warranted using commercial hearing aids in realistic reverberant environments.

Download Full-text

A Comparison of Personal Sound Amplification Products and Hearing Aids in Ecologically Relevant Test Environments

American Journal of Audiology ◽

10.1044/2018_aja-18-0027 ◽

2018 ◽

Vol 27 (4) ◽

pp. 581-593 ◽

Cited By ~ 11

Author(s):

Lisa Brody ◽

Yu-Hsiang Wu ◽

Elizabeth Stangl

Keyword(s):

Speech Recognition ◽

Hearing Aids ◽

Speech Intelligibility ◽

Best Practice ◽

Hearing Aid ◽

Recognition Performance ◽

Sound Quality ◽

Listening Effort ◽

Speech Intelligibility Index ◽

Sound Amplification

Purpose The aim of this study was to compare the benefit of self-adjusted personal sound amplification products (PSAPs) to audiologist-fitted hearing aids based on speech recognition, listening effort, and sound quality in ecologically relevant test conditions to estimate real-world effectiveness. Method Twenty-five older adults with bilateral mild-to-moderate hearing loss completed the single-blinded, crossover study. Participants underwent aided testing using 3 PSAPs and a traditional hearing aid, as well as unaided testing. PSAPs were adjusted based on participant preference, whereas the hearing aid was configured using best-practice verification protocols. Audibility provided by the devices was quantified using the Speech Intelligibility Index (American National Standards Institute, 2012). Outcome measures assessing speech recognition, listening effort, and sound quality were administered in ecologically relevant laboratory conditions designed to represent real-world speech listening situations. Results All devices significantly improved Speech Intelligibility Index compared to unaided listening, with the hearing aid providing more audibility than all PSAPs. Results further revealed that, in general, the hearing aid improved speech recognition performance and reduced listening effort significantly more than all PSAPs. Few differences in sound quality were observed between devices. All PSAPs improved speech recognition and listening effort compared to unaided testing. Conclusions Hearing aids fitted using best-practice verification protocols were capable of providing more aided audibility, better speech recognition performance, and lower listening effort compared to the PSAPs tested in the current study. Differences in sound quality between the devices were minimal. However, because all PSAPs tested in the study significantly improved participants' speech recognition performance and reduced listening effort compared to unaided listening, PSAPs could serve as a budget-friendly option for those who cannot afford traditional amplification.

Download Full-text