A Danish Sentence Corpus for Assessing Speech Recognition in Noise in School-Age Children

For the audiological assessment of the speech-in-noise abilities of children with normal or impaired hearing, appropriate test materials are required. However, in Denmark, no standardized materials exist. The purpose of this study was to develop a Danish sentence corpus suitable for testing school-age children. Based on the 600 validated test sentences from the Danish DAT ( Dagmar, Asta, or Tine) corpus, 11 test lists comprising 20 sentences each were carefully constructed. These lists were evaluated in terms of their perceptual similarity and reliability with a group of 20 typically developing, normal-hearing children aged 6 to 12 years. Using stationary speech-shaped noise and diotic stimulus presentation, speech recognition thresholds (SRTs) were measured twice per list and participant at two separate visits. The analyses showed that six test lists were perceptually equivalent. These lists are characterized by a grand average SRT of −2.6 dB signal-to-noise ratio, a test–retest improvement of 0.6 dB, and a within-subject standard deviation of 1.1 dB signal-to-noise ratio. The other lists were characterized by slightly higher SRTs, slightly larger training effects, and slightly larger measurement uncertainty, but were otherwise also usable. Overall, it is therefore concluded that the developed corpus is suited for assessing speech recognition in noise in Danish 6- to 12-year olds. The corpus is publicly available.

Download Full-text

Behavioral Validation of the Smartphone for Remote Microphone Technology

Seminars in Hearing ◽

10.1055/s-0040-1718714 ◽

2020 ◽

Vol 41 (04) ◽

pp. 291-301

Author(s):

Stephanie Tittle ◽

Linda M. Thibodeau ◽

Issa Panahi ◽

Serkan Tokgoz ◽

Nikhil Shankar ◽

...

Keyword(s):

Speech Recognition ◽

Open Source ◽

Hearing Aids ◽

Signal To Noise Ratio ◽

Communication Disorders ◽

National Institutes Of Health ◽

Impaired Hearing ◽

Signal To Noise ◽

Behavioral Validation ◽

Speech Recognition In Noise

AbstractAs part of a National Institutes of Health–National Institute on Deafness and Other communication Disorders (NIH-NIDCD)–supported project to develop open-source research and smartphone-based apps for enhancing speech recognition in noise, an app called Smartphone Hearing Aid Research Project Version 2 (SHARP-2) was tested with persons with normal and impaired hearing when using three sets of hearing aids (HAs) with wireless connectivity to an iPhone. Participants were asked to type sentences presented from a speaker in front of them while hearing noise from behind in two conditions, HA alone and HA + SHARP-2 app running on the iPhone. The signal was presented at a constant level of 65 dBA and the signal-to-noise ratio varied from −10 to +10, so that the task was difficult when listening through the bilateral HAs alone. This was important to allow for improvement to be measured when the HAs were connected to the SHARP-2 app on the smartphone. Benefit was achieved for most listeners with all three manufacturer HAs with the greatest improvements recorded for persons with normal (33.56%) and impaired hearing (22.21%) when using the SHARP-2 app with one manufacturer's made-for-all phones HAs. These results support the continued development of smartphone-based apps as an economical solution for enhancing speech recognition in noise for both persons with normal and impaired hearing.

Download Full-text

The Use of Speech Recognition Systems to Select a Useful Signal in Noisy Speech at a Low Signal-To-Noise Ratio

10.1109/dynamics52735.2021.9653711 ◽

2021 ◽

Author(s):

Sh. R. Salimov ◽

N. A. Volkov ◽

A. V. Ivanov

Keyword(s):

Speech Recognition ◽

Signal To Noise Ratio ◽

Signal To Noise ◽

Noisy Speech ◽

Useful Signal ◽

Recognition Systems ◽

Noise Ratio

Download Full-text

Research on Speaker Recognition of DRNN in Different Noise Environment

10.21203/rs.3.rs-124941/v1 ◽

2020 ◽

Author(s):

chaofeng lan ◽

yuanyuan Zhang ◽

hongyun Zhao

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Recurrent Neural Network ◽

Speaker Recognition ◽

Signal To Noise Ratio ◽

Recognition Rate ◽

Noisy Environment ◽

Signal To Noise ◽

Noise Ratio ◽

Improved Model

Abstract This paper draws on the training method of Recurrent Neural Network (RNN), By increasing the number of hidden layers of RNN and changing the layer activation function from traditional Sigmoid to Leaky ReLU on the input layer, the first group and the last set of data are zero-padded to enhance the effective utilization of data such that the improved reduction model of Denoise Recurrent Neural Network (DRNN) with high calculation speed and good convergence is constructed to solve the problem of low speaker recognition rate in noisy environment. According to this model, the random semantic speech signal with a sampling rate of 16 kHz and a duration of 5 seconds in the speech library is studied. The experimental settings of the signal-to-noise ratios are − 10dB, -5dB, 0dB, 5dB, 10dB, 15dB, 20dB, 25dB. In the noisy environment, the improved model is used to denoise the Mel Frequency Cepstral Coefficients (MFCC) and the Gammatone Frequency Cepstral Coefficents (GFCC), impact of the traditional model and the improved model on the speech recognition rate is analyzed. The research shows that the improved model can effectively eliminate the noise of the feature parameters and improve the speech recognition rate. When the signal-to-noise ratio is low, the speaker recognition rate can be more obvious. Furthermore, when the signal-to-noise ratio is 0dB, the speaker recognition rate of people is increased by 40%, which can be 85% improved compared with the traditional speech model. On the other hand, with the increase in the signal-to-noise ratio, the recognition rate is gradually increased. When the signal-to-noise ratio is 15dB, the recognition rate of speakers is 93%.

Download Full-text

Effects of Adaptive Hearing Aid Directionality and Noise Reduction on Masked Speech Recognition for Children Who Are Hard of Hearing

American Journal of Audiology ◽

10.1044/2018_aja-18-0045 ◽

2019 ◽

Vol 28 (1) ◽

pp. 101-113 ◽

Cited By ~ 3

Author(s):

Jenna M. Browning ◽

Emily Buss ◽

Mary Flaherty ◽

Tim Vallier ◽

Lori J. Leibold

Keyword(s):

Speech Recognition ◽

Hearing Aids ◽

Hard Of Hearing ◽

Signal To Noise Ratio ◽

Hearing Aid ◽

Normal Hearing ◽

Signal To Noise ◽

Open Set ◽

Fully Adaptive ◽

Noise Ratio

Purpose The purpose of this study was to evaluate speech-in-noise and speech-in-speech recognition associated with activation of a fully adaptive directional hearing aid algorithm in children with mild to severe bilateral sensory/neural hearing loss. Method Fourteen children (5–14 years old) who are hard of hearing participated in this study. Participants wore laboratory hearing aids. Open-set word recognition thresholds were measured adaptively for 2 hearing aid settings: (a) omnidirectional (OMNI) and (b) fully adaptive directionality. Each hearing aid setting was evaluated in 3 listening conditions. Fourteen children with normal hearing served as age-matched controls. Results Children who are hard of hearing required a more advantageous signal-to-noise ratio than children with normal hearing to achieve comparable performance in all 3 conditions. For children who are hard of hearing, the average improvement in signal-to-noise ratio when comparing fully adaptive directionality to OMNI was 4.0 dB in noise, regardless of target location. Children performed similarly with fully adaptive directionality and OMNI settings in the presence of the speech maskers. Conclusions Compared to OMNI, fully adaptive directionality improved speech recognition in steady noise for children who are hard of hearing, even when they were not facing the target source. This algorithm did not affect speech recognition when the background noise was speech. Although the use of hearing aids with fully adaptive directionality is not proposed as a substitute for remote microphone systems, it appears to offer several advantages over fixed directionality, because it does not depend on children facing the target talker and provides access to multiple talkers within the environment. Additional experiments are required to further evaluate children's performance under a variety of spatial configurations in the presence of both noise and speech maskers.

Download Full-text

Fitting and Verification of Frequency Modulation Systems on Children with Normal Hearing

Journal of the American Academy of Audiology ◽

10.3766/jaaa.25.6.3 ◽

2014 ◽

Vol 25 (06) ◽

pp. 529-540 ◽

Cited By ~ 2

Author(s):

Erin C. Schafer ◽

Danielle Bryant ◽

Katie Sanders ◽

Nicole Baldus ◽

Katherine Algier ◽

...

Keyword(s):

Speech Recognition ◽

Frequency Modulation ◽

Auditory Processing ◽

Signal To Noise Ratio ◽

Recognition Performance ◽

Normal Hearing ◽

Signal To Noise ◽

The Real ◽

Hearing Sensitivity ◽

Speech Recognition In Noise

Background: Several recent investigations support the use of frequency modulation (FM) systems in children with normal hearing and auditory processing or listening disorders such as those diagnosed with auditory processing disorders, autism spectrum disorders, attention-deficit hyperactivity disorder, Friedreich ataxia, and dyslexia. The American Academy of Audiology (AAA) published suggested procedures, but these guidelines do not cite research evidence to support the validity of the recommended procedures for fitting and verifying nonoccluding open-ear FM systems on children with normal hearing. Documenting the validity of these fitting procedures is critical to maximize the potential FM-system benefit in the abovementioned populations of children with normal hearing and those with auditory-listening problems. Purpose: The primary goal of this investigation was to determine the validity of the AAA real-ear approach to fitting FM systems on children with normal hearing. The secondary goal of this study was to examine speech-recognition performance in noise and loudness ratings without and with FM systems in children with normal hearing sensitivity. Research Design: A two-group, cross-sectional design was used in the present study. Study Sample: Twenty-six typically functioning children, ages 5–12 yr, with normal hearing sensitivity participated in the study. Intervention: Participants used a nonoccluding open-ear FM receiver during laboratory-based testing. Data Collection and Analysis: Participants completed three laboratory tests: (1) real-ear measures, (2) speech recognition performance in noise, and (3) loudness ratings. Four real-ear measures were conducted to (1) verify that measured output met prescribed-gain targets across the 1000–4000 Hz frequency range for speech stimuli, (2) confirm that the FM-receiver volume did not exceed predicted uncomfortable loudness levels, and (3 and 4) measure changes to the real-ear unaided response when placing the FM receiver in the child’s ear. After completion of the fitting, speech recognition in noise at a –5 signal-to-noise ratio and loudness ratings at a +5 signal-to-noise ratio were measured in four conditions: (1) no FM system, (2) FM receiver on the right ear, (3) FM receiver on the left ear, and (4) bilateral FM system. Results: The results of this study suggested that the slightly modified AAA real-ear measurement procedures resulted in a valid fitting of one FM system on children with normal hearing. On average, prescriptive targets were met for 1000, 2000, 3000, and 4000 Hz within 3 dB, and maximum output of the FM system never exceeded and was significantly lower than predicted uncomfortable loudness levels for the children. There was a minimal change in the real-ear unaided response when the open-ear FM receiver was placed into the ear. Use of the FM system on one or both ears resulted in significantly better speech recognition in noise relative to a no-FM condition, and the unilateral and bilateral FM receivers resulted in a comfortably loud signal when listening in background noise. Conclusions: Real-ear measures are critical for obtaining an appropriate fit of an FM system on children with normal hearing.

Download Full-text

Speech Recognition in Nonnative versus Native English-Speaking College Students in a Virtual Classroom

Journal of the American Academy of Audiology ◽

10.3766/jaaa.15125 ◽

2017 ◽

Vol 28 (05) ◽

pp. 404-414 ◽

Cited By ~ 4

Author(s):

Dorothy Neave-DiToro ◽

Adrienne Rubinstein ◽

Arlene C. Neuman

Keyword(s):

College Students ◽

Speech Recognition ◽

Signal To Noise Ratio ◽

Nonnative Speakers ◽

Mixed Design ◽

Signal To Noise ◽

Classroom Acoustics ◽

English Speaking ◽

Noise Ratio ◽

The Impact

Background: Limited attention has been given to the effects of classroom acoustics at the college level. Many studies have reported that nonnative speakers of English are more likely to be affected by poor room acoustics than native speakers. An important question is how classroom acoustics affect speech perception of nonnative college students. Purpose: The combined effect of noise and reverberation on the speech recognition performance of college students who differ in age of English acquisition was evaluated under conditions simulating classrooms with reverberation times (RTs) close to ANSI recommended RTs. Research Design: A mixed design was used in this study. Study Sample: Thirty-six native and nonnative English-speaking college students with normal hearing, ages 18–28 yr, participated. Intervention: Two groups of nine native participants (native monolingual [NM] and native bilingual) and two groups of nine nonnative participants (nonnative early and nonnative late) were evaluated in noise under three reverberant conditions (0.03, 0.06, and 0.08 sec). Data Collection and Analysis: A virtual test paradigm was used, which represented a signal reaching a student at the back of a classroom. Speech recognition in noise was measured using the Bamford–Kowal–Bench Speech-in-Noise (BKB-SIN) test and signal-to-noise ratio required for correct repetition of 50% of the key words in the stimulus sentences (SNR-50) was obtained for each group in each reverberant condition. A mixed-design analysis of variance was used to determine statistical significance as a function of listener group and RT. Results: SNR-50 was significantly higher for nonnative listeners as compared to native listeners, and a more favorable SNR-50 was needed as RT increased. The most dramatic effect on SNR-50 was found in the group with later acquisition of English, whereas the impact of early introduction of a second language was subtler. At the ANSI standard’s maximum recommended RT (0.6 sec), all groups except the NM group exhibited a mild signal-to-noise ratio (SNR) loss. At the 0.8 sec RT, all groups exhibited a mild SNR loss. Conclusion: Acoustics in the classroom are an important consideration for nonnative speakers who are proficient in English and enrolled in college. To address the need for a clearer speech signal by nonnative students (and for all students), universities should follow ANSI recommendations, as well as minimize background noise in occupied classrooms. Behavioral/instructional strategies should be considered to address factors that cannot be compensated for through acoustic design.

Download Full-text

The effects of compression ratio, signal-to-noise ratio, and level on speech recognition in normal-hearing listeners

The Journal of the Acoustical Society of America ◽

10.1121/1.1369105 ◽

2001 ◽

Vol 109 (6) ◽

pp. 2964-2973 ◽

Cited By ~ 25

Author(s):

Benjamin W. Y. Hornsby ◽

Todd A. Ricketts

Keyword(s):

Speech Recognition ◽

Compression Ratio ◽

Signal To Noise Ratio ◽

Normal Hearing ◽

Signal To Noise ◽

Noise Ratio

Download Full-text

Forward and Backward Masking of Consonants in School-Age Children and Adults

Journal of Speech Language and Hearing Research ◽

10.1044/2018_jslhr-h-17-0403 ◽

2018 ◽

Vol 61 (7) ◽

pp. 1807-1814 ◽

Cited By ~ 1

Author(s):

Heather L. Porter ◽

Emily R. Spitzer ◽

Emily Buss ◽

Lori J. Leibold ◽

John H. Grose

Keyword(s):

Word Recognition ◽

Signal To Noise Ratio ◽

Recognition Threshold ◽

School Age Children ◽

Backward Masking ◽

Signal To Noise ◽

Percent Correct ◽

Speech Stimuli ◽

Noise Ratio ◽

Psychophysical Studies

Purpose This experiment sought to determine whether children's increased susceptibility to nonsimultaneous masking, particularly backward masking, is evident for speech stimuli. Method Five- to 9-year-olds and adults with normal hearing heard nonsense consonant–vowel–consonant targets. In Experiments 1 and 2, those targets were presented between two 250-ms segments of 70-dB-SPL speech-shaped noise, at either −30 dB signal-to-noise ratio (Experiment 1) or at the listener's word recognition threshold (Experiment 2). In Experiment 3, the target was presented in steady speech-shaped noise at listener threshold. For all experiments, percent correct was estimated for initial and final consonants. Results In the nonsimultaneous noise conditions, child–adult differences were larger for the final consonant than the initial consonant whether listeners were tested at −30 dB signal-to-noise ratio (Experiment 1) or at their individual word recognition threshold (Experiment 2). Children were not particularly susceptible to backward masking relative to adults when tested in a steady masker (Experiment 3). Conclusions Child–adult differences were greater for backward than forward masking for speech in a nonsimultaneous noise masker, as observed in previous psychophysical studies using tonal stimuli. Children's greater susceptibility to nonsimultaneous masking, and backward masking in particular, could play a role in their limited ability to benefit from masker envelope modulation when recognizing masked speech.

Download Full-text

Adaptation of the BKB-SIN Test for Use as a Pediatric Aided Outcome Measure

Journal of the American Academy of Audiology ◽

10.3766/jaaa.22.6.6 ◽

2011 ◽

Vol 22 (06) ◽

pp. 375-386 ◽

Cited By ~ 8

Author(s):

Stella L. Ng ◽

Christine N. Meston ◽

Susan D. Scollie ◽

Richard C. Seewald

Keyword(s):

Speech Recognition ◽

Hearing Aids ◽

Outcome Measure ◽

Signal To Noise Ratio ◽

Outcome Measurement ◽

Normal Hearing ◽

Signal To Noise ◽

Directional Microphone ◽

Adults And Children ◽

Noise Ratio

Background: There is a need for objective pediatric hearing aid outcome measurement and thus a need for the evaluation of outcome measures. We explored a commercially available pediatric sentence-in-noise measure adapted for use as an aided outcome measure. Purpose: The purposes of the current study were (1) to administer an adapted BKB-SIN (Bamford-Kowal-Bench Speech-in-Noise test) to adults and children who have normal hearing and children who use hearing aids and (2) to evaluate the utility of this adapted BKB-SIN as an aided, within-subjects outcome measure for amplification strategies. Research Design: We used a mixed within and between groups design to evaluate speech recognition in noise for the three groups of participants. The children who use hearing aids were tested under the omnidirectional, directional, and digital noise reduction (DNR) conditions. Results from each group were compared to each other, and we compared results of each aided condition for the children who use hearing aids to evaluate the test utility as an aided outcome measure. Study Sample: The study sample consisted of 14 adults with normal hearing (aged 22–28 yr) and 15 children with normal hearing (aged 6–18 yr), recruited through word of mouth, and 14 children who use hearing aids (aged 9–16 yr) recruited from local audiology clinics. Data Collection and Analysis: List pairs of the BKB-SIN test were presented at 50 dB HL as follows: four list pairs to each participant with normal hearing, four list pairs in the omnidirectional condition, and two list pairs in the directional and DNR conditions. Children who use hearing aids were fitted bilaterally with laboratory devices and completed the BKB-SIN test aided. Data were plotted as mean percent of key words correct at each signal-to-noise ratio (SNR). Further, we conducted an analysis of variance for group differences and within-groups for the three aided conditions. Results: Adult participants outperformed children with normal hearing, who outperformed the children who use hearing aids. SNR-50 (signal-to-noise ratio at which listener can obtain a speech recognition score of 50% correct) scores demonstrated reliability of the adapted test implementation. The BKB-SIN test measured significant differences in performance for omnidirectional versus directional microphone conditions but not between omnidirectional and DNR conditions. Conclusions: We conclude that the adapted implementation of the BKB-SIN test can be administered reliably and feasibly. Further study is warranted to develop norms for the adapted implementation as well as to determine if an adapted implementation can be sensitive to age effects. Until such norms are developed, clinicians should refrain from comparing results from the adapted test to the test manual norms and should instead use the adapted implementation as a within-subject measure.

Download Full-text

Effect of cochlear implant n-of-m strategy on signal-to-noise ratio below which noise hinders speech recognition

The Journal of the Acoustical Society of America ◽

10.1121/1.5107430 ◽

2019 ◽

Vol 145 (5) ◽

pp. EL417-EL422

Author(s):

Lucas Stam ◽

S. Theo Goverts ◽

Cas Smits

Keyword(s):

Speech Recognition ◽

Cochlear Implant ◽

Signal To Noise Ratio ◽

Signal To Noise ◽

Noise Ratio

Download Full-text