Quantifying the Relation Between Speech Quality and Speech Intelligibility

1995 ◽  
Vol 38 (3) ◽  
pp. 714-725 ◽  
Author(s):  
Jill E. Preminger ◽  
Dianne J. Van Tasell

The purpose of the present research was to examine the relation between speech quality and speech intelligibility. Speech quality measurements were made using continuous discourse and a category rating procedure for the following dimensions: intelligibility, pleasantness, loudness, effort, and total impression. Measurements were made using a group of listeners with normal hearing for a set of stimulus conditions in which intelligibility varied, and for a set of stimulus conditions in which intelligibility was held constant near 100%. When ratings were made for a set of stimulus conditions in which intelligibility was allowed to vary (a) intersubject reliability was high (i.e., different listeners interpreted the dimensions in a similar manner); and (b) the speech quality dimensions of intelligibility, effort, and loudness were indistinguishable. When ratings were made for a set of stimulus conditions in which intelligibility was held constant (a) intersubject reliability was reduced, indicating that different listeners interpreted the dimensions in different ways; (b) most listeners rated each dimension differently, indicating that the dimensions were unique; and (c) across listeners, no single dimension was highly correlated with total impression. These results can be used in order to examine the relation between speech quality and speech intelligibility.

1995 ◽  
Vol 38 (3) ◽  
pp. 726-736 ◽  
Author(s):  
Jill E. Preminger ◽  
Dianne J. Van Tasell

The purpose of the present research was to develop a theoretical basis for the adjustment of hearing aid frequency response based on speech quality measurements. Speech quality measurements were made using continuous discourse and a category rating procedure for the following dimensions: intelligibility, pleasantness, loudness, effort, noisiness, and total impression. Speech quality ratings were obtained from a group of listeners with hearing loss who wore hearing aids. The stimulus conditions simulated hearing aid frequency response alterations within a frequency response range where intelligibility was held constant at or near 100%. The subject ratings revealed that (a) different listeners interpreted the individual dimensions in different ways; (b) within listeners, most of the dimensions were unique; that is, they were rated differently; and (c) across listeners, pleasantness was the dimension most highly correlated with total impression.


1994 ◽  
Vol 110 (1) ◽  
pp. 75-83 ◽  
Author(s):  
C SPEAKS ◽  
T TRINE ◽  
T CRAIN ◽  
N NICCUM

Author(s):  
Seong Hee Lee ◽  
Hyun Joon Shim ◽  
Sang Won Yoon ◽  
Kyoung Won Lee

2010 ◽  
Author(s):  
Marcel Wältermann ◽  
Alexander Raake ◽  
Sebastian Möller

2010 ◽  
Vol 10 ◽  
pp. 329-339 ◽  
Author(s):  
Torsten Rahne ◽  
Michael Ziese ◽  
Dorothea Rostalski ◽  
Roland Mühler

This paper describes a logatome discrimination test for the assessment of speech perception in cochlear implant users (CI users), based on a multilingual speech database, the Oldenburg Logatome Corpus, which was originally recorded for the comparison of human and automated speech recognition. The logatome discrimination task is based on the presentation of 100 logatome pairs (i.e., nonsense syllables) with balanced representations of alternating “vowel-replacement” and “consonant-replacement” paradigms in order to assess phoneme confusions. Thirteen adult normal hearing listeners and eight adult CI users, including both good and poor performers, were included in the study and completed the test after their speech intelligibility abilities were evaluated with an established sentence test in noise. Furthermore, the discrimination abilities were measured electrophysiologically by recording the mismatch negativity (MMN) as a component of auditory event-related potentials. The results show a clear MMN response only for normal hearing listeners and CI users with good performance, correlating with their logatome discrimination abilities. Higher discrimination scores for vowel-replacement paradigms than for the consonant-replacement paradigms were found. We conclude that the logatome discrimination test is well suited to monitor the speech perception skills of CI users. Due to the large number of available spoken logatome items, the Oldenburg Logatome Corpus appears to provide a useful and powerful basis for further development of speech perception tests for CI users.


Sensors ◽  
2021 ◽  
Vol 21 (5) ◽  
pp. 1878
Author(s):  
Yi Zhou ◽  
Haiping Wang ◽  
Yijing Chu ◽  
Hongqing Liu

The use of multiple spatially distributed microphones allows performing spatial filtering along with conventional temporal filtering, which can better reject the interference signals, leading to an overall improvement of the speech quality. In this paper, we propose a novel dual-microphone generalized sidelobe canceller (GSC) algorithm assisted by a bone-conduction (BC) sensor for speech enhancement, which is named BC-assisted GSC (BCA-GSC) algorithm. The BC sensor is relatively insensitive to the ambient noise compared to the conventional air-conduction (AC) microphone. Hence, BC speech can be analyzed to generate very accurate voice activity detection (VAD), even in a high noise environment. The proposed algorithm incorporates the VAD information obtained by the BC speech into the adaptive blocking matrix (ABM) and adaptive noise canceller (ANC) in GSC. By using VAD to control ABM and combining VAD with signal-to-interference ratio (SIR) to control ANC, the proposed method could suppress interferences and improve the overall performance of GSC significantly. It is verified by experiments that the proposed GSC system not only improves speech quality remarkably but also boosts speech intelligibility.
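The abstract gives no implementation details, so the following is only a minimal sketch of the general idea of letting a voice activity decision gate an adaptive noise canceller. It assumes a plain NLMS filter that adapts only while the VAD flag reports no speech. The function name `vad_gated_nlms`, its parameters, and the synthetic signals are all hypothetical. It does not reproduce the authors' BCA-GSC algorithm, which also controls the adaptive blocking matrix and uses a signal-to-interference ratio.

```python
import random


def vad_gated_nlms(primary, reference, vad, taps=4, mu=0.5, eps=1e-8):
    """Cancel noise in `primary` using `reference`, adapting only when vad[n] is False.

    Hypothetical helper for illustration; not the published BCA-GSC method.
    """
    w = [0.0] * taps
    out = []
    for n in range(len(primary)):
        # Most recent `taps` reference samples, zero-padded at the start.
        x = [reference[n - k] if n - k >= 0 else 0.0 for k in range(taps)]
        y = sum(wk * xk for wk, xk in zip(w, x))  # noise estimate
        e = primary[n] - y                        # enhanced output sample
        out.append(e)
        if not vad[n]:
            # Speech absent: safe to adapt without cancelling the target.
            norm = eps + sum(xk * xk for xk in x)
            w = [wk + mu * e * xk / norm for wk, xk in zip(w, x)]
    return out, w


# Synthetic noise-only scenario (assumed data): the primary channel contains
# the reference noise passed through a short two-tap path.
rng = random.Random(0)
reference = [rng.gauss(0.0, 1.0) for _ in range(2000)]
primary = [0.8 * reference[n] - 0.3 * (reference[n - 1] if n else 0.0)
           for n in range(2000)]
vad = [False] * 2000  # no speech anywhere, so the filter adapts throughout

enhanced, weights = vad_gated_nlms(primary, reference, vad)
```

Freezing the weights while speech is present is the usual reason for gating a noise canceller with a VAD: adapting during speech risks cancelling the target signal along with the interference.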


2002 ◽  
Vol 13 (05) ◽  
pp. 236-245 ◽  
Author(s):  
Gary Rance ◽  
Field Rickards

This retrospective study examines the relationship between auditory steady-state evoked potential (ASSEP) thresholds determined in infancy and subsequently obtained behavioral hearing levels in children with normal hearing or varying degrees of sensorineural hearing loss. Overall, the results from 211 subjects showed that the two test techniques were highly correlated, with Pearson r values exceeding .95 at each of the audiometric test frequencies between 500 and 4000 Hz. Analysis of the findings for babies with significant hearing loss (moderate to profound levels) showed similar threshold relationships to those obtained in previous studies involving adults and older children. The results for infants with normal or near-normal hearing did, however, differ from those reported for older subjects, with behavioral thresholds typically 10 to 15 dB better than would have been predicted from their ASSEP levels.


1976 ◽  
Vol 19 (2) ◽  
pp. 279-289 ◽  
Author(s):  
Randall B. Monsen

Although it is well known that the speech produced by the deaf is generally of low intelligibility, the sources of this low speech intelligibility have generally been ascribed either to aberrant articulation of phonemes or inappropriate prosody. This study was designed to determine to what extent a nonsegmental aspect of speech, formant transitions, may differ in the speech of the deaf and of the normal hearing. The initial second formant transitions of the vowels /i/ and /u/ after labial and alveolar consonants (/b, d, f/) were compared in the speech of six normal-hearing and six hearing-impaired adolescents. In the speech of the hearing-impaired subjects, the second formant transitions may be reduced both in time and in frequency. At its onset, the second formant may be nearer to its eventual target frequency than in the speech of the normal subjects. Since formant transitions are important acoustic cues for the adjacent consonants, reduced F2 transitions may be an important factor in the low intelligibility of the speech of the deaf.

