Perceptual Evaluation of Tracheoesophageal Speech by Naive and Experienced Judges Through the Use of Semantic Differential Scales

2003 ◽  
Vol 46 (4) ◽  
pp. 947-959 ◽  
Author(s):  
Corina J. van As ◽  
Florien J. Koopmans-van Beinum ◽  
Louis C. W. Pols ◽  
Frans J. M. Hilgers

The present study was conducted to investigate voice quality in tracheoesophageal speech by means of perceptual evaluations and to develop a clinically useful subset of perceptual scales sufficient for these perceptual evaluations. The perceptual ratings were obtained from both naive and trained raters (speechlanguage pathologists [SLPs]) after listening to a read-aloud text. The perceptual evaluations were performed by means of 19 semantic bipolar 7-point scales for the naive raters and 20 semantic bipolar 7-point scales for the trained raters. The trained raters were also asked to judge the overall voice quality as good, reasonable, or poor. Both naive listeners and trained SLPs were able to perform reliable perceptual judgments. Naive raters judged the tracheoesophageal voice as more deviant than the trained raters did. Naive raters made judgments based on 2 underlying perceptual dimensions (voice quality and pitch), whereas the trained raters made judgments based on 4 underlying perceptual dimensions (voice quality, tonicity, pitch, and tempo). These perceptual dimensions were further subdivided into a subset of 4 perceptual scales for the naive raters and a subset of 8 perceptual scales for the trained raters. This appeared to provide a sufficient coverage of the underlying perceptual dimensions used by the listeners.

2020 ◽  
Vol 63 (12) ◽  
pp. 3974-3981
Author(s):  
Ashwini Joshi ◽  
Isha Baheti ◽  
Vrushali Angadi

Aim The purpose of this study was to develop and assess the reliability of a Hindi version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). Reliability was assessed by comparing Hindi CAPE-V ratings with English CAPE-V ratings and by the Grade, Roughness, Breathiness, Asthenia and Strain (GRBAS) scale. Method Hindi sentences were created to match the phonemic load of the corresponding English CAPE-V sentences. The Hindi sentences were adapted for linguistic content. The original English and adapted Hindi CAPE-V and GRBAS were completed for 33 bilingual individuals with normal voice quality. Additionally, the Hindi CAPE-V and GRBAS were completed for 13 Hindi speakers with disordered voice quality. The agreement of CAPE-V ratings was assessed between language versions, GRBAS ratings, and two rater pairs (three raters in total). Pearson product–moment correlation was completed for all comparisons. Results A strong correlation ( r > .8, p < .01) was found between the Hindi CAPE-V scores and the English CAPE-V scores for most variables in normal voice participants. A weak correlation was found for the variable of strain ( r < .2, p = .400) in the normative group. A strong correlation ( r > .6, p < .01) was found between the overall severity/grade, roughness, and breathiness scores in the GRBAS scale and the CAPE-V scale in normal and disordered voice samples. Significant interrater reliability ( r > .75) was present in overall severity and breathiness. Conclusions The Hindi version of the CAPE-V demonstrates good interrater reliability and concurrent validity with the English CAPE-V and the GRBAS. The Hindi CAPE-V can be used for the auditory-perceptual voice assessment of Hindi speakers.


2017 ◽  
Vol 23 (1) ◽  
pp. 1-20
Author(s):  
Kathy Connaughton ◽  
Irena Yanushevskaya

Objective: This study explores the immediate impact of prolonged voice use by professional sports coaches. Method: Speech samples including sustained phonation of vowel /a/ and a short read passage were collected from two professional sports coaches. The audio recordings were made within an hour before and after a coaching session, over three sessions. Perceptual evaluation of voice quality was done using the GRBAS scale. The speech samples were subsequently analyzed using Praat. The acoustic measures included fundamental frequency (f0), jitter, shimmer, Harmonics-to-Noise ratio and Cepstral Peak Prominence. Main results: The results of perceptual and acoustic analysis suggest a slight shift towards a tenser phonation post-coaching session, which is a likely consequence of laryngeal muscle adaptation to prolonged voice use. This tendency was similar in sustained vowels and connected speech. Conclusion: Acoustic measures used in this study can be useful to capture the voice change post-coaching session. It is desirable, however, that more sophisticated and robust and at the same time intuitive and easy-to-use tools for voice assessment and monitoring be made available to clinicians and professional voice users.


2015 ◽  
Vol 58 (3) ◽  
pp. 535-549 ◽  
Author(s):  
Mara R. Kapsner-Smith ◽  
Eric J. Hunter ◽  
Kimberly Kirkham ◽  
Karin Cox ◽  
Ingo R. Titze

PurposeAlthough there is a long history of use of semi-occluded vocal tract gestures in voice therapy, including phonation through thin tubes or straws, the efficacy of phonation through tubes has not been established. This study compares results from a therapy program on the basis of phonation through a flow-resistant tube (FRT) with Vocal Function Exercises (VFE), an established set of exercises that utilize oral semi-occlusions.MethodTwenty subjects (16 women, 4 men) with dysphonia and/or vocal fatigue were randomly assigned to 1 of 4 treatment conditions: (a) immediate FRT therapy, (b) immediate VFE therapy, (c) delayed FRT therapy, or (d) delayed VFE therapy. Subjects receiving delayed therapy served as a no-treatment control group.ResultsVoice Handicap Index (Jacobson et al., 1997) scores showed significant improvement for both treatment groups relative to the no-treatment group. Comparison of the effect sizes suggests FRT therapy is noninferior to VFE in terms of reduction in Voice Handicap Index scores. Significant reductions in Roughness on the Consensus Auditory-Perceptual Evaluation of Voice (Kempster, Gerratt, Verdolini Abbott, Barkmeier-Kraemer, & Hillman, 2009) were found for the FRT subjects, with no other significant voice quality findings.ConclusionsVFE and FRT therapy may improve voice quality of life in some individuals with dysphonia. FRT therapy was noninferior to VFE in improving voice quality of life in this study.


2003 ◽  
Vol 12 (4) ◽  
pp. 432-439 ◽  
Author(s):  
Pauline A. Mashima ◽  
Deborah P. Birkmire-Peters ◽  
Mark J. Syms ◽  
Michael R. Holtel ◽  
Lawrence P. A. Burgess ◽  
...  

Telehealth offers the potential to meet the needs of underserved populations in remote regions. The purpose of this study was a proof-of-concept to determine whether voice therapy can be delivered effectively remotely. Treatment outcomes were evaluated for a vocal rehabilitation protocol delivered under 2 conditions: with the patient and clinician interacting within the same room (conventional group) and with the patient and clinician in separate rooms, interacting in real time via a hard-wired video camera and monitor (video teleconference group). Seventy-two patients with voice disorders served as participants. Based on evaluation by otolaryngologists, 31 participants were diagnosed with vocal nodules, 29 were diagnosed with edema, 9 were diagnosed with unilateral vocal fold paralysis, and 3 presented with vocal hyperfunction with no laryngeal pathology. Fifty-one participants (71%) completed the vocal rehabilitation protocol. Outcome measures included perceptual judgments of voice quality, acoustic analyses of voice, patient satisfaction ratings, and fiber-optic laryngoscopy. There were no differences in outcome measures between the conventional group and the remote video teleconference group. Participants in both groups showed positive changes on all outcome measures after completing the vocal rehabilitation protocol. Reasons for participants discontinuing therapy prematurely provided support for the telehealth model of service delivery.


2005 ◽  
Vol 119 (10) ◽  
pp. 825-827 ◽  
Author(s):  
Satoshi Kitahara ◽  
Yukihiro Masuda ◽  
Yoko Kitagawa

Vocal fold scarring results in the formation of fibrous tissue which disturbs the vibratory pattern of the fold during phonation. However, vocal fold scarring in humans is poorly understood because of the lack of clear case reports focusing on voice quality. The authors present a case of vocal fold scarring with changes in voice quality. At the time of injury the pedicle mucosa was cemented with fibrin glue. Phonation was inhibited for two weeks and tranilast (300 mg/day) was given for 3 months. Sixty-nine days later, perceptual evaluation showed a normal result and the phonation time became better, but the mucosal vibration was still lacking. Ninety-seven days later, mucosal vibration was finally restored. We suggest that characterization of vocal fold scarring in humans may be different from that in animals, and recommend that surgical management should be avoided for at least three months after injury.


2018 ◽  
Vol 61 (2) ◽  
pp. 227-245 ◽  
Author(s):  
Catherine L. Lortie ◽  
Isabelle Deschamps ◽  
Matthieu J. Guitton ◽  
Pascale Tremblay

Purpose The factors that influence the evaluation of voice in adulthood, as well as the consequences of such evaluation on social interactions, are not well understood. Here, we examined the effect of listeners' age and the effect of talker age, sex, and smoking status on the auditory-perceptual evaluation of voice, voice-related psychosocial attributions, and perceived speech tempo. We also examined the voice dimensions affecting the propensity to engage in social interactions. Method Twenty-five younger (age 19–37 years) and 25 older (age 51–74 years) healthy adults participated in this cross-sectional study. Their task was to evaluate the voice of 80 talkers. Results Statistical analyses revealed limited effects of the age of the listener on voice evaluation. Specifically, older listeners provided relatively more favorable voice ratings than younger listeners, mainly in terms of roughness. In contrast, the age of the talker had a broader impact on voice evaluation, affecting auditory-perceptual evaluations, psychosocial attributions, and perceived speech tempo. Some of these talker differences were dependent upon the sex of the talker and his or her smoking status. Finally, the results also show that voice-related psychosocial attribution was more strongly associated with the propensity of the listener to engage in social interactions with a person than auditory-perceptual dimensions and perceived speech tempo, especially for the younger adults. Conclusions These results suggest that age has a broad influence on voice evaluation, with a stronger impact for talker age compared with listener age. While voice-related psychosocial attributions may be an important determinant of social interactions, perceived voice quality and speech tempo appear to be less influential. Supplemental Materials https://doi.org/10.23641/asha.5844102


Revista CEFAC ◽  
2021 ◽  
Vol 23 (6) ◽  
Author(s):  
Tamar Vieira de Jesus ◽  
Aline Neves Pessoa Almeida ◽  
Zuleica Camargo

ABSTRACT Purpose: to relate ultrasound images with auditory-perceptual data on vocal quality settings in adult speakers of Brazilian Portuguese. Methods: the corpus consisted of speech samples (sentences contained in the instructional material of the Vocal Profile Analysis Scheme - VPAS-PB) from seven adult subjects of both genders, recorded simultaneously by acoustic and ultrasonographic means. Data analysis was based on auditory-perceptual judgments of vocal quality and ultrasound images generated by the AAA software. Results: vocal quality settings related to the position of the tongue body and the extension of the tongue and jaw found correspondences to the contours of ultrasound images of the tongue in selected key segments (oral vowels), especially those with greater degrees of manifestation. Conclusion: there were correspondences between vocal quality settings detected in the perceptual sphere and their respective tongue body and jaw ultrasound images.


2019 ◽  
Vol 128 (12) ◽  
pp. 1129-1133
Author(s):  
Danny B. Jandali ◽  
Ashwin Ganti ◽  
Inna A. Husain ◽  
Pete S. Batra ◽  
Bobby A. Tajudeen

Objectives: Functional endoscopic sinus surgery (FESS) is a standard treatment modality for patients with chronic rhinosinusitis (CRS) who have failed appropriate medical therapy. However, FESS entails modification of the upper airway tract that may alter phonatory resonance and produce voice changes. The effects of FESS on postoperative voice characteristics in patients with CRS have yet to be quantitatively assessed. Methods: Patients with severe CRS who underwent FESS at a tertiary care referral center between May and October 2017 were prospectively enrolled. The Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) and the Voice Handicap Index (VHI) were used to quantitatively evaluate voice characteristics and quality of life, respectively. Preoperative and postoperative CAPE-V and VHI scores were compared with postoperative scores for each patient. Sino-Nasal Outcome Test (SNOT-22) scores were also obtained to assess changes in patient symptoms. Results: 18 CRS patients undergoing FESS were enrolled. The average preoperative Lund-Mackay score was 14, indicating baseline severe CRS. Postoperative assessments demonstrated a statistically significant decrease in CAPE-V (45-27, p = .005) and VHI (10-4.7, p < .001) scores. These correlated with a statistically significant decrease in SNOT-22 scores (42-13, p < .001). Conclusions: Patients with CRS experience a significant improvement in voice characteristics and vocal quality of life following FESS. Furthermore, this appears to correlate with a significant decrease in self-reported disease severity. These findings may augment the discussion of potential benefits of FESS to a new potential domain for voice quality.


Revista CEFAC ◽  
2018 ◽  
Vol 20 (2) ◽  
pp. 191-200
Author(s):  
Cristiane Marangom ◽  
Viviani Souza Peruchi ◽  
Marta Assumpção de Andrada e Silva ◽  
Irene Queiroz Marchesan ◽  
Léslie Piccolotto Ferreira

ABSTRACT Objective: to analyze the association between voice disorder and aspects related to breathing mode in children, according to sex. Methods: 250 children, aged six to nine years, attending a public school in São Paulo city, were selected. The collection consisted of spontaneously audio recorded speech samples. The breathing mode was evaluated for lip resting posture and nasal flow. Three audiologists performed the perceptual evaluation of the voice quality, with the help of GIRBAS scale. The results were associated using the chi-square test (p = 0.05). Results: 50.4% females and 49.6% males. As for the overall grade (G), 12.8% had voice disorder with respect to voice quality. In the breathing mode, 36.8% presented alterations in lip posture and 71.2%, in nasal flow. The associations between voice disorder and gender (p = 0.96), lip posture (p = 0.38) and nasal flow (p = 0.18) and between alterations in the nasal flow and sex (p = 0.449) were not confirmed. The association between lip posture and sex revealed significant differences in favor of males (p = 0.003). Conclusion: there was no statistically significant difference associating voice disorders with breathing mode (lip posture and nasal flow) and gender.


Sign in / Sign up

Export Citation Format

Share Document