Perceptual Evaluation of Driving Scene Segmentation

2020 ◽

Vol 63 (4) ◽

pp. 1018-1032

Author(s):

Chia-Hsin Wu ◽

Roger W. Chan

Keyword(s):

Acoustic Analysis ◽

Vocal Tract ◽

Exercise Program ◽

Analysis Of Covariance ◽

Elderly Subjects ◽

Control Group ◽

Perceptual Evaluation ◽

Positive Effects ◽

Aging Voice ◽

Before And After

Purpose Semi-occluded vocal tract (SOVT) exercises with tubes or straws have been widely used for a variety of voice disorders. Yet, the effects of longer periods of SOVT exercises (lasting for weeks) on the aging voice are not well understood. This study investigated the effects of a 6-week straw phonation in water (SPW) exercise program. Method Thirty-seven elderly subjects with self-perceived voice problems were assigned into two groups: (a) SPW exercises with six weekly sessions and home practice (experimental group) and (b) vocal hygiene education (control group). Before and after intervention (2 weeks after the completion of the exercise program), acoustic analysis, auditory–perceptual evaluation, and self-assessment of vocal impairment were conducted. Results Analysis of covariance revealed significant differences between the two groups in smoothed cepstral peak prominence measures, harmonics-to-noise ratio, the auditory–perceptual parameter of breathiness, and Voice Handicap Index-10 scores postintervention. No significant differences between the two groups were found for other measures. Conclusions Our results supported the positive effects of SOVT exercises for the aging voice, with a 6-week SPW exercise program being a clinical option. Future studies should involve long-term follow-up and additional outcome measures to better understand the efficacy of SOVT exercises, particularly SPW exercises, for the aging voice.

Download Full-text

Perceptual Evaluation of Speech Naturalness in Speakers of Varying Gender Identities

Journal of Speech Language and Hearing Research ◽

10.1044/2020_jslhr-19-00337 ◽

2020 ◽

Vol 63 (7) ◽

pp. 2054-2069

Author(s):

Brandon Merritt ◽

Tessa Bent

Keyword(s):

Spontaneous Speech ◽

Identification Accuracy ◽

Rating Task ◽

Gender Identification ◽

Identification Task ◽

Male And Female ◽

Perceptual Evaluation ◽

Speech Training ◽

And Gender ◽

Speech Naturalness

Purpose The purpose of this study was to investigate how speech naturalness relates to masculinity–femininity and gender identification (accuracy and reaction time) for cisgender male and female speakers as well as transmasculine and transfeminine speakers. Method Stimuli included spontaneous speech samples from 20 speakers who are transgender (10 transmasculine and 10 transfeminine) and 20 speakers who are cisgender (10 male and 10 female). Fifty-two listeners completed three tasks: a two-alternative forced-choice gender identification task, a speech naturalness rating task, and a masculinity/femininity rating task. Results Transfeminine and transmasculine speakers were rated as significantly less natural sounding than cisgender speakers. Speakers rated as less natural took longer to identify and were identified less accurately in the gender identification task; furthermore, they were rated as less prototypically masculine/feminine. Conclusions Perceptual speech naturalness for both transfeminine and transmasculine speakers is strongly associated with gender cues in spontaneous speech. Training to align a speaker's voice with their gender identity may concurrently improve perceptual speech naturalness. Supplemental Material https://doi.org/10.23641/asha.12543158

Download Full-text

Cultural and Linguistic Adaptation of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) Into Hindi

Journal of Speech Language and Hearing Research ◽

10.1044/2020_jslhr-20-00348 ◽

2020 ◽

Vol 63 (12) ◽

pp. 3974-3981

Author(s):

Ashwini Joshi ◽

Isha Baheti ◽

Vrushali Angadi

Keyword(s):

Strong Correlation ◽

Concurrent Validity ◽

Interrater Reliability ◽

Voice Quality ◽

Weak Correlation ◽

Voice Assessment ◽

Perceptual Evaluation ◽

Severity Grade ◽

Normal Voice ◽

Group A

Aim The purpose of this study was to develop and assess the reliability of a Hindi version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). Reliability was assessed by comparing Hindi CAPE-V ratings with English CAPE-V ratings and by the Grade, Roughness, Breathiness, Asthenia and Strain (GRBAS) scale. Method Hindi sentences were created to match the phonemic load of the corresponding English CAPE-V sentences. The Hindi sentences were adapted for linguistic content. The original English and adapted Hindi CAPE-V and GRBAS were completed for 33 bilingual individuals with normal voice quality. Additionally, the Hindi CAPE-V and GRBAS were completed for 13 Hindi speakers with disordered voice quality. The agreement of CAPE-V ratings was assessed between language versions, GRBAS ratings, and two rater pairs (three raters in total). Pearson product–moment correlation was completed for all comparisons. Results A strong correlation ( r > .8, p < .01) was found between the Hindi CAPE-V scores and the English CAPE-V scores for most variables in normal voice participants. A weak correlation was found for the variable of strain ( r < .2, p = .400) in the normative group. A strong correlation ( r > .6, p < .01) was found between the overall severity/grade, roughness, and breathiness scores in the GRBAS scale and the CAPE-V scale in normal and disordered voice samples. Significant interrater reliability ( r > .75) was present in overall severity and breathiness. Conclusions The Hindi version of the CAPE-V demonstrates good interrater reliability and concurrent validity with the English CAPE-V and the GRBAS. The Hindi CAPE-V can be used for the auditory-perceptual voice assessment of Hindi speakers.

Download Full-text

Instrumental Assessment in Cleft Palate Care

Perspectives on Speech Science and Orofacial Disorders ◽

10.1044/ssod23.2.49 ◽

2013 ◽

Vol 23 (2) ◽

pp. 49-61 ◽

Cited By ~ 3

Author(s):

Jamie Perry ◽

Graham Schenck

Keyword(s):

Cleft Palate ◽

New Developments ◽

Perceptual Evaluation ◽

Advantages And Disadvantages ◽

Instrumental Assessment ◽

Speech Language Pathologist ◽

Hypernasal Speech ◽

Perceptual Assessment ◽

Velopharyngeal Function ◽

And Function

Despite advances in surgical management, it is estimated that 20–30% of children with repaired cleft palate will continue to have hypernasal speech and require a second surgery to create normal velopharyngeal function (Bricknell, McFadden, & Curran, 2002; Härtel, Karsten, & Gundlach, 1994; McWilliams, 1990). A qualitative perceptual assessment by a speech-language pathologist is considered the most important step of the evaluation for children with resonance disorders (Peterson-Falzone, Hardin-Jones, & Karnell, 2010). Direct and indirect instrumental analyses should be used to confirm or validate the perceptual evaluation of an experienced speech-language pathologist (Paal, Reulbach, Strobel-Schwarthoff, Nkenke, & Schuster, 2005). The purpose of this article is to provide an overview of current instrumental assessment methods used in cleft palate care. Both direct and indirect instrumental procedures will be reviewed with descriptions of the advantages and disadvantages of each. Lastly, new developments for evaluating velopharyngeal structures and function will be provided.

Download Full-text

Supplemental Material for Perceptual Evaluation of Musicological Cues for Automatic Song Segmentation

Psychomusicology Music Mind and Brain ◽

10.1037/a0026872.supp ◽

2012 ◽

Keyword(s):

Perceptual Evaluation

Download Full-text

Natural Scene Segmentation Based on Information Fusion and Homogeneity Property

Proceedings of the 9th Joint Conference on Information Sciences (JCIS) ◽

10.2991/jcis.2006.263 ◽

2006 ◽

Cited By ~ 1

Author(s):

Heng-Da Cheng ◽

Manasi Datar ◽

Wen Ju

Keyword(s):

Information Fusion ◽

Scene Segmentation ◽

Natural Scene ◽

Homogeneity Property

Download Full-text

Faculty Opinions recommendation of Scene segmentation and attention in primate cortical areas V1 and V2.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1010606.159957 ◽

2002 ◽

Author(s):

Hidehiko Komatsu

Keyword(s):

Scene Segmentation ◽

Cortical Areas

Download Full-text

Perceptual evaluation of the effect of mismatched Fujisaki model commands and surface tone in Sesotho

10.21437/speechprosody.2014-104 ◽

2014 ◽

Author(s):

Lehlohonolo Mohasi ◽

Thomas Niesler ◽

Hansjörg Mixdorff

Keyword(s):

Perceptual Evaluation ◽

Fujisaki Model

Download Full-text

Perceptual evaluation of individual headphone compensation in binaural synthesis based on non-individual recordings

10.21437/pqs.2010-23 ◽

2010 ◽

Author(s):

Alexander Lindau ◽

Fabian Brinkmann

Keyword(s):

Perceptual Evaluation

Download Full-text

Singular Values Decomposition and Lifting Wavelet Transform for Speech Signal Embedding into Digital Image

Recent Advances in Electrical & Electronic Engineering (Formerly Recent Patents on Electrical & Electronic Engineering) ◽

10.2174/2352096511666180511151646 ◽

2019 ◽

Vol 12 (2) ◽

pp. 138-151

Author(s):

Mourad Talbi ◽

Med Salim Bouhlel

Keyword(s):

Wavelet Transform ◽

Speech Signal ◽

Signal To Noise Ratio ◽

Perceptual Quality ◽

Lifting Wavelet Transform ◽

Signal To Noise ◽

Perceptual Evaluation ◽

Lifting Wavelet ◽

Noise Ratio

Background: In this paper, we propose a secure image watermarking technique which is applied to grayscale and color images. It consists in applying the SVD (Singular Value Decomposition) in the Lifting Wavelet Transform domain for embedding a speech image (the watermark) into the host image. Methods: It also uses signature in the embedding and extraction steps. Its performance is justified by the computation of PSNR (Pick Signal to Noise Ratio), SSIM (Structural Similarity), SNR (Signal to Noise Ratio), SegSNR (Segmental SNR) and PESQ (Perceptual Evaluation Speech Quality). Results: The PSNR and SSIM are used for evaluating the perceptual quality of the watermarked image compared to the original image. The SNR, SegSNR and PESQ are used for evaluating the perceptual quality of the reconstructed or extracted speech signal compared to the original speech signal. Conclusion: The Results obtained from computation of PSNR, SSIM, SNR, SegSNR and PESQ show the performance of the proposed technique.

Download Full-text

Perceptual Evaluation of Driving Scene Segmentation

Effects of a 6-Week Straw Phonation in Water Exercise Program on the Aging Voice

Perceptual Evaluation of Speech Naturalness in Speakers of Varying Gender Identities

Cultural and Linguistic Adaptation of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) Into Hindi

Instrumental Assessment in Cleft Palate Care

Supplemental Material for Perceptual Evaluation of Musicological Cues for Automatic Song Segmentation

Natural Scene Segmentation Based on Information Fusion and Homogeneity Property

Faculty Opinions recommendation of Scene segmentation and attention in primate cortical areas V1 and V2.

Perceptual evaluation of the effect of mismatched Fujisaki model commands and surface tone in Sesotho

Perceptual evaluation of individual headphone compensation in binaural synthesis based on non-individual recordings

Singular Values Decomposition and Lifting Wavelet Transform for Speech Signal Embedding into Digital Image

Export Citation Format