perceptual evaluation
Recently Published Documents


TOTAL DOCUMENTS

519
(FIVE YEARS 147)

H-INDEX

34
(FIVE YEARS 4)

2022 ◽  
Vol 19 (1) ◽  
pp. 1-17
Author(s):  
Hye Ji Kim ◽  
Michael Neff ◽  
Sung-Hee Lee

Laban Movement Analysis (LMA) and its Effort element provide a conceptual framework through which we can observe, describe, and interpret the intention of movement. Effort attributes provide a link between how people move and how their movement communicates to others. It is crucial to investigate the perceptual characteristics of Effort to validate whether it can serve as an effective framework to support a wide range of applications in animation and robotics that require a system for creating or perceiving expressive variation in motion. To this end, we first constructed an Effort motion database of short video clips of five different motions: walk, sit down, pass, put, wave performed in eight ways corresponding to the extremes of the Effort elements. We then performed a perceptual evaluation to examine the perceptual consistency and perceived associations among Effort elements: Space (Indirect/Direct), Time (Sustained/Sudden), Weight (Light/Strong), and Flow (Free/Bound) that appeared in the motion stimuli. The results of the perceptual consistency evaluation indicate that although the observers do not perceive the LMA Effort element 100% as intended, true response rates of seven Effort elements are higher than false response rates except for light Effort. The perceptual consistency results showed varying tendencies by motion. The perceptual association between LMA Effort elements showed that a single LMA Effort element tends to co-occur with the elements of other factors, showing significant correlation with one or two factors (e.g., indirect and free, light and free).


2022 ◽  
Vol 12 (2) ◽  
pp. 827
Author(s):  
Ki-Seung Lee

Moderate performance in terms of intelligibility and naturalness can be obtained using previously established silent speech interface (SSI) methods. Nevertheless, a common problem associated with SSI has involved deficiencies in estimating the spectrum details, which results in synthesized speech signals that are rough, harsh, and unclear. In this study, harmonic enhancement (HE), was used during postprocessing to alleviate this problem by emphasizing the spectral fine structure of speech signals. To improve the subjective quality of synthesized speech, the difference between synthesized and actual speech was established by calculating the distance in the perceptual domains instead of using the conventional mean square error (MSE). Two deep neural networks (DNNs) were employed to separately estimate the speech spectra and the filter coefficients of HE, connected in a cascading manner. The DNNs were trained to incrementally and iteratively minimize both the MSE and the perceptual distance (PD). A feasibility test showed that the perceptual evaluation of speech quality (PESQ) and the short-time objective intelligibility measure (STOI) were improved by 17.8 and 2.9%, respectively, compared with previous methods. Subjective listening tests revealed that the proposed method yielded perceptually preferred results compared with that of the conventional MSE-based method.


Author(s):  
Seung Jin Lee

The auditory-perceptual evaluation of speech-language pathologists (SLP) in patients with voice disorders is often regarded as a touchstone in the multi-dimensional voice evaluation procedures and provides important information not available in other assessment modalities. Therefore, it is necessary for the SLPs to conduct a comprehensive and in-depth evaluation of not only voice but also the overall speech production mechanism, and they often encounter various difficulties in the evaluation process. In addition, SLPs should strive to avoid bias during the evaluation process and to maintain a wide and constant spectrum of severity for each parameter of voice quality. Lastly, it is very important for the SLPs to perform a team approach by documenting and delivering important information pertaining to auditory-perceptual characteristics in an appropriate and efficient way through close communication with the laryngologists.


2021 ◽  
Vol 12 (1) ◽  
pp. 348
Author(s):  
Vincent Martin ◽  
Isabelle Viaud-Delmon ◽  
Olivier Warusfel

Audio-only augmented reality consists of enhancing a real environment with virtual sound events. A seamless integration of the virtual events within the environment requires processing them with artificial spatialization and reverberation effects that simulate the acoustic properties of the room. However, in augmented reality, the visual and acoustic environment of the listener may not be fully mastered. This study aims to gain some insight into the acoustic cues (intensity and reverberation) that are used by the listener to form an auditory distance judgment, and to observe if these strategies can be influenced by the listener’s environment. To do so, we present a perceptual evaluation of two distance-rendering models informed by a measured Spatial Room Impulse Response. The choice of the rendering methods was made to design stimuli categories in which the availability and reproduction quality of acoustic cues are different. The proposed models have been evaluated in an online experiment gathering 108 participants who were asked to provide judgments of auditory distance about a stationary source. To evaluate the importance of environmental cues, participants had to describe the environment in which they were running the experiment, and more specifically the volume of the room and the distance to the wall they were facing. It could be shown that these context cues had a limited, but significant, influence on the perceived auditory distance.


2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Mourad Talbi ◽  
Med Salim Bouhlel

Speech enhancement has gained considerable attention in the employment of speech transmission via the communication channel, speaker identification, speech-based biometric systems, video conference, hearing aids, mobile phones, voice conversion, microphones, and so on. The background noise processing is needed for designing a successful speech enhancement system. In this work, a new speech enhancement technique based on Stationary Bionic Wavelet Transform (SBWT) and Minimum Mean Square Error (MMSE) Estimate of Spectral Amplitude is proposed. This technique consists at the first step in applying the SBWT to the noisy speech signal, in order to obtain eight noisy wavelet coefficients. The denoising of each of those coefficients is performed through the application of the denoising method based on MMSE Estimate of Spectral Amplitude. The SBWT inverse, S B W T − 1 , is applied to the obtained denoised stationary wavelet coefficients for finally obtaining the enhanced speech signal. The proposed technique’s performance is proved by the calculation of the Signal to Noise Ratio (SNR), the Segmental SNR (SSNR), and the Perceptual Evaluation of Speech Quality (PESQ).


2021 ◽  
Vol 9 (2) ◽  
pp. 742-768
Author(s):  
H. Derya Arslan ◽  
Kemal Yıldırım

Author(s):  
Mohammad Fallah ◽  
Seyyedeh Maryam Khoddami ◽  
Shohreh Jalaie ◽  
Keyvan Aghazadeh ◽  
Amin Rezaei Rad

Introduction: This study aimed to investigate the effect of Mathieson Laryngeal Manual Therapy (MLMT) following a therapeutic course in patients with primary Muscle Tension Dysphonia (MTD). Materials and Methods: Twelve patients with primary MTD participated in this study. At first, videostroboscopy and perceptual voice assessment was performed, and the Persian version of Vocal Tract Discomfort (VTDp) scale was completed. After two and a half weeks that patients received no treatment, the assessments were repeated to evaluate the effect of spontaneous recovery. For studying the effect of MLMT, it was presented in five sessions. Then, all assessments were repeated. The frequency of supraglottic activity was elicited. For the perceptual evaluation and VTDp, the Wilcoxon nonparametric test was used to study and compare the effect of spontaneous recovery and MLMT. Results: After spontaneous recovery, a significant difference was observed only in strain (P<0.05). After MLMT, the frequency of supraglottic activity decreased, and perceptual voice parameters significantly changed (P<0.05), but the VTDp showed no significant difference (P>0.05). There was no significant difference between spontanous recovery and MLMT based on the paerceptual voice evaluation and VTDp scale (P>0.05). Conclusion: The MLMT can remarkably improve the supraglottic activity and perceptual characteristics of voice in primary MTD after a therapeutic course. Further studies are recommended to confirm the effectiveness of MLMT on decreasing VTD sensations.


Author(s):  
Kanako Kondo ◽  
Masanobu Mizuta ◽  
Yoshitaka Kawai ◽  
Tohru Sogami ◽  
Shintaro Fujimura ◽  
...  

Purpose: Auditory-perceptual evaluation is essential for the assessment of voice quality. The Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) provides a standardized protocol and assessment form for clinicians to analyze the voice quality and has been adapted into several different languages. The aims of this study were to develop the Japanese version of the CAPE-V and to investigate its reliability and validity. Method: The Japanese CAPE-V consisted of the same three speech contexts (vowels, sentences, and conversation) as developed in the original English version. The sentences were designed according to the concepts of the original version and reviewed by Japanese phoneticians. To validate the usefulness of the Japanese CAPE-V, voices of 173 Japanese-speaking subjects (76 subjects with dysphonia and 97 without voice complaints) were evaluated by five experienced judges, according to the Japanese CAPE-V as well as the GRBAS (Grade, Roughness, Breathiness, Asthenia, Strain) scale. Results: The Japanese CAPE-V provided a high interrater reliability (intraclass correlation coefficients [ICCs] > .85 for all the parameters) as well as a high intrarater reliability (ICCs > .85 for all the parameters). In addition, overall severity, roughness, and breathiness in the Japanese CAPE-V were highly correlated with the corresponding dimensions in the GRBAS scale, having Spearman correlation coefficients greater than .8. Conclusion: This study demonstrated the reliability and validity of the newly developed Japanese CAPE-V as an auditory-perceptual evaluation instrument.


Sign in / Sign up

Export Citation Format

Share Document