Analysis of emotional expression by visualization of the human and synthesized speech signal sets — A consideration of audio-visual advantage

Author(s):  
Kazuki Yamamoto ◽  
Keiji Takahashi ◽  
Kanta Kishiro ◽  
Shunsuke Sasaki ◽  
Hidehiko Hayashi
Author(s):  
G. Lan ◽  
A. S. Fadeev ◽  
A. N. Morgunov ◽  
...  

This article details the development of methods for synthesizing phonemes of the human voice based on the analytical description of individual formants. A technique for analyzing the spectrum and spectrograms of original phonemes to obtain the main amplitude-frequency characteristics of the signal components is presented. An algorithm to reconstruct a speech signal from the obtained sets of parameters is proposed. A technique for assessing the quality of the synthesized speech elements is described.
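The abstract does not reproduce the article's parameter sets or reconstruction algorithm. As a rough illustration of the general idea, a minimal formant-synthesis sketch, in which each phoneme is assumed to be described by formant center frequencies and bandwidths and the signal is reconstructed by exciting a cascade of second-order resonators with a glottal impulse train (all numeric values below are invented for the example):

```python
import math

def resonator_coeffs(f, bw, fs):
    """Second-order resonator tuned to formant frequency f (Hz)
    with bandwidth bw (Hz), at sample rate fs."""
    r = math.exp(-math.pi * bw / fs)          # pole radius from bandwidth
    theta = 2 * math.pi * f / fs              # pole angle from frequency
    b1 = 2 * r * math.cos(theta)
    b2 = -r * r
    a0 = 1 - b1 - b2                          # simple gain normalization
    return a0, b1, b2

def synthesize_vowel(formants, f0=120.0, fs=16000, dur=0.2):
    """Reconstruct a vowel-like signal: excite a cascade of formant
    resonators with an impulse train at fundamental frequency f0.
    `formants` is a list of (frequency_Hz, bandwidth_Hz) pairs."""
    n = int(fs * dur)
    period = int(fs / f0)
    signal = [1.0 if i % period == 0 else 0.0 for i in range(n)]
    for f, bw in formants:                    # run each resonator in series
        a0, b1, b2 = resonator_coeffs(f, bw, fs)
        y1 = y2 = 0.0
        out = []
        for x in signal:
            y = a0 * x + b1 * y1 + b2 * y2    # difference equation
            out.append(y)
            y2, y1 = y1, y
        signal = out
    peak = max(abs(s) for s in signal) or 1.0
    return [s / peak for s in signal]         # normalize to +/-1

# Rough /a/-like vowel from three hypothetical formants
samples = synthesize_vowel([(700, 80), (1200, 90), (2600, 120)])
```

The quality-assessment step described in the abstract would then compare the spectrum of `samples` against that of the original phoneme; that comparison is not sketched here.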


2001 ◽  
Vol 44 (5) ◽  
pp. 1052-1057 ◽  
Author(s):  
Kathryn D. R. Drager ◽  
Joe E. Reichle

The use of speech synthesis in electronic communication aids allows individuals who use augmentative and alternative communication (AAC) devices to communicate with a variety of partners. However, communication will only be effective if the speech signal is readily understood by the listener. The intelligibility of synthesized speech is influenced by a variety of factors, including the provision of context. Although the facilitative effects of context have been demonstrated extensively in studies with young adults, there are few investigations into older adults' ability to decode the synthesized speech signal. The present study investigated whether discourse context affected the intelligibility of synthesized sentences for young adult and older adult listeners. Listeners were asked to repeat 15-word sentences that were either presented in isolation or preceded by a story that set the context for the sentence. Participants correctly repeated significantly more words in the sentences when they were preceded by related sentences than when the sentences were presented in isolation. This research shows a facilitating effect of context in discourse, wherein previous words and sentences are related to later sentences, for both younger and older adult listeners. These results have direct implications for AAC system message transmission.


Author(s):  
Martin Chavant ◽  
Alexis Hervais-Adelman ◽  
Olivier Macherey

Purpose An increasing number of individuals with residual or even normal contralateral hearing are being considered for cochlear implantation. It remains unknown whether the presence of contralateral hearing is beneficial or detrimental to their perceptual learning of cochlear implant (CI)–processed speech. The aim of this experiment was to provide a first insight into this question using acoustic simulations of CI processing. Method Sixty normal-hearing listeners took part in an auditory perceptual learning experiment. Each subject was randomly assigned to one of three groups of 20 referred to as NORMAL, LOWPASS, and NOTHING. The experiment consisted of two test phases separated by a training phase. In the test phases, all subjects were tested on recognition of monosyllabic words passed through a six-channel “PSHC” vocoder presented to a single ear. In the training phase, which consisted of listening to a 25-min audio book, all subjects were also presented with the same vocoded speech in one ear but the signal they received in their other ear differed across groups. The NORMAL group was presented with the unprocessed speech signal, the LOWPASS group with a low-pass filtered version of the speech signal, and the NOTHING group with no sound at all. Results The improvement in speech scores following training was significantly smaller for the NORMAL than for the LOWPASS and NOTHING groups. Conclusions This study suggests that the presentation of normal speech in the contralateral ear reduces or slows down perceptual learning of vocoded speech but that an unintelligible low-pass filtered contralateral signal does not have this effect. Potential implications for the rehabilitation of CI patients with partial or full contralateral hearing are discussed.
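The abstract does not specify the internals of the six-channel PSHC vocoder. As a generic illustration of CI-style acoustic simulation, a minimal noise-excited channel vocoder (the band frequencies, bandwidths, and smoothing cutoff below are invented for the example and are not the study's parameters):

```python
import math
import random

def biquad_bandpass(x, f0, bw, fs):
    """Filter sequence x with a second-order resonator centred on f0 (Hz)."""
    r = math.exp(-math.pi * bw / fs)
    b1, b2 = 2 * r * math.cos(2 * math.pi * f0 / fs), -r * r
    g = 1 - r                                  # rough gain normalization
    y1 = y2 = 0.0
    out = []
    for s in x:
        y = g * s + b1 * y1 + b2 * y2
        out.append(y)
        y2, y1 = y1, y
    return out

def envelope(x, fs, cutoff=50.0):
    """Rectify and smooth with a one-pole low-pass to get the band envelope."""
    a = math.exp(-2 * math.pi * cutoff / fs)
    e = 0.0
    out = []
    for s in x:
        e = a * e + (1 - a) * abs(s)
        out.append(e)
    return out

def noise_vocode(speech, fs, bands):
    """Channel-vocode `speech`: per band, extract the temporal envelope and
    re-impose it on band-limited noise, then sum the channels."""
    rng = random.Random(0)                     # fixed seed for reproducibility
    noise = [rng.uniform(-1, 1) for _ in speech]
    out = [0.0] * len(speech)
    for f0, bw in bands:
        env = envelope(biquad_bandpass(speech, f0, bw, fs), fs)
        carrier = biquad_bandpass(noise, f0, bw, fs)
        for i in range(len(out)):
            out[i] += env[i] * carrier[i]
    return out
```

A noise carrier is used here purely for simplicity; the study's PSHC vocoder uses pulse-spreading harmonic complexes as carriers, which this sketch does not implement.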


1983 ◽  
Vol 26 (4) ◽  
pp. 516-524 ◽  
Author(s):  
Donald J. Sharf ◽  
Ralph N. Ohde

Adult and Child manifolds were generated by synthesizing 5 × 5 matrices of /Cej/ type utterances in which F2 and F3 frequencies were systematically varied. Manifold stimuli were presented to 11 graduate-level speech-language pathology students in two conditions: (a) a rating condition in which stimuli were rated on a 4-point scale between good /r/ and good /w/; and (b) a labeling condition in which stimuli were labeled as "R," "W," "distorted R," or "N" (for none of the previous choices). It was found that (a) stimuli with low F2 and high F3 frequencies were rated 1.0–1.4, those with high F2 and low F3 frequencies were rated 3.6–4.0, and those with intermediate values were rated 1.5–3.5; (b) stimuli rated 1.0–1.4 were labeled as "W" and stimuli rated 3.6–4.0 were labeled as "R"; (c) none of the Child manifold stimuli were labeled as distorted "R," and one of the Adult manifold stimuli approached the percentage of identification obtained for "R" and "W"; and (d) rating and labeling tasks were performed with a high degree of reliability.


2011 ◽  
Vol 21 (2) ◽  
pp. 44-54 ◽  
Author(s):  
Kerry Callahan Mandulak

Spectral moment analysis (SMA) is an acoustic analysis tool that shows promise for enhancing our understanding of normal and disordered speech production. It can augment auditory-perceptual analysis used to investigate differences across speakers and groups and can provide unique information regarding specific aspects of the speech signal. The purpose of this paper is to illustrate the utility of SMA as a clinical measure for both clinical speech production assessment and research applications documenting speech outcome measurements. Although acoustic analysis has become more readily available and accessible, clinicians need training with, and exposure to, acoustic analysis methods in order to integrate them into traditional methods used to assess speech production.
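Spectral moment analysis treats the normalized magnitude spectrum as a probability distribution over frequency and summarizes it with its first four moments: the centroid (mean), standard deviation, skewness, and kurtosis. A minimal sketch of that computation, assuming a magnitude spectrum has already been obtained (clinical SMA pipelines add windowing, pre-emphasis, and segment selection not shown here):

```python
import math

def spectral_moments(freqs, magnitudes):
    """First four spectral moments of a magnitude spectrum.
    Returns (centroid_Hz, sd_Hz, skewness, excess_kurtosis)."""
    total = sum(magnitudes)
    p = [m / total for m in magnitudes]        # normalize to a distribution
    m1 = sum(f * w for f, w in zip(freqs, p))  # centroid (first moment)
    m2 = sum((f - m1) ** 2 * w for f, w in zip(freqs, p))
    m3 = sum((f - m1) ** 3 * w for f, w in zip(freqs, p))
    m4 = sum((f - m1) ** 4 * w for f, w in zip(freqs, p))
    sd = math.sqrt(m2)
    skew = m3 / sd ** 3                        # standardized third moment
    kurt = m4 / m2 ** 2 - 3                    # excess kurtosis
    return m1, sd, skew, kurt

# A symmetric toy spectrum: centroid 200.0 Hz, skewness 0.0
centroid, sd, skew, kurt = spectral_moments(
    [100.0, 200.0, 300.0], [1.0, 2.0, 1.0])
```

Whether kurtosis is reported raw or as excess (minus 3, as here) varies across studies, so the convention should be stated alongside any reported values.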


2013 ◽  
Vol 61 (1) ◽  
pp. 7-15 ◽  
Author(s):  
Daniel Dittrich ◽  
Gregor Domes ◽  
Susi Loebel ◽  
Christoph Berger ◽  
Carsten Spitzer ◽  
...  

The present study examines the hypothesis of an alexithymia-associated deficit in recognizing emotional facial expressions in a clinical population. In addition, hypotheses concerning the role of specific emotion qualities and of gender differences are tested. 68 outpatient and inpatient psychiatric patients (44 women and 24 men) were assessed with the Toronto Alexithymia Scale (TAS-20), the Montgomery-Åsberg Depression Scale (MADRS), the Symptom Checklist (SCL-90-R), and the Emotional Expression Multimorph Task (EEMT). The stimuli of the face-recognition paradigm were facial expressions of basic emotions according to Ekman and Friesen, arranged into sequences of gradually increasing expression intensity. Using multiple regression analysis, we examined the association between TAS-20 score and facial emotion recognition (FER). While no significant relationship between TAS-20 score and FER emerged for the total sample or the male subsample, in the female subsample the TAS-20 score significantly predicted the total number of errors (β = .38, t = 2.055, p < 0.05) and the errors in recognizing the emotions anger and disgust (anger: β = .40, t = 2.240, p < 0.05; disgust: β = .41, t = 2.214, p < 0.05). For angry faces, the TAS-20 score explained 13.3% of the variance; for disgusted faces, 19.7%. There was no relationship between alexithymia and the time at which participants stopped the emotional sequences to give their rating (response latency). The results support the existence of an alexithymia-associated deficit in recognizing emotional facial expressions in female subjects in a heterogeneous clinical sample. 
This deficit could at least partly account for the difficulties that highly alexithymic individuals have in social interactions, and could thus explain a predisposition to psychiatric and psychosomatic disorders.


1999 ◽  
Author(s):  
Michele C. Fejfar ◽  
Lee Blonder ◽  
Michael Andrykowski
