word duration Latest Research Papers

Emotional Speech Processing in 3- to 12-Month-Old Infants: Influences of Emotion Categories and Acoustic Parameters

Journal of Speech Language and Hearing Research ◽

10.1044/2021_jslhr-21-00234 ◽

2022 ◽

pp. 1-14

Author(s):

Chieh Kao ◽

Maria D. Sera ◽

Yang Zhang

Keyword(s):

Speech Processing ◽

Speech Rate ◽

Voice Quality ◽

Intensity Variation ◽

Acoustic Properties ◽

Emotional Speech ◽

Spectral Centroid ◽

Acoustic Correlates ◽

Spoken Words ◽

Word Duration

Purpose: The aim of this study was to investigate infants' listening preference for emotional prosodies in spoken words and identify their acoustic correlates. Method: Forty-six 3- to-12-month-old infants ( M age = 7.6 months) completed a central fixation (or look-to-listen) paradigm in which four emotional prosodies (happy, sad, angry, and neutral) were presented. Infants' looking time to the string of words was recorded as a proxy of their listening attention. Five acoustic variables—mean fundamental frequency (F0), word duration, intensity variation, harmonics-to-noise ratio (HNR), and spectral centroid—were also analyzed to account for infants' attentiveness to each emotion. Results: Infants generally preferred affective over neutral prosody, with more listening attention to the happy and sad voices. Happy sounds with breathy voice quality (low HNR) and less brightness (low spectral centroid) maintained infants' attention more. Sad speech with shorter word duration (i.e., faster speech rate), less breathiness, and more brightness gained infants' attention more than happy speech did. Infants listened less to angry than to happy and sad prosodies, and none of the acoustic variables were associated with infants' listening interests in angry voices. Neutral words with a lower F0 attracted infants' attention more than those with a higher F0. Neither age nor sex effects were observed. Conclusions: This study provides evidence for infants' sensitivity to the prosodic patterns for the basic emotion categories in spoken words and how the acoustic properties of emotional speech may guide their attention. The results point to the need to study the interplay between early socioaffective and language development.

Brain changes underlying progression of speech motor programming impairment

Brain Communications ◽

10.1093/braincomms/fcab205 ◽

2021 ◽

Author(s):

Ramon Landin-Romero ◽

Cheng T Liang ◽

Penelope A Monroe ◽

Yuichi Higashiyama ◽

Cristian E Leyton ◽

...

Keyword(s):

Speech Production ◽

Primary Progressive Aphasia ◽

Brain Regions ◽

Motor Programming ◽

Apraxia Of Speech ◽

Progressive Aphasia ◽

Primary Progressive ◽

Word Duration ◽

Variability Index ◽

Speech Motor

Abstract Aquired apraxia of speech is a disorder that impairs speech production, despite intact peripheral neuromotor function. Its pathomechanism remains to be established. Neurodegenerative lesion models provide an unequalled opportunity to explore the neural correlates of apraxia of speech, which is present in a subset of patients diagnosed with non-semantic variants of primary progressive aphasia. The normalised pairwise variability index, an acoustic measure of speech motor programming, has shown high sensitivity and specificity for apraxia of speech in cross-sectional studies. Here, we aimed to examine the strength of the pairwise variability index and overall word duration (i.e. articulation rate) as markers of progressive motor programming deficits in primary progressive aphasia with apraxia of speech. Seventy-nine individuals diagnosed with primary progressive aphasia (39 with non-fluent variant, 40 with logopenic variant) and 40 matched healthy controls participated. Patients were followed-up annually (range 1–6 years, median number of visits = 2). All participants completed a speech assessment task and a high-resolution MRI. Our analyses investigated trajectories of speech production (e.g. pairwise variablity index and word duration) and associations with cortical atrophy in the patients. At first presentation, word duration differentiated the nonfluent and logopenic cases statistically, but the range of scores overlapped substantially across groups. Longitudinally, we observed progressive deterioration in pairwise variability index and word duration specific to the non-fluent group only. The pairwise variability index showed particularly strong associations with progressive atrophy in speech motor programming brain regions. Of novelty, our results uncovered a key role of the right frontal gyrus in underpinning speech motor programming changes in non-fluent cases, highlighting the importance of right brain regions in responding to progressive neurological changes in the speech motor network. Taken together, our findings validate the use of a new metric, the pairwise variability index, as a robust marker of apraxia of speech in contrast to more generic measures of speaking rate. Sensitive/specific neuroimaging biomarkers of the emergence and progression of speech impairments will be useful to inform theories of the pathomechanisms underpinning impaired speech motor control. Our findings justify developing more sensitive measures of rhythmic temporal control of speech that may enable confident detection of emerging speech disturbances and more sensitive tracking of intervention-related changes for pharmacological, neuromodulatory, and behavioural interventions. A more reliable detection of speech disturbances has relevance for patient care, with predominance of progressive apraxia of speech a high-risk factor for later diagnosis of progressive supranuclear palsy or corticobasal degeneration.

The Interaction of Word Complexity and Word Duration in an Agglutinative Language

10.21437/interspeech.2021-594 ◽

2021 ◽

Author(s):

Mária Gósy ◽

Kálmán Abari

Keyword(s):

Word Duration ◽

Agglutinative Language

Automated Analysis of Digitized Letter Fluency Data

Frontiers in Psychology ◽

10.3389/fpsyg.2021.654214 ◽

2021 ◽

Vol 12 ◽

Author(s):

Sunghye Cho ◽

Naomi Nevler ◽

Natalia Parjane ◽

Christopher Cieri ◽

Mark Liberman ◽

...

Keyword(s):

Language Processing ◽

Test Performance ◽

Successful Implementation ◽

Articulation Rate ◽

Start Time ◽

Phonetic Similarity ◽

Word Duration ◽

Language Characteristics ◽

Semantic Distances ◽

Fluency Task

The letter-guided naming fluency task is a measure of an individual’s executive function and working memory. This study employed a novel, automated, quantifiable, and reproducible method to investigate how language characteristics of words produced during a fluency task are related to fluency performance, inter-word response time (RT), and over task duration using digitized F-letter-guided fluency recordings produced by 76 young healthy participants. Our automated algorithm counted the number of correct responses from the transcripts of the F-letter fluency data, and individual words were rated for concreteness, ambiguity, frequency, familiarity, and age of acquisition (AoA). Using a forced aligner, the transcripts were automatically aligned with the corresponding audio recordings. We measured inter-word RT, word duration, and word start time from the forced alignments. Articulation rate was also computed. Phonetic and semantic distances between two consecutive F-letter words were measured. We found that total F-letter score was significantly correlated with the mean values of word frequency, familiarity, AoA, word duration, phonetic similarity, and articulation rate; total score was also correlated with an individual’s standard deviation of AoA, familiarity, and phonetic similarity. RT was negatively correlated with frequency and ambiguity of F-letter words and was positively correlated with AoA, number of phonemes, and phonetic and semantic distances. Lastly, the frequency, ambiguity, AoA, number of phonemes, and semantic distance of words produced significantly changed over time during the task. The method employed in this paper demonstrates the successful implementation of our automated language processing pipelines in a standardized neuropsychological task. This novel approach captures subtle and rich language characteristics during test performance that enhance informativeness and cannot be extracted manually without massive effort. This work will serve as the reference for letter-guided category fluency production similarly acquired in neurodegenerative patients.

Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer’s dementia

Alzheimer s Research & Therapy ◽

10.1186/s13195-021-00848-x ◽

2021 ◽

Vol 13 (1) ◽

Author(s):

Anthony Yeung ◽

Andrea Iaboni ◽

Elizabeth Rochon ◽

Monica Lavoie ◽

Calvin Santiago ◽

...

Keyword(s):

Cognitive Impairment ◽

Natural Language Processing ◽

Mild Cognitive Impairment ◽

Natural Language ◽

Language Processing ◽

Neurodegenerative Disorders ◽

Speech Analysis ◽

Word Finding ◽

Speech Characteristics ◽

Word Duration

Abstract Background Language impairment is an important marker of neurodegenerative disorders. Despite this, there is no universal system of terminology used to describe these impairments and large inter-rater variability can exist between clinicians assessing language. The use of natural language processing (NLP) and automated speech analysis (ASA) is emerging as a novel and potentially more objective method to assess language in individuals with mild cognitive impairment (MCI) and Alzheimer’s dementia (AD). No studies have analyzed how variables extracted through NLP and ASA might also be correlated to language impairments identified by a clinician. Methods Audio recordings (n=30) from participants with AD, MCI, and controls were rated by clinicians for word-finding difficulty, incoherence, perseveration, and errors in speech. Speech recordings were also transcribed, and linguistic and acoustic variables were extracted through NLP and ASA. Correlations between clinician-rated speech characteristics and the variables were compared using Spearman’s correlation. Exploratory factor analysis was applied to find common factors between variables for each speech characteristic. Results Clinician agreement was high in three of the four speech characteristics: word-finding difficulty (ICC = 0.92, p<0.001), incoherence (ICC = 0.91, p<0.001), and perseveration (ICC = 0.88, p<0.001). Word-finding difficulty and incoherence were useful constructs at distinguishing MCI and AD from controls, while perseveration and speech errors were less relevant. Word-finding difficulty as a construct was explained by three factors, including number and duration of pauses, word duration, and syntactic complexity. Incoherence was explained by two factors, including increased average word duration, use of past tense, and changes in age of acquisition, and more negative valence. Conclusions Variables extracted through automated acoustic and linguistic analysis of MCI and AD speech were significantly correlated with clinician ratings of speech and language characteristics. Our results suggest that correlating NLP and ASA with clinician observations is an objective and novel approach to measuring speech and language changes in neurodegenerative disorders.

The acquisition of prosodic marking of narrow focus in Central Swedish

Journal of Child Language ◽

10.1017/s0305000920000847 ◽

2021 ◽

pp. 1-26

Author(s):

Anna Sara H. ROMØREN ◽

Aoju CHEN

Keyword(s):

Final Position ◽

High Tone ◽

Pitch Range ◽

West Germanic ◽

Narrow Focus ◽

Word Duration ◽

Focus Marking

Abstract We investigated how Central Swedish-speaking four to eleven-year-old children acquire the prosodic marking of narrow focus, compared to adult controls. Three measurements were analysed: placement of the prominence-marking high tone (prominence H), pitch range effects of the prominence H, and word duration. Subject-verb-object sentences were elicited in sentence-medial and sentence-final focus conditions via a semi-spontaneous elicitation task. The children largely performed in an adult-like manner already at four to five: they predominantly added prominence H to focal words and avoided this tone post-focally in both sentence-medial and sentence-final position. The placement or avoidance of prominence H had largely the same effects on pitch range for children and adults. Finally, the four to eight-year-olds also increased the duration of the focal word, similar to adults. Hence, Central Swedish-speaking children master the use of prosody for focus marking at an earlier age, compared to children acquiring a West Germanic language.

The Effects of Word Frequency and Word Probability on Speech Rhythm in Dysarthria

Journal of Speech Language and Hearing Research ◽

10.1044/2020_jslhr-19-00389 ◽

2020 ◽

Vol 63 (9) ◽

pp. 2833-2845

Author(s):

Lotte Eijk ◽

Annalise Fletcher ◽

Megan McAuliffe ◽

Esther Janse

Keyword(s):

New Zealand ◽

Word Frequency ◽

Significant Interaction ◽

Full Model ◽

Speech Rhythm ◽

Unique Variance ◽

Muscle Movement ◽

Word Duration

Purpose In healthy speakers, the more frequent and probable a word is in its context, the shorter the word tends to be. This study investigated whether these probabilistic effects were similarly sized for speakers with dysarthria of different severities. Method Fifty-six speakers of New Zealand English (42 speakers with dysarthria and 14 healthy speakers) were recorded reading the Grandfather Passage. Measurements of word duration, frequency, and transitional word probability were taken. Results As hypothesized, words with a higher frequency and probability tended to be shorter in duration. There was also a significant interaction between word frequency and speech severity. This indicated that the more severe the dysarthria, the smaller the effects of word frequency on speakers' word durations. Transitional word probability also interacted with speech severity, but did not account for significant unique variance in the full model. Conclusions These results suggest that, as the severity of dysarthria increases, the duration of words is less affected by probabilistic variables. These findings may be due to reductions in the control and execution of muscle movement exhibited by speakers with dysarthria.

Practice and experience predict coarticulation in child speech

10.31234/osf.io/vwhtk ◽

2020 ◽

Author(s):

Meg Cychosz ◽

Benjamin Munson ◽

jan edwards

Keyword(s):

Motor Control ◽

Receptive Language ◽

Speech Development ◽

Fine Motor ◽

Language Experience ◽

Phonological Representations ◽

Word Duration ◽

Child Speech ◽

Adjacent Segments ◽

Adult Word Count

Much research in child speech development suggests that young children coarticulate more than adults. There are multiple, not mutually-exclusive, explanations for this pattern. For example, children may coarticulate more because they are limited by immature motor control. Or they may coarticulate more if they initially represent phonological segments in larger, more holistic units such as syllables or feet. We tested the importance of several different explanations for coarticulation in child speech by evaluating how four-year-olds' language experience, speech practice, and speech planning predicted their coarticulation between adjacent segments in real words and paired nonwords. Children with larger vocabularies coarticulated less, especially in real words, though there were no reliable coarticulatory differences between real words and nonwords after controlling for word duration. Children who vocalized more throughout a daylong audio recording also coarticulated less. Quantity of child vocalizations was more predictive of the degree of children's coarticulation than a measure of receptive language experience, adult word count. Overall, these results suggest strong roles for children's phonological representations and speech practice, as well as their immature fine motor control, for coarticulatory development.

Five-year-olds produce prosodic cues to distinguish compounds from lists in Australian English

Journal of Child Language ◽

10.1017/s0305000920000227 ◽

2020 ◽

pp. 1-19

Author(s):

Ivan YUEN ◽

Nan XU RATTANASONE ◽

Elaine SCHMIDT ◽

Gretel MACDONALD ◽

Rebecca HOLT ◽

...

Keyword(s):

Ice Cream ◽

Acoustic Cues ◽

Prosodic Cues ◽

Production Experiment ◽

Word Duration ◽

English Speaking ◽

Australian English ◽

Prosodic Structures ◽

Developing Knowledge

Abstract Although previous research has indicated that five-year-olds can use acoustic cues to disambiguate compounds (N1 + N2) from lists (N1, N2) (e.g., ‘ice-cream’ vs. ‘ice, cream’) (Yoshida & Katz, 2004, 2006), their productions are not yet fully adult-like (Wells, Peppé & Goulandris, 2004). The goal of this study was to examine this issue in Australian English-speaking children, with a focus on their use of F0, word duration, and pauses. Twenty-four five-year-olds and 20 adults participated in an elicited production experiment. Like adults, children produced distinct F0 patterns for the two structures. They also used longer word durations and more pauses in lists compared to compounds, indicating the presence of a boundary in lists. However, unlike adults, they also inappropriately inserted more pauses within the compound, suggesting the presence of a boundary in compounds as well. The implications for understanding children's developing knowledge of how to map acoustic cues to prosodic structures are discussed.

Masking auditory feedback does not eliminate repetition reduction

10.31234/osf.io/xydm4 ◽

2019 ◽

Author(s):

Cassandra Leigh Jacobs ◽

Torrey M. Loucks ◽

Duane Watson ◽

Gary S. Dell

Keyword(s):

Auditory Feedback ◽

Multiple Sources ◽

Audience Design ◽

Somatosensory Feedback ◽

Word Duration ◽

Production Mechanisms ◽

Internal Production

Repetition reduces word duration. Explanations of this process have appealed to audience design, internal production mechanisms, and combinations thereof (e.g. Kahn & Arnold, 2015). Jacobs, Yiu, Watson, and Dell (2015) proposed the auditory feedback hypothesis, which states that speakers must hear a word, produced either by themselves or another speaker, in order for duration reduction on a subsequent production. We conducted a strong test of the auditory feedback hypothesis in two experiments, in which we masked auditory feedback and whispering to prevent speakers from hearing themselves fully. Both experiments showed that despite limiting the sources of normal feedback, repetition reduction was observed to equal extents in masked and unmasked conditions, suggesting that repetition reduction may be supported by multiple sources, such as somatosensory feedback and feedforward signals, depending on their availability.

word duration
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Emotional Speech Processing in 3- to 12-Month-Old Infants: Influences of Emotion Categories and Acoustic Parameters

Brain changes underlying progression of speech motor programming impairment

The Interaction of Word Complexity and Word Duration in an Agglutinative Language

Automated Analysis of Digitized Letter Fluency Data

Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer’s dementia

The acquisition of prosodic marking of narrow focus in Central Swedish

The Effects of Word Frequency and Word Probability on Speech Rhythm in Dysarthria

Practice and experience predict coarticulation in child speech

Five-year-olds produce prosodic cues to distinguish compounds from lists in Australian English

Masking auditory feedback does not eliminate repetition reduction

Export Citation Format

word durationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Emotional Speech Processing in 3- to 12-Month-Old Infants: Influences of Emotion Categories and Acoustic Parameters

Brain changes underlying progression of speech motor programming impairment

The Interaction of Word Complexity and Word Duration in an Agglutinative Language

Automated Analysis of Digitized Letter Fluency Data

Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer’s dementia

The acquisition of prosodic marking of narrow focus in Central Swedish

The Effects of Word Frequency and Word Probability on Speech Rhythm in Dysarthria

Practice and experience predict coarticulation in child speech

Five-year-olds produce prosodic cues to distinguish compounds from lists in Australian English

Masking auditory feedback does not eliminate repetition reduction

word duration
Recently Published Documents