Lexical stress representation in spoken word recognition

According to a popular model of speech production, stress is underspecified in the lexicon, that is, it is specified only for words with stress patterns other than the default, termed the “default metrics” assumption. Alternatively, stress may be fully specified in the lexicon as part of every lexical representation. In the current study the two accounts are tested in the perceptual domain using behavioral and eye-tracking data in Greek. In a first experiment, cross-modal fragment priming was used in a lexical decision task. According to default metrics, priming should occur for targets with antepenultimate- or final-syllable stress but not for targets with the default penultimate-syllable stress. The same word pairs were used in two subsequent visual world experiments. Default metrics predicts an asymmetric pattern of results, namely that incoming spoken words with the default stress pattern should inhibit the activation of lexical representations with nondefault stress, whereas the converse should not be observed; that is, spoken words with nondefault stress should not inhibit representations of words with the default stress. None of the results provided support for the idea of default metrics, leading to alternative conceptualizations regarding the representation of stress.

Download Full-text

Unstressed Vowel Reduction Across Majorcan Catalan Dialects: Production and Spoken Word Recognition

Language and Speech ◽

10.1177/0023830917736019 ◽

2017 ◽

Vol 61 (3) ◽

pp. 430-465 ◽

Cited By ~ 4

Author(s):

Miquel Llompart ◽

Miquel Simonet

Keyword(s):

Word Recognition ◽

Lexical Processing ◽

Spoken Word Recognition ◽

Spoken Word ◽

Decision Task ◽

Vowel Duration ◽

Lexical Representations ◽

Production Study ◽

Speech Performance

This study investigates the production and auditory lexical processing of words involved in a patterned phonological alternation in two dialects of Catalan spoken on the island of Majorca, Spain. One of these dialects, that of Palma, merges /ɔ/ and /o/ as [o] in unstressed position, and it maintains /u/ as an independent category, [u]. In the dialect of Sóller, a small village, speakers merge unstressed /ɔ/, /o/, and /u/ to [u]. First, a production study asks whether the discrete, rule-based descriptions of the vowel alternations provided in the dialectological literature are able to account adequately for these processes: are mergers complete? Results show that mergers are complete with regards to the main acoustic cue to these vowel contrasts, that is, F1. However, minor differences are maintained for F2 and vowel duration. Second, a lexical decision task using cross-modal priming investigates the strength with which words produced in the phonetic form of the neighboring (versus one’s own) dialect activate the listeners’ lexical representations during spoken word recognition: are words within and across dialects accessed efficiently? The study finds that listeners from one of these dialects, Sóller, process their own and the neighboring forms equally efficiently, while listeners from the other one, Palma, process their own forms more efficiently than those of the neighboring dialect. This study has implications for our understanding of the role of lifelong linguistic experience on speech performance.

Download Full-text

Re-reconsidering the role of temporal order in spoken word recognition

Quarterly Journal of Experimental Psychology ◽

10.1177/1747021819849512 ◽

2019 ◽

Vol 72 (11) ◽

pp. 2574-2583 ◽

Cited By ~ 1

Author(s):

Julie Gregg ◽

Albrecht W Inhoff ◽

Cynthia M Connine

Keyword(s):

Word Recognition ◽

Spoken Word Recognition ◽

Spoken Word ◽

Lexical Representation ◽

Coarse Grain ◽

Visual World ◽

Visual World Paradigm ◽

Lexical Activation ◽

Stimulus Materials

Spoken word recognition models incorporate the temporal unfolding of word information by assuming that positional match constrains lexical activation. Recent findings challenge the linearity constraint. In the visual world paradigm, Toscano, Anderson, and McMurray observed that listeners preferentially viewed a picture of a target word’s anadrome competitor (e.g., competitor bus for target sub) compared with phonologically unrelated distractors (e.g., well) or competitors sharing an overlapping vowel (e.g., sun). Toscano et al. concluded that spoken word recognition relies on coarse grain spectral similarity for mapping spoken input to a lexical representation. Our experiments aimed to replicate the anadrome effect and to test the coarse grain similarity account using competitors without vowel position overlap (e.g., competitor leaf for target flea). The results confirmed the original effect: anadrome competitor fixation curves diverged from unrelated distractors approximately 275 ms after the onset of the target word. In contrast, the no vowel position overlap competitor did not show an increase in fixations compared with the unrelated distractors. The contrasting results for the anadrome and no vowel position overlap items are discussed in terms of theoretical implications of sequential match versus coarse grain similarity accounts of spoken word recognition. We also discuss design issues (repetition of stimulus materials and display parameters) concerning the use of the visual world paradigm in making inferences about online spoken word recognition.

Download Full-text

Orthographic activation in spoken word recognition of L2 phonological variants

Second language Research ◽

10.1177/0267658320968253 ◽

2020 ◽

pp. 026765832096825

Author(s):

Jeong-Im Han ◽

Song Yi Kim

Keyword(s):

Second Language ◽

Word Recognition ◽

Spoken Word Recognition ◽

First Language ◽

Spoken Word ◽

Advanced Level ◽

Spoken Words ◽

L2 Learners ◽

L1 And L2 ◽

Prime Target

The present study investigated the influence of orthographic input on the recognition of second language (L2) spoken words with phonological variants, when first language (L1) and L2 have different orthographic structures. Lexical encoding for intermediate-to-advanced level Mandarin learners of Korean was assessed using masked cross-modal and within-modal priming tasks. Given that Korean has obstruent nasalization in the syllable coda, prime target pairs were created with and without such phonological variants, but spellings that were provided in the cross-modal task reflected their unaltered, nonnasalized forms. The results indicate that when L2 learners are exposed to transparent alphabetic orthography, they do not show a particular cost for spoken word recognition of L2 phonological variants as long as the variation is regular and rule-governed.

Download Full-text

Exploring the Role of Lexical stress in Lexical Recognition

The Quarterly Journal of Experimental Psychology Section A ◽

10.1080/02724980343000927 ◽

2005 ◽

Vol 58 (2) ◽

pp. 251-273 ◽

Cited By ~ 69

Author(s):

Wilma van Donselaar ◽

Mariëtte Koster ◽

Anne Cutler

Keyword(s):

Word Recognition ◽

Spoken Word Recognition ◽

Spoken Word ◽

Lexical Stress ◽

Auditory Presentation ◽

Phonological Representations ◽

Conceptual Representations ◽

No Inhibition ◽

Spoken Words

Three cross-modal priming experiments examined the role of suprasegmental information in the processing of spoken words. All primes consisted of truncated spoken Dutch words. Recognition of visually presented word targets was facilitated by prior auditory presentation of the first two syllables of the same words as primes, but only if they were appropriately stressed (e.g., OKTOBER preceded by okTO-); inappropriate stress, compatible with another word (e.g., OKTOBER preceded by OCto-, the beginning of octopus), produced inhibition. Monosyllabic fragments (e.g., OC-) also produced facilitation when appropriately stressed; if inappropriately stressed, they produced neither facilitation nor inhibition. The bisyllabic fragments that were compatible with only one word produced facilitation to semantically associated words, but inappropriate stress caused no inhibition of associates. The results are explained within a model of spoken-word recognition involving competition between simultaneously activated phonological representations followed by activation of separate conceptual representations for strongly supported lexical candidates; at the level of the phonological representations, activation is modulated by both segmental and suprasegmental information.

Download Full-text

Spoken word recognition in a second language: The importance of phonetic details

Second language Research ◽

10.1177/02676583211030604 ◽

2021 ◽

pp. 026765832110306

Author(s):

Félix Desmeules-Trudel ◽

Tania S. Zamuner

Keyword(s):

Second Language ◽

Word Recognition ◽

Native Speakers ◽

Spoken Word Recognition ◽

Spoken Word ◽

Lexical Representations ◽

Canadian French ◽

Fine Grained ◽

Minimal Word ◽

Vowel Nasalization

Spoken word recognition depends on variations in fine-grained phonetics as listeners decode speech. However, many models of second language (L2) speech perception focus on units such as isolated syllables, and not on words. In two eye-tracking experiments, we investigated how fine-grained phonetic details (i.e. duration of nasalization on contrastive and coarticulatory nasalized vowels in Canadian French) influenced spoken word recognition in an L2, as compared to a group of native (L1) listeners. Results from L2 listeners (English-native speakers) indicated that fine-grained phonetics impacted the recognition of words, i.e. they were able to use nasalization duration variability in a way similar to L1-French listeners, providing evidence that lexical representations can be highly specified in an L2. Specifically, L2 listeners were able to distinguish minimal word pairs (differentiated by the presence of phonological vowel nasalization in French) and were able to use variability in a way approximating L1-French listeners. Furthermore, the robustness of the French “nasal vowel” category in L2 listeners depended on age of exposure. Early bilinguals displayed greater sensitivity to some ambiguity in the stimuli than late bilinguals, suggesting that early bilinguals had greater sensitivity to small variations in the signal and thus better knowledge of the phonetic cue associated with phonological vowel nasalization in French, similarly to L1 listeners.

Download Full-text

Suprasegmental information cues morphological anticipation during L1/L2 lexical access

Journal of Second Language Studies ◽

10.1075/jsls.17026.sag ◽

2018 ◽

Vol 1 (1) ◽

pp. 31-59 ◽

Cited By ~ 1

Author(s):

Nuria Sagarra ◽

Joseph V. Casillas

Keyword(s):

Word Recognition ◽

Spoken Word Recognition ◽

Spoken Word ◽

Advanced Learners ◽

Anticipatory Eye Movements ◽

Prosodic Cues ◽

Information Cues ◽

Beginning Learners ◽

Offline Processing ◽

Syllable Stress

Abstract We use visual-world eye-tracking and gating methods to investigate whether Spanish monolinguals and English late learners of Spanish use prosodic cues (lexical stress) to anticipate morphological information (suffixes) during spoken word recognition, and if they do, whether L2 proficiency and working memory (WM) mediate their anticipatory abilities. Our findings show that the monolinguals used prosodic information to predict word endings in both tasks, regardless of first-syllable stress (stressed, unstressed) and structure (CV, CVC). In contrast, the beginning learners did not use prosodic information to anticipate word suffixes in any task or condition. Importantly, the advanced learners mirrored the monolinguals, except in words with first-syllable CV structure, but were slower than the monolinguals. Finally, WM was not associated with anticipatory eye movements, though results were inconclusive for offline processing. Taken together, the present study shows that suprasegmental information facilitates morphological anticipation during spoken word recognition, and that adult learners can gain anticipatory processing patterns qualitatively, but not quantitatively, similar to monolinguals.

Download Full-text

Word recognition from acoustic onsets and acoustic offsets: Effects of cohort size and syllabic stress

Applied Psycholinguistics ◽

10.1017/s0142716400009887 ◽

1997 ◽

Vol 18 (1) ◽

pp. 85-100 ◽

Cited By ~ 20

Author(s):

Arthur Wingfield ◽

Harold Goodglass ◽

Kimberly C. Lindfield

Keyword(s):

Word Recognition ◽

Goodness Of Fit ◽

Spoken Word Recognition ◽

Spoken Word ◽

Stress Pattern ◽

Cohort Size ◽

Initial Information ◽

Spoken Words ◽

Word Onset ◽

Target Words

ABSTRACTIn the traditional gating technique, subjects hear increasing amounts of word-onset information from spoken words until the words can be correctly identified. The experiment reported here contrasted word-onset gating with results when words were gated from their word endings. A significant recognition advantage for words gated from their onsets was demonstrated. This effect was eliminated, however, when we took into account the number of word possibilities that shared overlapping phonology and the same stress pattern as the target words at their recognition points. These results support the position that the perceptual advantage of word-initial information can be understood within a general goodness-of-fit model of spoken word recognition.

Download Full-text

Hearing taboo words can result in early talker effects in word recognition for female listeners

Quarterly Journal of Experimental Psychology ◽

10.1080/17470218.2016.1253757 ◽

2018 ◽

Vol 71 (2) ◽

pp. 435-448 ◽

Cited By ~ 2

Author(s):

Samantha E. Tuft ◽

Conor T. MᶜLennan ◽

Maura L. Krestar

Keyword(s):

Word Recognition ◽

Time Course ◽

Spoken Word Recognition ◽

Word Meaning ◽

Reaction Times ◽

Theoretical Models ◽

Spoken Word ◽

Decision Task ◽

Fast Processing ◽

Taboo Words

Previous spoken word recognition research using the long-term repetition-priming paradigm found performance costs for stimuli mismatching in talker identity. That is, when words were repeated across the two blocks, and the identity of the talker changed reaction times (RTs) were slower than when the repeated words were spoken by the same talker. Such performance costs, or talker effects, followed a time course, occurring only when processing was relatively slow. More recent research suggests that increased explicit and implicit attention towards the talkers can result in talker effects even during relatively fast processing. The purpose of the current study was to examine whether word meaning would influence the pattern of talker effects in an easy lexical decision task and, if so, whether results would differ depending on whether the presentation of neutral and taboo words was mixed or blocked. Regardless of presentation, participants responded to taboo words faster than neutral words. Furthermore, talker effects for the female talker emerged when participants heard both taboo and neutral words (consistent with an attention-based hypothesis), but not for participants that heard only taboo or only neutral words (consistent with the time-course hypothesis). These findings have important implications for theoretical models of spoken word recognition.

Download Full-text