scholarly journals Lexical stress representation in spoken word recognition

2021 ◽  
Author(s):  
Angeliki Andrikopoulou ◽  
Athanassios Protopapas ◽  
Amalia Arvaniti

According to a popular model of speech production, stress is underspecified in the lexicon, that is, it is specified only for words with stress patterns other than the default, termed the “default metrics” assumption. Alternatively, stress may be fully specified in the lexicon as part of every lexical representation. In the current study the two accounts are tested in the perceptual domain using behavioral and eye-tracking data in Greek. In a first experiment, cross-modal fragment priming was used in a lexical decision task. According to default metrics, priming should occur for targets with antepenultimate- or final-syllable stress but not for targets with the default penultimate-syllable stress. The same word pairs were used in two subsequent visual world experiments. Default metrics predicts an asymmetric pattern of results, namely that incoming spoken words with the default stress pattern should inhibit the activation of lexical representations with nondefault stress, whereas the converse should not be observed; that is, spoken words with nondefault stress should not inhibit representations of words with the default stress. None of the results provided support for the idea of default metrics, leading to alternative conceptualizations regarding the representation of stress.

2017 ◽  
Vol 61 (3) ◽  
pp. 430-465 ◽  
Author(s):  
Miquel Llompart ◽  
Miquel Simonet

This study investigates the production and auditory lexical processing of words involved in a patterned phonological alternation in two dialects of Catalan spoken on the island of Majorca, Spain. One of these dialects, that of Palma, merges /ɔ/ and /o/ as [o] in unstressed position, and it maintains /u/ as an independent category, [u]. In the dialect of Sóller, a small village, speakers merge unstressed /ɔ/, /o/, and /u/ to [u]. First, a production study asks whether the discrete, rule-based descriptions of the vowel alternations provided in the dialectological literature are able to account adequately for these processes: are mergers complete? Results show that mergers are complete with regards to the main acoustic cue to these vowel contrasts, that is, F1. However, minor differences are maintained for F2 and vowel duration. Second, a lexical decision task using cross-modal priming investigates the strength with which words produced in the phonetic form of the neighboring (versus one’s own) dialect activate the listeners’ lexical representations during spoken word recognition: are words within and across dialects accessed efficiently? The study finds that listeners from one of these dialects, Sóller, process their own and the neighboring forms equally efficiently, while listeners from the other one, Palma, process their own forms more efficiently than those of the neighboring dialect. This study has implications for our understanding of the role of lifelong linguistic experience on speech performance.


2019 ◽  
Vol 72 (11) ◽  
pp. 2574-2583 ◽  
Author(s):  
Julie Gregg ◽  
Albrecht W Inhoff ◽  
Cynthia M Connine

Spoken word recognition models incorporate the temporal unfolding of word information by assuming that positional match constrains lexical activation. Recent findings challenge the linearity constraint. In the visual world paradigm, Toscano, Anderson, and McMurray observed that listeners preferentially viewed a picture of a target word’s anadrome competitor (e.g., competitor bus for target sub) compared with phonologically unrelated distractors (e.g., well) or competitors sharing an overlapping vowel (e.g., sun). Toscano et al. concluded that spoken word recognition relies on coarse grain spectral similarity for mapping spoken input to a lexical representation. Our experiments aimed to replicate the anadrome effect and to test the coarse grain similarity account using competitors without vowel position overlap (e.g., competitor leaf for target flea). The results confirmed the original effect: anadrome competitor fixation curves diverged from unrelated distractors approximately 275 ms after the onset of the target word. In contrast, the no vowel position overlap competitor did not show an increase in fixations compared with the unrelated distractors. The contrasting results for the anadrome and no vowel position overlap items are discussed in terms of theoretical implications of sequential match versus coarse grain similarity accounts of spoken word recognition. We also discuss design issues (repetition of stimulus materials and display parameters) concerning the use of the visual world paradigm in making inferences about online spoken word recognition.


2020 ◽  
pp. 026765832096825
Author(s):  
Jeong-Im Han ◽  
Song Yi Kim

The present study investigated the influence of orthographic input on the recognition of second language (L2) spoken words with phonological variants, when first language (L1) and L2 have different orthographic structures. Lexical encoding for intermediate-to-advanced level Mandarin learners of Korean was assessed using masked cross-modal and within-modal priming tasks. Given that Korean has obstruent nasalization in the syllable coda, prime target pairs were created with and without such phonological variants, but spellings that were provided in the cross-modal task reflected their unaltered, nonnasalized forms. The results indicate that when L2 learners are exposed to transparent alphabetic orthography, they do not show a particular cost for spoken word recognition of L2 phonological variants as long as the variation is regular and rule-governed.


2005 ◽  
Vol 58 (2) ◽  
pp. 251-273 ◽  
Author(s):  
Wilma van Donselaar ◽  
Mariëtte Koster ◽  
Anne Cutler

Three cross-modal priming experiments examined the role of suprasegmental information in the processing of spoken words. All primes consisted of truncated spoken Dutch words. Recognition of visually presented word targets was facilitated by prior auditory presentation of the first two syllables of the same words as primes, but only if they were appropriately stressed (e.g., OKTOBER preceded by okTO-); inappropriate stress, compatible with another word (e.g., OKTOBER preceded by OCto-, the beginning of octopus), produced inhibition. Monosyllabic fragments (e.g., OC-) also produced facilitation when appropriately stressed; if inappropriately stressed, they produced neither facilitation nor inhibition. The bisyllabic fragments that were compatible with only one word produced facilitation to semantically associated words, but inappropriate stress caused no inhibition of associates. The results are explained within a model of spoken-word recognition involving competition between simultaneously activated phonological representations followed by activation of separate conceptual representations for strongly supported lexical candidates; at the level of the phonological representations, activation is modulated by both segmental and suprasegmental information.


2021 ◽  
pp. 026765832110306
Author(s):  
Félix Desmeules-Trudel ◽  
Tania S. Zamuner

Spoken word recognition depends on variations in fine-grained phonetics as listeners decode speech. However, many models of second language (L2) speech perception focus on units such as isolated syllables, and not on words. In two eye-tracking experiments, we investigated how fine-grained phonetic details (i.e. duration of nasalization on contrastive and coarticulatory nasalized vowels in Canadian French) influenced spoken word recognition in an L2, as compared to a group of native (L1) listeners. Results from L2 listeners (English-native speakers) indicated that fine-grained phonetics impacted the recognition of words, i.e. they were able to use nasalization duration variability in a way similar to L1-French listeners, providing evidence that lexical representations can be highly specified in an L2. Specifically, L2 listeners were able to distinguish minimal word pairs (differentiated by the presence of phonological vowel nasalization in French) and were able to use variability in a way approximating L1-French listeners. Furthermore, the robustness of the French “nasal vowel” category in L2 listeners depended on age of exposure. Early bilinguals displayed greater sensitivity to some ambiguity in the stimuli than late bilinguals, suggesting that early bilinguals had greater sensitivity to small variations in the signal and thus better knowledge of the phonetic cue associated with phonological vowel nasalization in French, similarly to L1 listeners.


2018 ◽  
Vol 1 (1) ◽  
pp. 31-59 ◽  
Author(s):  
Nuria Sagarra ◽  
Joseph V. Casillas

Abstract We use visual-world eye-tracking and gating methods to investigate whether Spanish monolinguals and English late learners of Spanish use prosodic cues (lexical stress) to anticipate morphological information (suffixes) during spoken word recognition, and if they do, whether L2 proficiency and working memory (WM) mediate their anticipatory abilities. Our findings show that the monolinguals used prosodic information to predict word endings in both tasks, regardless of first-syllable stress (stressed, unstressed) and structure (CV, CVC). In contrast, the beginning learners did not use prosodic information to anticipate word suffixes in any task or condition. Importantly, the advanced learners mirrored the monolinguals, except in words with first-syllable CV structure, but were slower than the monolinguals. Finally, WM was not associated with anticipatory eye movements, though results were inconclusive for offline processing. Taken together, the present study shows that suprasegmental information facilitates morphological anticipation during spoken word recognition, and that adult learners can gain anticipatory processing patterns qualitatively, but not quantitatively, similar to monolinguals.


1997 ◽  
Vol 18 (1) ◽  
pp. 85-100 ◽  
Author(s):  
Arthur Wingfield ◽  
Harold Goodglass ◽  
Kimberly C. Lindfield

ABSTRACTIn the traditional gating technique, subjects hear increasing amounts of word-onset information from spoken words until the words can be correctly identified. The experiment reported here contrasted word-onset gating with results when words were gated from their word endings. A significant recognition advantage for words gated from their onsets was demonstrated. This effect was eliminated, however, when we took into account the number of word possibilities that shared overlapping phonology and the same stress pattern as the target words at their recognition points. These results support the position that the perceptual advantage of word-initial information can be understood within a general goodness-of-fit model of spoken word recognition.


2018 ◽  
Vol 71 (2) ◽  
pp. 435-448 ◽  
Author(s):  
Samantha E. Tuft ◽  
Conor T. MᶜLennan ◽  
Maura L. Krestar

Previous spoken word recognition research using the long-term repetition-priming paradigm found performance costs for stimuli mismatching in talker identity. That is, when words were repeated across the two blocks, and the identity of the talker changed reaction times (RTs) were slower than when the repeated words were spoken by the same talker. Such performance costs, or talker effects, followed a time course, occurring only when processing was relatively slow. More recent research suggests that increased explicit and implicit attention towards the talkers can result in talker effects even during relatively fast processing. The purpose of the current study was to examine whether word meaning would influence the pattern of talker effects in an easy lexical decision task and, if so, whether results would differ depending on whether the presentation of neutral and taboo words was mixed or blocked. Regardless of presentation, participants responded to taboo words faster than neutral words. Furthermore, talker effects for the female talker emerged when participants heard both taboo and neutral words (consistent with an attention-based hypothesis), but not for participants that heard only taboo or only neutral words (consistent with the time-course hypothesis). These findings have important implications for theoretical models of spoken word recognition.


Sign in / Sign up

Export Citation Format

Share Document