COSMO-Onset: A Neurally-Inspired Computational Model of Spoken Word Recognition, Combining Top-Down Prediction and Bottom-Up Detection of Syllabic Onsets

2021 ◽  
Vol 15 ◽  
Author(s):  
Mamady Nabé ◽  
Jean-Luc Schwartz ◽  
Julien Diard

Recent neurocognitive models commonly consider speech perception as a hierarchy of processes, each corresponding to specific temporal scales of collective oscillatory processes in the cortex: 30–80 Hz gamma oscillations in charge of phonetic analysis, 4–9 Hz theta oscillations in charge of syllabic segmentation, 1–2 Hz delta oscillations processing prosodic/syntactic units, and the 15–20 Hz beta channel possibly involved in top-down predictions. Several recent neuro-computational models thus feature theta oscillations, driven by the speech acoustic envelope, to achieve syllabic parsing before lexical access. However, it is unlikely that such syllabic parsing, performed in a purely bottom-up manner from envelope variations, would be fully efficient in all situations, especially in adverse sensory conditions. We present a new probabilistic model of spoken word recognition, called COSMO-Onset, in which syllabic parsing relies on the fusion of top-down, lexical prediction of onset events with bottom-up onset detection from the acoustic envelope. We report preliminary simulations analyzing how the model performs syllabic parsing and phone, syllable, and word recognition. We show that, while purely bottom-up onset detection is sufficient for word recognition in nominal conditions, top-down prediction of syllabic onset events allows the model to overcome challenging adverse conditions, such as a degraded acoustic envelope that leads to spurious or missing onset events in the sensory signal. This suggests a possible computational, functional role for top-down predictive processes during speech recognition, consistent with recent models of neuronal oscillatory processes.
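As a purely illustrative aid (not the authors' published COSMO-Onset implementation), the following minimal Python sketch shows one way such a fusion could be expressed: bottom-up onset evidence is extracted from increases in the acoustic envelope, top-down lexical predictions are expressed as a distribution over expected onset times, and the two distributions are multiplied and renormalized. All function names, parameter values, and the toy envelope are hypothetical.

import numpy as np

# Minimal illustrative sketch (hypothetical, not the published COSMO-Onset code):
# fuse bottom-up onset evidence from the acoustic envelope with top-down,
# lexically predicted onset probabilities.

def bottom_up_onset_probability(envelope, threshold=0.1):
    """Per-frame onset evidence from positive changes in the acoustic envelope."""
    delta = np.diff(envelope, prepend=envelope[0])
    evidence = np.clip(delta - threshold, 0.0, None)
    total = evidence.sum()
    if total > 0:
        return evidence / total
    return np.full_like(envelope, 1.0 / len(envelope))  # uninformative if no evidence

def top_down_onset_probability(n_frames, predicted_onsets, width=3.0):
    """Lexical prediction of onset times, modelled as a mixture of Gaussians."""
    t = np.arange(n_frames)
    prob = np.zeros(n_frames)
    for mu in predicted_onsets:
        prob += np.exp(-0.5 * ((t - mu) / width) ** 2)
    return prob / prob.sum()

def fuse_onset_probabilities(bottom_up, top_down, eps=1e-9):
    """Probabilistic fusion: multiply the two distributions and renormalize."""
    fused = (bottom_up + eps) * (top_down + eps)
    return fused / fused.sum()

# Usage: a degraded envelope with a spurious bump; the top-down prediction
# helps suppress it and keeps the two true syllabic onsets.
envelope = np.zeros(100)
envelope[10:30] = 1.0   # first syllable
envelope[55:75] = 1.0   # second syllable
envelope[85:88] = 0.6   # spurious bump (degraded signal)

bu = bottom_up_onset_probability(envelope)
td = top_down_onset_probability(len(envelope), predicted_onsets=[10, 55])
fused = fuse_onset_probabilities(bu, td)
print("Most likely onset frames:", sorted(np.argsort(fused)[-2:]))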

2000 ◽  
Vol 23 (3) ◽  
pp. 337-338 ◽  
Author(s):  
William D. Marslen-Wilson

Norris et al. argue against using evidence from phonetic decision-making to support top-down feedback in lexical access, on the grounds that phonetic decision-making relies on processes outside the normal access sequence. This leaves open the possibility that bottom-up connectionist models, with some contextual constraints built into the access process, are still the preferred models of spoken-word recognition.


2010 ◽  
Vol 18 (1) ◽  
pp. 136-164 ◽  
Author(s):  
Odette Scharenborg ◽  
Lou Boves

Computational modelling has proven to be a valuable approach in developing theories of spoken-word processing. In this paper, we focus on a particular class of theories in which it is assumed that the spoken-word recognition process consists of two consecutive stages, with an ‘abstract’ discrete symbolic representation at the interface between the stages. In evaluating computational models, it is important to bring in independent arguments for the cognitive plausibility of the algorithms that are selected to compute the processes in a theory. This paper discusses the relation between behavioural studies, theories, and computational models of spoken-word recognition. We explain how computational models can be assessed in terms of the goodness of fit with the behavioural data and the cognitive plausibility of the algorithms. An in-depth analysis of several models provides insights into how computational modelling has led to improved theories and to a better understanding of the human spoken-word recognition process.
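As a hedged illustration of the "goodness of fit" notion mentioned above (not a method taken from the paper), one common approach is to compare simulated and observed behavioural measures, for example per-item recognition times, with a correlation coefficient and an error score. The numbers below are invented for the example.

import numpy as np

# Hypothetical behavioural data (ms) and model output (ms), for illustration only.
observed_rt = np.array([420.0, 510.0, 485.0, 390.0, 600.0])
simulated_rt = np.array([430.0, 495.0, 500.0, 405.0, 580.0])

r = np.corrcoef(observed_rt, simulated_rt)[0, 1]            # Pearson correlation
rmse = np.sqrt(np.mean((observed_rt - simulated_rt) ** 2))  # root-mean-square error
print(f"Pearson r = {r:.3f}, RMSE = {rmse:.1f} ms")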

