Investigating Speechreading and Deafness

2010 · Vol 21(03) · pp. 163-168
Author(s): Edward T. Auer

Background: The visual speech signal can provide sufficient information to support successful communication. However, individual differences in the ability to make use of that information are large, and relatively little is known about their sources. Purpose: Here a body of research is reviewed regarding the development of a theoretical framework in which to study speechreading and individual differences in that ability. Based on the hypothesis that visual speech is processed via the same perceptual-cognitive machinery as auditory speech, the framework was developed by adapting one originally devised for auditory spoken word recognition. Conclusion: The evidence to date is consistent with the conclusion that visual spoken word recognition is achieved via a process similar to auditory word recognition, provided that differences in perceptual similarity are taken into account. Words that are perceptually similar to many other words and that occur infrequently in the input stream are at a distinct disadvantage within this process. The results to date are also consistent with the conclusion that deaf individuals, regardless of speechreading ability, recognize spoken words via a process similar to that used by individuals with hearing.
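The disadvantage described above, for words that are both visually confusable with many neighbors and low in frequency, can be made concrete with a toy competition measure. The sketch below is purely illustrative and is not the framework from the review; the viseme mapping, lexicon, and frequency counts are invented for the example.

```python
# Illustrative sketch: frequency-weighted lexical competition under a
# many-to-one viseme mapping (all names and values are hypothetical).
from collections import defaultdict

# Hypothetical phoneme-to-viseme mapping: visually confusable phonemes
# collapse to one viseme class, so distinct words can look identical.
VISEME = {"p": "B", "b": "B", "m": "B", "t": "D", "d": "D", "n": "D",
          "a": "V1", "e": "V2"}

def viseme_form(phonemes):
    """Collapse a phoneme string into its visually distinct form."""
    return tuple(VISEME[p] for p in phonemes)

def recognition_scores(lexicon):
    """Score each word by its frequency relative to the summed frequency
    of all words sharing its viseme form (its visual competitors)."""
    groups = defaultdict(list)
    for word, (phonemes, freq) in lexicon.items():
        groups[viseme_form(phonemes)].append((word, freq))
    scores = {}
    for members in groups.values():
        total = sum(f for _, f in members)
        for word, freq in members:
            # Low-frequency words in dense viseme groups score poorly.
            scores[word] = freq / total
    return scores

# Toy lexicon: word -> (phoneme sequence, frequency count)
lexicon = {"pat": ("pat", 50), "bat": ("bat", 200), "mat": ("mat", 30)}
print(recognition_scores(lexicon))
# All three words share one viseme form; "bat" dominates, "mat" loses.
```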

2020
Author(s): Elnaz Shafaei-Bajestan, Masoumeh Moradipour-Tari, Peter Uhrig, R. H. Baayen

A computational model of auditory word recognition is presented that enhances the model of Arnold et al. (2017). Real-valued features are extracted from the speech signal instead of discrete features. One-hot encoding of words’ meanings is replaced by real-valued semantic vectors, with a small amount of noise added to safeguard discriminability. Instead of learning with Rescorla-Wagner updating, the model uses multivariate multiple regression, which captures discrimination learning at the limit of experience. These new design features substantially improve prediction accuracy for words extracted from spontaneous conversations. They also provide enhanced temporal granularity, enabling the modeling of cohort-like effects. Clustering with t-SNE shows that the acoustic form space captures phone-like similarities and differences. Thus, wide learning with high-dimensional vectors, no hidden layers, and no abstract mediating phone-like representations is not only possible but achieves excellent performance, approximating the lower bound of human accuracy on the challenging task of isolated word recognition.
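The core of such a "wide" network can be written down in a few lines: a single linear transformation, estimated in closed form, from acoustic feature vectors to semantic vectors. The sketch below uses random toy data rather than the paper's features and embeddings, so the dimensions and accuracy are placeholders; it shows only the regression-plus-nearest-neighbor recognition logic.

```python
# Minimal sketch of "wide learning" by multivariate multiple regression:
# one linear map from real-valued acoustic features to real-valued
# semantic vectors, with no hidden layers. All data here are toy
# stand-ins, not the features or vectors used in the paper.
import numpy as np

rng = np.random.default_rng(0)
n_words, acoustic_dim, semantic_dim = 100, 300, 50

# Toy data: one acoustic feature vector and one semantic vector per word,
# with a little noise added to the semantic targets for discriminability.
F = rng.normal(size=(n_words, acoustic_dim))   # acoustic cue matrix
S = rng.normal(size=(n_words, semantic_dim))   # semantic target matrix
S += 0.01 * rng.normal(size=S.shape)           # small safeguarding noise

# The least-squares solution W is the state that incremental error-driven
# (Rescorla-Wagner / Widrow-Hoff) updating converges to in the limit.
W, *_ = np.linalg.lstsq(F, S, rcond=None)

def recognize(acoustic_vec):
    """Map a token into semantic space; return the closest word's index."""
    s_hat = acoustic_vec @ W
    sims = S @ s_hat / (np.linalg.norm(S, axis=1) * np.linalg.norm(s_hat))
    return int(np.argmax(sims))

# Accuracy on the training tokens themselves (an upper bound):
correct = sum(recognize(F[i]) == i for i in range(n_words))
print(f"{correct}/{n_words} recognized")
```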


2004 · Vol 16(4) · pp. 541-552
Author(s): Claudia K. Friedrich, Sonja A. Kotz, Angela D. Friederici, Thomas C. Gunter

Behavioral evidence suggests that spoken word recognition involves the temporary activation of multiple entries in a listener's mental lexicon. This phenomenon can be demonstrated in cross-modal word fragment priming (CMWP). In CMWP, an auditory word fragment (prime) is immediately followed by a visual word or pseudoword (target). Experiment 1 investigated ERPs for targets presented in this paradigm. Half of the targets were congruent with the prime (e.g., in the prime-target pair AM-AMBOSS [anvil]), half were not (e.g., AM-PENSUM [workload]). Lexical entries of the congruent targets should receive activation from the prime, so lexical identification of these targets should be facilitated. An ERP effect named the P350, two frontal negative ERP deflections, and the N400 were sensitive to prime-target congruency. In Experiment 2, the relation of the previously observed ERP effects to processes in a modality-independent mental lexicon was investigated by presenting the primes visually. Only the P350 effect was replicated across different fragment lengths. Therefore, the P350 is discussed as a correlate of lexical identification in a modality-independent mental lexicon.
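For readers unfamiliar with how such congruency effects are quantified, the generic recipe is to average single-trial EEG epochs per condition and compare mean amplitudes in a time window. The sketch below is a minimal, generic illustration with simulated numbers, not the authors' analysis pipeline; the sampling rate, epoch window, and 300-400 ms measurement window are assumptions.

```python
# Generic sketch of the ERP logic behind a congruency effect: average
# epochs per condition, then inspect the difference wave in a window
# (e.g., around 350 ms for a P350-like effect).
import numpy as np

fs = 250                                  # sampling rate in Hz (assumed)
times = np.arange(-0.1, 0.8, 1 / fs)     # epoch from -100 to 800 ms

def erp(epochs):
    """Average over trials: epochs has shape (n_trials, n_samples)."""
    return epochs.mean(axis=0)

def mean_amplitude(wave, lo, hi):
    """Mean amplitude of a waveform in the [lo, hi] second window."""
    mask = (times >= lo) & (times <= hi)
    return wave[mask].mean()

rng = np.random.default_rng(1)
congruent = rng.normal(size=(40, times.size))     # simulated EEG epochs
incongruent = rng.normal(size=(40, times.size))   # simulated EEG epochs

difference = erp(incongruent) - erp(congruent)
print("effect 300-400 ms:", mean_amplitude(difference, 0.30, 0.40))
```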


2012 · Vol 23(06) · pp. 464-475
Author(s): Karen Iler Kirk, Lindsay Prusick, Brian French, Chad Gotch, Laurie S. Eisenberg, ...

Under natural conditions, listeners use both auditory and visual speech cues to extract meaning from speech signals containing many sources of variability. However, traditional clinical tests of spoken word recognition routinely employ isolated words or sentences produced by a single talker in an auditory-only presentation format. Such conventional tests do not assess the more central cognitive processes (multimodal integration, perceptual normalization, and lexical discrimination) that may contribute to individual variation in spoken word recognition performance. In this article, we review our past and current research aimed at developing a series of new assessment tools for evaluating spoken word recognition in children who are deaf or hard of hearing. These measures are theoretically motivated by a current model of spoken word recognition and incorporate “real-world” stimulus variability in the form of multiple talkers and presentation formats. The goal of this research is to enhance our ability to estimate real-world listening skills and to predict benefit from sensory aid use in children with varying degrees of hearing loss.


2010 · Vol 60(1) · pp. 1-39
Author(s): Bob McMurray, Vicki M. Samelson, Sung Hee Lee, J. Bruce Tomblin

2014 · Vol 31(1) · pp. 29-52
Author(s): Satsuki Nakai, Shane Lindsay, Mitsuhiko Ota

When both members of a phonemic contrast in the L2 (second language) are perceptually mapped to a single phoneme in one’s L1 (first language), L2 words containing a member of that contrast can spuriously activate other L2 words in spoken-word recognition. For example, upon hearing cattle, Dutch speakers of English are reported to experience activation of kettle, because L1 Dutch speakers perceptually map the vowels of the two English words to a single vowel phoneme in their L1. In an auditory word-learning experiment with Greek and Japanese speakers of English, we asked whether such cross-lexical activation in L2 spoken-word recognition necessarily involves inaccurate perception by the L2 listeners, or whether it can also arise from interference from L1 phonology at an abstract level, independent of the listeners’ phonetic processing abilities. Results suggest that spurious activation of L2 words containing L2-specific contrasts in spoken-word recognition is contingent on the L2 listeners’ inadequate phonetic processing abilities.
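The mapping account in the first sentence can be illustrated mechanically: filter L2 word forms through a collapsed L1 phoneme inventory and check which words become indistinguishable. The sketch below is a toy illustration, not the study's method; the transcriptions and the Dutch-like vowel mapping are simplified assumptions.

```python
# Illustrative sketch of the mapping account: if an L1 collapses an L2
# vowel contrast, L2 words differing only in that contrast become
# indistinguishable and so activate each other.
from collections import defaultdict

# Simplified Dutch-like mapping: English /ae/ and /E/ both map to /E/.
L1_MAP = {"ae": "E", "E": "E", "k": "k", "t": "t", "l": "l", "@": "@"}

# Toy English lexicon in a rough phonemic transcription (assumed forms).
LEXICON = {"cattle": ("k", "ae", "t", "@", "l"),
           "kettle": ("k", "E", "t", "@", "l")}

def perceived(phonemes):
    """How the word sounds after filtering through the L1 inventory."""
    return tuple(L1_MAP[p] for p in phonemes)

groups = defaultdict(list)
for word, phonemes in LEXICON.items():
    groups[perceived(phonemes)].append(word)

for form, words in groups.items():
    if len(words) > 1:
        print("spurious competitors:", words)   # ['cattle', 'kettle']
```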

