Do "Autistic" Traits Predict Audiovisual Integration in Speech Perception?

2012 ◽  
Author(s):  
Joseph D. W. Stephens ◽  
Julian L. Scrivens ◽  
Amy A. Overman

2005 ◽  
Vol 167 (1) ◽  
pp. 66-75 ◽  
Author(s):  
Maurizio Gentilucci ◽  
Luigi Cattaneo

2018 ◽  
Vol 31 (6) ◽  
pp. 523-536 ◽  
Author(s):  
Ayako Yaguchi ◽  
Souta Hidaka

Autism spectrum disorder (ASD) is a neurodevelopmental disorder characterized by deficits in social communication and interaction, and by restricted interests and behavior patterns. These characteristics are considered to be continuously distributed across the general population. People with ASD show atypical temporal processing in multisensory integration. In the flash–beep illusion, a single flash can be illusorily perceived as multiple flashes when multiple auditory beeps are presented concurrently. Some studies have reported that people with ASD have a wider temporal binding window and greater integration than typically developed people; others have found the opposite or inconsistent tendencies. Here, we investigated the relationships between the manner of the flash–beep illusion and the various dimensions of ASD traits by estimating the degree of typically developed participants’ ASD traits, across five subscales, using the Autism-Spectrum Quotient. We found that stronger ASD traits on the communication and social skill subscales were associated with a wider and a narrower temporal binding window, respectively. These results suggest that specific ASD traits are differently involved in particular temporal binding processes of audiovisual integration.
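To make the notion of a temporal binding window concrete, the sketch below shows one common way such a window can be estimated: fit a Gaussian to the rate of illusory double-flash reports as a function of audiovisual asynchrony, take its width as the window size, and correlate the widths with an AQ subscale score. The SOAs, response rates, AQ scores, and the Gaussian form are illustrative assumptions, not the study's actual procedure or data.

```python
# Illustrative sketch (not the study's code or data): estimating a temporal
# binding window (TBW) by fitting a Gaussian to illusory double-flash rates
# across stimulus-onset asynchronies (SOAs), then correlating window width
# with an Autism-Spectrum Quotient subscale score.
import numpy as np
from scipy.optimize import curve_fit
from scipy.stats import pearsonr

def gaussian(soa_ms, amplitude, sigma):
    """Illusion rate as a function of SOA, centered on synchrony (0 ms)."""
    return amplitude * np.exp(-soa_ms**2 / (2.0 * sigma**2))

def fit_tbw_width(soas_ms, illusion_rates):
    """Return the fitted window width (sigma, in ms) for one participant."""
    (_, sigma), _ = curve_fit(gaussian, soas_ms, illusion_rates,
                              p0=[0.5, 100.0],
                              bounds=([0.0, 1.0], [1.0, 1000.0]))
    return sigma

# Hypothetical data: one row of illusion rates per participant at each SOA.
soas_ms = np.array([-300.0, -150.0, 0.0, 150.0, 300.0])
rates = np.array([[0.1, 0.4, 0.8, 0.5, 0.1],
                  [0.0, 0.2, 0.6, 0.3, 0.1],
                  [0.2, 0.5, 0.9, 0.6, 0.2]])
aq_communication = np.array([4.0, 2.0, 7.0])  # hypothetical subscale scores

widths = np.array([fit_tbw_width(soas_ms, r) for r in rates])
r_value, p_value = pearsonr(aq_communication, widths)
print(f"TBW widths (ms): {widths.round(1)}; r = {r_value:.2f}, p = {p_value:.3f}")
```

A wider fitted window means the illusion persists at larger asynchronies; a positive correlation with a subscale score would correspond to the "wider window with stronger traits" pattern described above.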


1995 ◽  
Vol 48 (2) ◽  
pp. 320-333 ◽  
Author(s):  
Eugen Diesch

If a place-of-articulation contrast is created between the auditory and the visual component syllables of videotaped speech, the syllable that listeners report having heard frequently differs phonetically from the auditory component. These “McGurk effects”, as they have come to be called, show that speech perception may involve some kind of intermodal process. There are two classes of these phenomena: fusions and combinations. Perception of the syllable /da/ when auditory /ba/ and visual /ga/ are presented provides a clear example of the former, and perception of the string /bga/ after presentation of auditory /ga/ and visual /ba/ an unambiguous instance of the latter. Besides perceptual fusions and combinations, responses in which listeners report hearing the visually presented component syllable also show an influence of vision on audition. It is argued that these “visual” responses arise from basically the same underlying processes that yield fusions and combinations, respectively. In the present study, the visual component of audiovisually incongruent CV syllables was presented in either the left or the right visual hemifield. Audiovisual fusion responses showed a left-hemifield advantage, and audiovisual combination responses a right-hemifield advantage. This finding suggests that the process of audiovisual integration differs between audiovisual fusions and combinations and, furthermore, that the two cerebral hemispheres contribute differentially to the two classes of response.


Cognition ◽  
2008 ◽  
Vol 109 (1) ◽  
pp. 157-162 ◽  
Author(s):  
Mary E. Stewart ◽  
Mitsuhiko Ota

2012 ◽  
Vol 25 (0) ◽  
pp. 105 ◽  
Author(s):  
Tobias Søren Andersen

Seeing the talking face can influence the phoneme perceived from the voice. This facilitates speech perception in the natural case, where face and voice are congruent, and can cause the McGurk illusion when they are not. The classic example of the McGurk illusion is acoustic /aba/ being perceived as /ada/ when dubbed onto a face articulating /aga/. To fully understand the underlying process of integrating information across the senses, we need a computational account with predictive power. The Fuzzy Logical Model of Perception is one computational account of audiovisual integration in speech perception. Here we describe alternative accounts in which integration is based on an early, continuous internal representation onto which the phonetic classes are mapped. We show that these alternative accounts provide just as good a fit when corrected for the number of free parameters. We also show, using cross-validation, that they have greater, but not great, predictive power. Finally, we show that introducing a regularization term remedies the lack of predictive power. With regularization, models based on continuous representations have the highest predictive power.
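As a concrete illustration of the kind of computational account referred to above, the sketch below implements the standard FLMP combination rule, in which each modality contributes a degree of support in [0, 1] for every response category and the supports are multiplied and renormalized. The support values are invented for illustration; they are not fitted parameters, and the alternative continuous-representation models, the cross-validation, and the regularization step are not reproduced here.

```python
# Minimal sketch of the FLMP combination rule: modality-specific supports are
# multiplied and renormalized. The support values below are hypothetical.
import numpy as np

def flmp(auditory_support, visual_support):
    """Return response probabilities under the FLMP multiplicative rule."""
    a = np.asarray(auditory_support, dtype=float)
    v = np.asarray(visual_support, dtype=float)
    joint = a * v
    return joint / joint.sum()

# Hypothetical degrees of support for the categories /ba/, /da/, /ga/:
# the voice mostly supports /ba/, the face mostly supports /ga/.
auditory = [0.70, 0.25, 0.05]
visual   = [0.05, 0.35, 0.60]
print(flmp(auditory, visual))
# The intermediate category /da/ receives the largest probability (~0.57),
# mirroring the classic McGurk fusion percept.
```

Fitting such a model to observed response counts and comparing it, under cross-validation and with a regularization penalty on the parameters, against a model defined over a continuous internal representation would follow the model-comparison logic described in the passage above.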

