scholarly journals Talker variability facilitates the statistical learning of speech sounds

2021 ◽  
Author(s):  
Stephen Charles Van Hedger ◽  
Mykayla Winspear ◽  
Laura Batterink

Natural speech contains many sources of acoustic variability both within and between talkers, which challenges speech recognition in some contexts but may facilitate language understanding in novel listening situations. Despite this ubiquitous variability, most previous studies that have examined the ability to extract sound patterns in speech—known as statistical learning—have used highly controlled, artificial, monotonic streams of syllables. Thus, it is unknown whether variability in speech may help or hinder statistical learning – an important question to resolve if statistical learning does indeed play a role in the segmentation of naturally spoken language, as widely theorized. Here, we assessed whether the use of naturally produced, variable speech sounds produced by multiple talkers benefits or impairs statistical learning, including the ability to generalize patterns to a novel talker. During training, participants listened to approximately 12 minutes of continuous speech made up of repeating trisyllabic words, spoken either by a single talker (single-talker condition) or four talkers speaking for three minutes each (multiple-talker condition). Post-training, all participants completed three assessments of learning: (1) an explicit familiarity rating task, (2) an explicit forced-choice recognition task, and (3) an implicit syllable target detection task. Results indicated that participants in both training conditions showed evidence of statistical learning across all assessments, providing an important demonstration that statistical learning is robust to additional variability in the speech signal. Further, in both the forced-choice recognition task and target detection task, participants in the multiple-talker condition showed evidence of facilitated statistical learning, particularly when listening to a novel talker. In the familiarity rating task, performance was comparable between conditions; however, participants trained with multiple talkers were less likely to conflate word familiarity with talker voice familiarity. Overall, these results suggest that training with multiple talkers can improve aspects of statistical learning across multiple measures of learning.

2021 ◽  
Author(s):  
Ava Kiai ◽  
Lucia Melloni

Statistical learning (SL) allows individuals to rapidly detect regularities in the sensory environment. We replicated previous findings showing that adult participants become sensitive to the implicit structure in a continuous speech stream of repeating tri-syllabic pseudowords within minutes, as measured by standard tests in the SL literature: a target detection task and a 2AFC word recognition task. Consistent with previous findings, we found only a weak correlation between these two measures of learning, leading us to question whether there is overlap between the information captured by these two tasks. Representational similarity analysis on reaction times measured during the target detection task revealed that reaction time data reflect sensitivity to transitional probability, triplet position, word grouping, and duplet pairings of syllables. However, individual performance on the word recognition task was not predicted by similarity measures derived for any of these four features. We conclude that online detection tasks provide richer and multi-faceted information about the SL process, as compared with 2AFC recognition tasks, and may be preferable for gaining insight into the dynamic aspects of SL.


2021 ◽  
Author(s):  
Yaxin Liu ◽  
Stella F. Lourenco

Apparent motion is a robust perceptual phenomenon in which observers perceive a stimulus traversing the vacant visual space between two flashed stimuli. Although it is known that the “filling-in” of apparent motion favors the simplest and most economical path, the interpolative computations remain poorly understood. Here, we tested whether the perception of apparent motion is best characterized by Newtonian physics or kinematic geometry. Participants completed a target detection task while Pacmen- shaped objects were presented in succession to create the perception of apparent motion. We found that target detection was impaired when apparent motion, as predicted by kinematic geometry, not Newtonian physics, obstructed the target’s location. Our findings shed light on the computations employed by the visual system, suggesting specifically that the “filling-in” perception of apparent motion may be dominated by kinematic geometry, not Newtonian physics.


Author(s):  
Md Abdullah Al Fahim ◽  
Mohammad Maifi Hasan Khan ◽  
Theodore Jensen ◽  
Yusuf Albayram ◽  
Emil Coman ◽  
...  

2013 ◽  
Vol 134 (5) ◽  
pp. 4119-4119
Author(s):  
Laura N. Kloepper ◽  
James A. Simmons ◽  
Jason E. Gaudette ◽  
Ryan Himmelwright ◽  
Dan Robitzski

Sign in / Sign up

Export Citation Format

Share Document