Continuous dynamics in behavior reveal perceptual nonlinearities aid speech categorization in noise

2021 ◽  
Author(s):  
Gavin Bidelman ◽  
Jared Carter

Spoken language comprehension requires listeners map continuous features of the speech signal to discrete category labels. Categories are however malleable to surrounding context; listeners’ percept can dynamically shift depending on the sequencing of adjacent stimuli resulting in a warping of the heard phonetic category (i.e., hysteresis). Here, we investigated whether such perceptual nonlinearities—which amplify categorical hearing—might further aid speech processing in noise-degraded listening scenarios. We measured continuous dynamics in perception and category judgments of an acoustic-phonetic vowel gradient via mouse tracking. Tokens were presented in serial vs. random orders to induce more/less perceptual warping while listeners categorized continua in clean and noise conditions. Listeners’ responses were faster and their mouse trajectories closer to the ultimate behavioral selection (marked visually on the screen) in serial vs. random order, suggesting increased perceptual attraction to category exemplars. Interestingly, order effects emerged earlier and persisted later in the trial time course when categorizing speech in noise. These data describe a new functional benefit of perceptual nonlinearities to speech perception yet undocumented: warping strengthens the behavioral attraction to relevant speech categories while simultaneously assisting perception in degraded acoustic environments.

1991 ◽  
Vol 70 (2) ◽  
pp. 504-510 ◽  
Author(s):  
D. W. Ray ◽  
A. Garland ◽  
C. Hernandez ◽  
S. Eappen ◽  
L. Alger ◽  
...  

We examined the effects of hyperpnea duration and abrupt changes in inspired gas heat and water content on the magnitude and time course of hyperpnea-induced bronchoconstriction (HIB) in anesthetized mechanically ventilated male Hartley guinea pigs. In 12 animals subjected to 5, 10, and 15 min (random order) of dry gas isocapnic hyperpnea [tidal volume (VT) 4-6 ml, 150 breaths/min) followed by quiet breathing of humidified air (VT 2-3 ml, 60 breaths/min), severe bronchoconstriction developed only after the cessation of hyperpnea; the magnitude of respiratory system resistance (Rrs) increased with the duration of dry gas hyperpnea [peak Rrs 1.0 +/- 0.2, 1.8 +/- 0.3, and 2.3 +/- 0.3 (SE) cmH2O.ml-1.s, respectively]. Seven other guinea pigs received, in random order, 10 min of warm humidified gas hyperpnea, 10 min of room temperature dry gas hyperpnea, and 5 min of dry gas hyperpnea immediately followed by 5 min of warm humidified gas hyperpnea. After each hyperpnea period, the animal was returned to quiet breathing of humidified gas. Rrs rose appreciably after the 10 min of dry and 5 min of dry-5 min of humidified hyperpnea challenges (peak Rrs 1.3 +/- 0.2 and 0.7 +/- 0.2 cmH2O.ml-1.s, respectively) but not after 10 min of humidified hyperpnea (0.2 +/- 0.04 cmH2O.ml-1.s). An additional five animals received 10 min of room temperature dry gas hyperpnea followed by quiet breathing of warm humidified air and 10 min of room temperature dry gas hyperpnea followed by 30 min of warm humidified gas hyperpnea in random order.(ABSTRACT TRUNCATED AT 250 WORDS)


2013 ◽  
Vol 25 (3) ◽  
pp. 465-483 ◽  
Author(s):  
Elena Rusconi ◽  
Martynas Dervinis ◽  
Frederick Verbruggen ◽  
Christopher D. Chambers

Neuropsychological, neurophysiological, and neuroimaging studies suggest that right frontoparietal circuits may be necessary for the processing of mental number space, also known as the mental number line (MNL). Here we sought to specify the critical time course of three nodes that have previously been related to MNL processing: right posterior parietal cortex (rPPC), right FEF (rFEF), and right inferior frontal gyrus (rIFG). The effects of single-pulse TMS delivered at 120% distance-adjusted individual motor threshold were investigated in 21 participants, within a window of 0–400 msec (sampling interval = 33 msec) from the onset of a central digit (1–9, 5 excluded). Pulses were delivered in a random order and with equal probability at each time point, intermixed with noTMS trials. To analyze whether and when TMS interfered with MNL processing, we fitted bimodal Gaussian functions to the observed data and measured effects on changes in the Spatial–Numerical Association of Response Codes (SNARC) effect (i.e., an advantage for left- over right-key responses to small numbers and right- over left-key responses to large numbers) and in overall performance efficiency. We found that, during magnitude judgment with unimanual key-press responses, TMS reduced the SNARC effect in the earlier period of the fitted functions (∼25–60 msec) when delivered over rFEF (small and large numbers) and rIFG (small numbers); TMS further reduced the SNARC effect for small numbers in a later period when delivered to rFEF (∼200 msec). In contrast, TMS of rPPC did not interfere with the SNARC effect but generally reduced performance for small numbers and enhanced it for large numbers, thus producing a pattern reminiscent of “neglect” in mental number space. Our results confirm the causal role of an intact right frontoparietal network in the processing of mental number space. They also indicate that rPPC is specifically tied to explicit number magnitude processing and that rFEF and rIFG contribute to interfacing mental visuospatial codes with lateralized response codes. Overall, our findings suggest that both ventral and dorsal frontoparietal circuits are causally involved and functionally connected in the mapping of numbers to space.


PLoS ONE ◽  
2021 ◽  
Vol 16 (9) ◽  
pp. e0257367
Author(s):  
Kosuke Takeuchi ◽  
Kazunori Akizuki ◽  
Masatoshi Nakamura

Objectives The purpose of this study was to examine the time course of changes in the range of motion and muscle-tendon unit stiffness of the hamstrings after two different intensities of static stretching. Methods Fourteen healthy men (20.9 ± 0.7 years, 169.1 ± 7.5cm, 61.6 ± 6.5kg) received static stretching for 60 seconds at two different intensities based on the point of discomfort (100%POD and 120%POD) of each participant, in random order. To evaluate the time course of changes in the flexibility of the hamstrings, the knee extension range of motion (ROM), passive torque at end ROM, and muscle-tendon unit stiffness were measured pre-stretching, post-stretching, and at both 10 and 20 minutes after static stretching. Results For both intensities, ROM and passive torque at pre-stretching were significantly smaller than those at post-stretching (p < 0.01 in both intensities), 10 minutes (p < 0.01 in both intensities), and 20 minutes (p < 0.01 in both intensities). The muscle-tendon unit stiffness at pre-stretching was significantly higher than that at post-stretching (p < 0.01), 10 minutes (p < 0.01), and 20 minutes (p < 0.01) only in the 120%POD, but it showed no change in the 100%POD. Conclusion The results showed that ROM and passive torque increased in both intensities, and the effects continued for at least 20 minutes after stretching regardless of stretching intensity. However, the muscle-tendon unit stiffness of the hamstrings decreased only after static stretching at the intensity of 120%POD, and the effects continued for at least 20 minutes after stretching.


Sensors ◽  
2021 ◽  
Vol 21 (18) ◽  
pp. 6320
Author(s):  
Piotr Odya ◽  
Jozef Kotus ◽  
Adam Kurowski ◽  
Bozena Kostek

The paper aims to discuss a case study of sensing analytics and technology in acoustics when applied to reverberation conditions. Reverberation is one of the issues that makes speech in indoor spaces challenging to understand. This problem is particularly critical in large spaces with few absorbing or diffusing surfaces. One of the natural remedies to improve speech intelligibility in such conditions may be achieved through speaking slowly. It is possible to use algorithms that reduce the rate of speech (RoS) in real time. Therefore, the study aims to find recommended values of RoS in the context of STI (speech transmission index) in different acoustic environments. In the experiments, speech intelligibility for six impulse responses recorded in spaces with different STIs is investigated using a sentence test (for the Polish language). Fifteen subjects with normal hearing participated in these tests. The results of the analytical analysis enabled us to propose a curve specifying the maximum RoS values translating into understandable speech under given acoustic conditions. This curve can be used in speech processing control technology as well as compressive reverse acoustic sensing.


2021 ◽  
Vol 12 ◽  
Author(s):  
Elif Canseza Kaplan ◽  
Anita E. Wagner ◽  
Paolo Toffanin ◽  
Deniz Başkent

Earlier studies have shown that musically trained individuals may have a benefit in adverse listening situations when compared to non-musicians, especially in speech-on-speech perception. However, the literature provides mostly conflicting results. In the current study, by employing different measures of spoken language processing, we aimed to test whether we could capture potential differences between musicians and non-musicians in speech-on-speech processing. We used an offline measure of speech perception (sentence recall task), which reveals a post-task response, and online measures of real time spoken language processing: gaze-tracking and pupillometry. We used stimuli of comparable complexity across both paradigms and tested the same groups of participants. In the sentence recall task, musicians recalled more words correctly than non-musicians. In the eye-tracking experiment, both groups showed reduced fixations to the target and competitor words’ images as the level of speech maskers increased. The time course of gaze fixations to the competitor did not differ between groups in the speech-in-quiet condition, while the time course dynamics did differ between groups as the two-talker masker was added to the target signal. As the level of two-talker masker increased, musicians showed reduced lexical competition as indicated by the gaze fixations to the competitor. The pupil dilation data showed differences mainly in one target-to-masker ratio. This does not allow to draw conclusions regarding potential differences in the use of cognitive resources between groups. Overall, the eye-tracking measure enabled us to observe that musicians may be using a different strategy than non-musicians to attain spoken word recognition as the noise level increased. However, further investigation with more fine-grained alignment between the processes captured by online and offline measures is necessary to establish whether musicians differ due to better cognitive control or sound processing.


Nutrients ◽  
2018 ◽  
Vol 10 (12) ◽  
pp. 1911 ◽  
Author(s):  
Pedapati S.C. Sri Harsha ◽  
Roshaida Abdul Wahab ◽  
Catalina Cuparencu ◽  
Lars Dragsted ◽  
Lorraine Brennan

A significant body of evidence demonstrates that isoflavone metabolites are good markers of soy intake, while research is lacking on specific markers of other leguminous sources such as peas. In this context, the objective of our current study was to identify biomarkers of pea intake using an untargeted metabolomics approach. A randomized cross-over acute intervention study was conducted on eleven participants who consumed peas and couscous (control food) in random order. The urine samples were collected in fasting state and postprandially at regular intervals and were further analysed by ultra-performance liquid chromatography coupled to quadrupole time of flight mass spectrometry (UPLC-QTOF-MS). Multivariate statistical analysis resulted in robust Partial least squares Discriminant Analysis (PLS-DA) models obtained for comparison of fasting against the postprandial time points (0 h vs. 4 h, (R2X = 0.41, Q2 = 0.4); 0 h vs. 6 h, ((R2X = 0.517, Q2 = 0.495)). Variables with variable importance of projection (VIP) scores ≥1.5 obtained from the PLS-DA plot were considered discriminant between the two time points. Repeated measures analysis of variance (ANOVA) was performed to identify features with a significant time effect. Assessment of the time course profile revealed that ten features displayed a differential time course following peas consumption compared to the control food. The interesting features were tentatively identified using accurate mass data and confirmed by tandem mass spectrometry (MS using commercial spectral databases and authentic standards. 2-Isopropylmalic acid, asparaginyl valine and N-carbamoyl-2-amino-2-(4-hydroxyphenyl) acetic acid were identified as markers reflecting pea intake. The three markers also increased in a dose-dependent manner in a randomized intervention study and were further confirmed in an independent intervention study. Overall, key validation criteria were met for the successfully identified pea biomarkers. Future work will examine their use in nutritional epidemiology studies.


2017 ◽  
Vol 60 (10) ◽  
pp. 3001-3008 ◽  
Author(s):  
Lori J. Leibold

Purpose The ability to hear and understand speech in complex acoustic environments follows a prolonged time course of development. The purpose of this article is to provide a general overview of the literature describing age effects in susceptibility to auditory masking in the context of speech recognition, including a summary of findings related to the maturation of processes thought to facilitate segregation of target from competing speech. Method Data from published and ongoing studies are discussed, with a focus on synthesizing results from studies that address age-related changes in the ability to perceive speech in the presence of a small number of competing talkers. Conclusions This review provides a summary of the current state of knowledge that is valuable for researchers and clinicians. It highlights the importance of considering listener factors, such as age and hearing status, as well as stimulus factors, such as masker type, when interpreting masked speech recognition data. Presentation Video http://cred.pubs.asha.org/article.aspx?articleid=2601620


2017 ◽  
Vol 12 (1) ◽  
pp. 51-70 ◽  
Author(s):  
Lisa R. Van Havermaet ◽  
Lee H. Wurm

Abstract Several previous studies have shown that the time-course of word recognition is determined in part by an interaction between connotations of Danger and Usefulness. A small, mostly separate literature has investigated the role of Body-Object Interaction (BOI) in lexical processing. BOI is defined as the ease with which one can interact with an object. To date the lexical decision study of Van Havermaet and Wurm (2014) is the only study to include all three of these constructs. Stimuli in the current study were black-and-white line drawings corresponding to the common nouns used by Van Havermaet and Wurm (2014). Participants viewed the stimuli one at a time in a random order and had to name them as quickly as possible. Naming times revealed a significant three-way interaction between Danger, Usefulness, and BOI similar to that found for visual lexical decision: The familiar Danger x Usefulness interaction, observed in many previous studies, was observed only for items relatively lower on BOI. The interaction between semantic and embodied processing variables is not restricted to purely linguistic stimuli.


2018 ◽  
Vol 7 (2.24) ◽  
pp. 436
Author(s):  
S Nageswara Rao ◽  
K Jaya Sankar ◽  
C D. Naidu

This paper proposes a new speech enhancement framework to improve the quality of speeches recorded under adverse acoustic environments based on the speech presence uncertainty. Since the uncertainty evaluation gives a more and clear discrimination about the speech and noise, this paper proposes a new uncertainty evaluation mechanism as a preprocessing mechanism to the noise suppression methods.  This mechanism relates with energies of a noisy speech signal and classifies the speech segments and noise segments more perfectly. In addition to the quality enhancement, this approach also reduces the unnecessary computational burden over the speech processing system. Extensive simulations are carried out over the speech signals with different types of non-stationary noises like babble noise, exhibition noise, restaurant noise and train station noises and the performance is measured with the performance metrics namely the Output SNR, AvgSegSNR, PESQ and COMP. The comparative analysis of proposed approach over the conventional approaches shows an outstanding performance in all environments.  


Author(s):  
Christian Frings ◽  
Christina Bermeitinger ◽  
Dirk Wentura

In the paradigm of repeated masked semantic priming ( Wentura & Frings, 2005 ), prime and mask are repeatedly and rapidly interchanged. Using this technique in a semantic priming task with category labels as primes and category exemplars as targets (related, e.g., BIRD – swan → BIRD – finch; unrelated, e.g., BIRD – lily → FRUIT – finch), Wentura and Frings found a negatively signed priming effect. Here we used the repeated masking technique with category exemplars as targets and primes (i.e., identity priming) for analyzing, whether this effect reflects center-surround or spreading inhibition. If the repeated masked technique reflects spreading inhibition, a negative effect should also appear for identity priming. In contrast, a center-surround approach would predict a positive effect. In accordance with the latter hypothesis, we found a significant positive effect in identity priming (Experiment 1a) and significant difference to the negatively signed semantic priming effect when primes were category labels (Experiment 1b). This is indicative of the repeated masked semantic priming effect being a negatively signed semantic priming effect due to a center-surround mechanism.


Sign in / Sign up

Export Citation Format

Share Document