Behavioral Account of Attended Stream Enhances Neural Tracking

2021 ◽  
Vol 15 ◽  
Author(s):  
Moïra-Phoebé Huet ◽  
Christophe Micheyl ◽  
Etienne Parizet ◽  
Etienne Gaudrain

During the past decade, several studies have identified electroencephalographic (EEG) correlates of selective auditory attention to speech. In these studies, typically, listeners are instructed to focus on one of two concurrent speech streams (the “target”), while ignoring the other (the “masker”). EEG signals are recorded while participants are performing this task, and subsequently analyzed to recover the attended stream. An assumption often made in these studies is that the participant’s attention can remain focused on the target throughout the test. To check this assumption, and assess when a participant’s attention in a concurrent speech listening task was directed toward the target, the masker, or neither, we designed a behavioral listen-then-recall task (the Long-SWoRD test). After listening to two simultaneous short stories, participants had to identify keywords from the target story, randomly interspersed among words from the masker story and words from neither story, on a computer screen. To modulate task difficulty, and hence the likelihood of attentional switches, masker stories were originally uttered by the same talker as the target stories. The masker voice parameters were then manipulated to parametrically control the similarity of the two streams, from clearly dissimilar to almost identical. While participants listened to the stories, EEG signals were measured and subsequently analyzed using a temporal response function (TRF) model to reconstruct the speech stimuli. Responses in the behavioral recall task were used to infer, retrospectively, when attention was directed toward the target, the masker, or neither.
During the model-training phase, the results of these behavioral-data-driven inferences were used as inputs to the model in addition to the EEG signals, to determine if this additional information would improve stimulus reconstruction accuracy, relative to the performance of models trained under the assumption that the listener’s attention was unwaveringly focused on the target. Results from 21 participants show that information regarding the actual, as opposed to assumed, attentional focus can be used advantageously during model training, to enhance subsequent (test phase) accuracy of auditory stimulus reconstruction based on EEG signals. This is especially the case in challenging listening situations, where the participants’ attention is less likely to remain focused entirely on the target talker. In situations where the two competing voices are clearly distinct and easily separated perceptually, the assumption that listeners are able to stay focused on the target is reasonable. The behavioral recall protocol introduced here provides experimenters with a means to behaviorally track fluctuations in auditory selective attention, including in combined behavioral/neurophysiological studies.
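The idea of training a stimulus-reconstruction model on the inferred rather than the assumed attentional focus can be illustrated with a minimal sketch. It is not the authors' implementation: the ridge-regularized backward model, the function names (`lag_matrix`, `train_decoder`, `reconstruct`), the label coding (0 = target, 1 = masker, -1 = neither) and the synthetic data are all assumptions made for illustration. The toy listener switches from the target to the masker halfway through, so the example shows the mechanics only and says nothing about real effect sizes.

```python
import numpy as np

def lag_matrix(eeg, n_lags):
    """Stack time-lagged copies of the EEG: (samples, channels) -> (samples, channels * n_lags)."""
    n_samples, n_channels = eeg.shape
    lagged = np.zeros((n_samples, n_channels * n_lags))
    for lag in range(n_lags):
        cols = slice(lag * n_channels, (lag + 1) * n_channels)
        # The neural response follows the stimulus, so stimulus sample t
        # is decoded from EEG samples t, t+1, ..., t+n_lags-1.
        lagged[: n_samples - lag, cols] = eeg[lag:]
    return lagged

def train_decoder(eeg, envelope, keep, n_lags=8, ridge=1.0):
    """Ridge regression from lagged EEG to a speech envelope, using only samples where keep is True."""
    X = lag_matrix(eeg, n_lags)[keep]
    y = envelope[keep]
    return np.linalg.solve(X.T @ X + ridge * np.eye(X.shape[1]), X.T @ y)

def reconstruct(eeg, weights, n_lags=8):
    """Apply a trained decoder to EEG to obtain a reconstructed envelope."""
    return lag_matrix(eeg, n_lags) @ weights

# Synthetic toy data (hypothetical): white-noise "envelopes" and 4 EEG channels.
rng = np.random.default_rng(0)
n_samples, n_channels = 4000, 4
target_env = rng.standard_normal(n_samples)
masker_env = rng.standard_normal(n_samples)

# Assumed behavioral labels per sample: 0 = target, 1 = masker, -1 = neither.
labels = np.zeros(n_samples, dtype=int)
labels[n_samples // 2:] = 1  # the toy listener switches to the masker halfway

# The simulated EEG tracks whichever stream is attended, with a channel-specific delay.
attended_env = np.where(labels == 1, masker_env, target_env)
eeg = np.stack([np.roll(attended_env, c + 1) for c in range(n_channels)], axis=1)
eeg += 0.5 * rng.standard_normal(eeg.shape)

# Informed training: regress onto the stream the labels point to, dropping "neither".
keep = labels >= 0
w_informed = train_decoder(eeg, attended_env, keep)
# Assumed-focus training: regress onto the target for the whole recording.
w_assumed = train_decoder(eeg, target_env, np.ones(n_samples, dtype=bool))

r_informed = np.corrcoef(reconstruct(eeg, w_informed), attended_env)[0, 1]
r_assumed = np.corrcoef(reconstruct(eeg, w_assumed), target_env)[0, 1]
print(f"informed: r = {r_informed:.2f}, assumed target: r = {r_assumed:.2f}")
```

In this toy setting the informed decoder fits its training envelope far better, simply because half of the "assumed target" training data pairs the EEG with a stream the simulated listener was not following. A realistic comparison would evaluate both decoders on held-out data, as the abstract describes for the test phase.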

2021 ◽  
pp. 174702182199003
Author(s):  
Andy J Kim ◽  
David S Lee ◽  
Brian A Anderson

Previously reward-associated stimuli have consistently been shown to involuntarily capture attention in the visual domain. Although previously reward-associated but currently task-irrelevant sounds have also been shown to interfere with visual processing, it remains unclear whether such stimuli can interfere with the processing of task-relevant auditory information. To address this question, we modified a dichotic listening task to measure interference from task-irrelevant but previously reward-associated sounds. In a training phase, participants were simultaneously presented with a spoken letter and number in different auditory streams and learned to associate the correct identification of each of three letters with high, low, and no monetary reward, respectively. In a subsequent test phase, participants were again presented with the same auditory stimuli but were instead instructed to report the number while ignoring spoken letters. In both the training and test phases, response time measures demonstrated that attention was biased in favour of the auditory stimulus associated with high value. Our findings demonstrate that attention can be biased towards learned reward cues in the auditory domain, interfering with goal-directed auditory processing.


2021 ◽  
Author(s):  
Laurianne Cabrera ◽  
Bonnie K. Lau

The processing of auditory temporal information is important for the extraction of voice pitch, linguistic information, and the overall temporal structure of speech. However, many aspects of its early development remain poorly understood. This paper reviews the development of different aspects of auditory temporal processing during the first year of life, when infants are acquiring their native language. First, potential mechanisms of neural immaturity are discussed in the context of neurophysiological studies. Next, what is known about infant auditory capabilities is considered with a focus on psychophysical studies involving non-speech stimuli to investigate the perception of temporal fine structure and envelope cues. This is followed by a review of studies involving speech stimuli, including those that present vocoded signals as a method of degrading the spectro-temporal information available to infant listeners. Finally, we highlight key findings from the cochlear implant literature that illustrate the importance of temporal cues in speech perception.


Geophysics ◽  
1981 ◽  
Vol 46 (9) ◽  
pp. 1304-1313 ◽  
Author(s):  
Ronald A. van Overmeeren

In the savannah belt of central Sudan, near the town of Kosti, a regional geophysical survey has been carried out forming part of a groundwater project. Because of the presence of detectable and significant contrasts in physical properties of the subsoil, integrated use could be made of electrical resistivity, seismic refraction, and gravity methods. In the interpretation of multilayer electrical sounding curves, additional subsurface information such as lithological well descriptions and geophysical well logs is normally a necessity for solving the problems of equivalence. Along a profile in the eastern part of the area studied, where additional subsurface information was scarce, 16 vertical electrical soundings have been made. A preliminary simple mathematical interpretation suggested possibilities for the presence of fresh groundwater in the eastern part of the profile. In order to solve the equivalence problem, seismic refraction work was carried out at some selected places; that yielded additional information on depths to bedrock. These seismic data made possible a unique solution of the electrical sounding curves, from which it could be concluded that all groundwater in the area is saline. Subsequent test drilling confirmed these findings. A regional relative Bouguer anomaly map provided a picture of the general geologic structures and made possible rough estimates of depths to bedrock. In areas where the basement rocks are relatively close to the surface, as is the case with the profile presented, the gravity anomalies cannot be correlated with bedrock relief, because the effect is strongly influenced by lateral density variations within the bedrock itself. This is an example of a case where only an integrated application of several geophysical exploration methods can provide the desired hydrogeologic information in an acceptable balance between reliability and cost.


ReCALL ◽  
2017 ◽  
Vol 30 (1) ◽  
pp. 24-47 ◽  
Author(s):  
Fidel Çakmak ◽  
Gülcan Erçetin

Abstract This study investigates the effects of multimedia glosses on text recall and incidental vocabulary learning in a mobile-assisted L2 listening task. A total of 88 participants with a low level of proficiency in English were randomly assigned to one of four conditions that involved single-channel (textual-only, pictorial-only) and dual-channel (textual-plus-pictorial) glosses as well as a control condition where no glosses were provided. The participants listened to a story through their mobile phones and were engaged in an immediate free recall task and unannounced vocabulary tests after listening. The findings indicated that access to glosses facilitated recognition and production of vocabulary with the type of gloss having no effect. On the other hand, glosses had no effect on text recall.


Author(s):  
Szymon Drgas ◽  
Magdalena Blaszak ◽  
Anna Przekoracka-Krawczyk

Purpose: The acoustic source that is attended to by the listener in a mixture can be identified with a certain accuracy on the basis of their neural response recorded during listening, and various phenomena may be used to detect attention. For example, neural tracking (NT) and alpha power lateralization (APL) may be utilized in order to obtain information concerning attention. However, these methods of auditory attention detection (AAD) are typically tested in different experimental setups, which makes it impossible to compare their accuracy. The aim of this study is to compare the accuracy of AAD based on NT, APL, and their combination for a dichotic natural speech listening task. Method: Thirteen adult listeners were presented with dichotic speech stimuli and instructed to attend to one of them. The electroencephalogram of the subjects was continuously recorded during the experiment using a set of 32 active electrodes. The accuracy of AAD was evaluated for trial lengths of 50, 25, and 12.5 s. AAD was tested for various parameters of NT- and APL-based modules. Results: The obtained results suggest that NT of natural running speech provides similar accuracy to APL. A statistically significant improvement in the accuracy of AAD using a combined method was observed not only for the longest duration of test samples (50 s, p = .005) but also for shorter ones (25 s, p = .011). Conclusions: It seems that the combination of standard NT and APL significantly increases the effectiveness of accurate identification of the traced signal perceived by a listener under dichotic conditions. It has been demonstrated that, under certain conditions, the combination of NT and APL may provide a benefit for AAD in cocktail party scenarios.
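As a rough illustration of what an APL-based feature might look like, the sketch below computes alpha-band power from two signals and combines them into a normalized lateralization index. The abstract does not give the formula or parameters used in the study, so the 8 to 12 Hz band, the 128 Hz sampling rate, the normalized-difference index and the function names are all assumptions; only the 12.5 s trial length is taken from the text. How the sign of the index maps onto the attended side is left open here and would need calibration on real data.

```python
import numpy as np

def alpha_power(signal, fs, band=(8.0, 12.0)):
    """Mean spectral power of a 1-D signal inside the (assumed) alpha band."""
    spectrum = np.abs(np.fft.rfft(signal)) ** 2
    freqs = np.fft.rfftfreq(signal.size, d=1.0 / fs)
    in_band = (freqs >= band[0]) & (freqs <= band[1])
    return spectrum[in_band].mean()

def lateralization_index(left, right, fs):
    """Normalized right-minus-left alpha power difference, bounded in [-1, 1]."""
    p_left = alpha_power(left, fs)
    p_right = alpha_power(right, fs)
    return (p_right - p_left) / (p_right + p_left)

# Synthetic trial (hypothetical): stronger 10 Hz activity in the left-hemisphere signal.
fs = 128                 # assumed sampling rate in Hz
n = int(12.5 * fs)       # 12.5 s, the shortest trial length in the abstract
t = np.arange(n) / fs
rng = np.random.default_rng(0)
left = 2.0 * np.sin(2 * np.pi * 10 * t) + 0.1 * rng.standard_normal(n)
right = 0.5 * np.sin(2 * np.pi * 10 * t) + 0.1 * rng.standard_normal(n)

index = lateralization_index(left, right, fs)
print(f"lateralization index: {index:.2f}")  # negative: more alpha power on the left
```

A combined detector of the kind compared in the study could feed such an index, together with a neural-tracking score, into a single classifier; the abstract does not specify how the two were combined.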


2021 ◽  
Vol 12 ◽  
Author(s):  
Ivine Kuruvila ◽  
Jan Muncke ◽  
Eghart Fischer ◽  
Ulrich Hoppe

The human brain performs remarkably well in segregating a particular speaker from interfering ones in a multispeaker scenario. We can quantitatively evaluate the segregation capability by modeling a relationship between the speech signals present in an auditory scene and the listener's cortical signals measured using electroencephalography (EEG). This has opened up avenues to integrate neuro-feedback into hearing aids, where the device can infer the user's attention and enhance the attended speaker. Commonly used algorithms to infer auditory attention are based on linear systems theory, where cues such as speech envelopes are mapped onto the EEG signals. Here, we present a joint convolutional neural network (CNN) and long short-term memory (LSTM) model to infer auditory attention. Our joint CNN-LSTM model takes the EEG signals and the spectrograms of the multiple speakers as inputs and classifies the attention to one of the speakers. We evaluated the reliability of our network using three different datasets comprising 61 subjects, where each subject undertook a dual-speaker experiment. The three datasets analyzed corresponded to speech stimuli presented in three different languages, namely German, Danish, and Dutch. Using the proposed joint CNN-LSTM model, we obtained a median decoding accuracy of 77.2% at a trial duration of 3 s. Furthermore, we evaluated the amount of sparsity that the model can tolerate by means of magnitude pruning and found a tolerance of up to 50% sparsity without substantial loss of decoding accuracy.
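Magnitude pruning, mentioned at the end of the abstract, can be sketched in a few lines: the weights with the smallest absolute values are set to zero until the desired fraction of entries is zero. The version below prunes a single weight matrix in isolation. Whether the authors pruned per layer or globally, and whether they fine-tuned afterwards, is not stated in the abstract, so the function name `magnitude_prune` and the random weights are purely illustrative.

```python
import numpy as np

def magnitude_prune(weights, sparsity):
    """Return a copy of `weights` with the smallest-magnitude fraction `sparsity` set to zero."""
    magnitudes = np.abs(weights).ravel()
    n_prune = int(round(sparsity * magnitudes.size))
    if n_prune == 0:
        return weights.copy()
    # The n_prune-th smallest magnitude serves as the pruning threshold.
    threshold = np.partition(magnitudes, n_prune - 1)[n_prune - 1]
    # Entries at or below the threshold are removed (ties are pruned as well).
    mask = np.abs(weights) > threshold
    return weights * mask

# Hypothetical layer weights; a real model would supply its trained parameters.
rng = np.random.default_rng(0)
layer = rng.standard_normal((64, 32))

pruned = magnitude_prune(layer, sparsity=0.5)
achieved = 1.0 - np.count_nonzero(pruned) / pruned.size
print(f"achieved sparsity: {achieved:.2f}")
```

To assess tolerance in the way the abstract describes, one would re-evaluate decoding accuracy after pruning at increasing sparsity levels and note where accuracy starts to drop.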


2018 ◽  
Vol 29 (8) ◽  
pp. 3253-3265 ◽  
Author(s):  
Sebastian Puschmann ◽  
Sylvain Baillet ◽  
Robert J Zatorre

Abstract Musical training has been demonstrated to benefit speech-in-noise perception. It is however unknown whether this effect translates to selective listening in cocktail party situations, and if so what its neural basis might be. We investigated this question using magnetoencephalography-based speech envelope reconstruction and a sustained selective listening task, in which participants with varying amounts of musical training attended to 1 of 2 speech streams while detecting rare target words. Cortical frequency-following responses (FFR) and auditory working memory were additionally measured to dissociate musical training-related effects on low-level auditory processing versus higher cognitive function. Results show that the duration of musical training is associated with a reduced distracting effect of competing speech on target detection accuracy. Remarkably, more musical training was related to a robust neural tracking of both the to-be-attended and the to-be-ignored speech stream, up until late cortical processing stages. Musical training-related increases in FFR power were associated with a robust speech tracking in auditory sensory areas, whereas training-related differences in auditory working memory were linked to an increased representation of the to-be-ignored stream beyond auditory cortex. Our findings suggest that musically trained persons can use additional information about the distracting stream to limit interference by competing speech.


MedChemComm ◽  
2011 ◽  
Vol 2 (1) ◽  
pp. 16-30 ◽  
Author(s):  
Gerard J. P. van Westen ◽  
Jörg K. Wegner ◽  
Adriaan P. IJzerman ◽  
Herman W. T. van Vlijmen ◽  
A. Bender

Proteochemometric modeling is founded on the principles of QSAR but is able to benefit from additional information in model training due to the inclusion of target information.


2021 ◽  
Vol 12 ◽  
Author(s):  
Irit Shapira-Lichter ◽  
Noga Oren ◽  
Anita Asvadurian ◽  
Rachel Ben-Hayun ◽  
Tali Fisher ◽  
...  

Characterizing episodic memory abilities is highly important in the diagnosis of Alzheimer's disease (AD) and mild cognitive impairment (MCI), and usually includes wordlist learning and recall tasks. Clinical evaluations typically focus on the number of words recalled, ignoring additional information, like serial position. Here, we tested the potential value of two serial positioning measures for clinical diagnosis – how retrieval is initiated, as measured by the first word recalled, and how it proceeds – using data from patients with AD and MCI who completed a wordlist learning and recall task. Our results show that during the early stages of learning, patients with AD are less prone to retrieve the first word from the wordlist, manifested as a lower primacy effect in the first word recalled, compared with MCI patients. The first-word-recalled measure adds to the differentiation between the groups over and above the total number of words learned. Thus, the first word recalled during wordlist learning and recall tasks may be used as a simple complementary measure to distinguish between MCI and AD during standard neuropsychological evaluations.


2021 ◽  
Vol 12 ◽  
Author(s):  
Marco Rüth ◽  
Johannes Breuer ◽  
Daniel Zimmermann ◽  
Kai Kaspar

Testing is an effective learning method, and it is the basis of mobile quiz apps. Quiz apps have the potential to facilitate remote and self-regulated learning. In this context, automated feedback plays a crucial role. In two experimental studies, we examined the effects of two feedback types of quiz apps on performance, namely, the standard corrective feedback of quiz apps and feedback that incorporates additional information related to the correct response option. We realized a controlled lab setting (n = 68, Study 1) and an unsupervised mobile setting (n = 150, Study 2). In the learning phase, participants used the quiz app and received feedback. They also completed a subsequent test as well as a follow-up test 1 week later by using the same quiz app. Irrespective of feedback type and setting, cognitive outcomes (quiz scores) and metacognitive outcomes (response certainty) increased similarly in the short term and long term. Feedback effects were not moderated by participants' overall response certainty during learning, their prior knowledge, or the difficulty of quiz items. Moreover, we found that participants perceived the quiz app to be similarly attractive, interesting, and enjoyable in both feedback conditions and that they spent slightly more time processing quiz items in the lab setting. We discuss these results in detail, including the role of moderating and mediating factors and prospects for further research and practice. Overall, our results underline that quiz apps are useful and effective tools that can support the acquisition and retention of semantic knowledge in different learning settings.

