Characterizing speech rhythm using spectral coherence between jaw displacement and speech temporal envelope

Loquens ◽  
2021 ◽  
Vol 7 (2) ◽  
pp. e074
Author(s):  
Lei He ◽  
Yu Zhang

Lower modulation rates in the temporal envelope (ENV) of the acoustic signal are believed to be the rhythmic backbone of speech, facilitating speech comprehension through neuronal entrainment at δ- and θ-rates (these rates are comparable to the foot- and syllable-rates phonetically). The jaw acts as a carrier articulator that regulates mouth opening in a quasi-cyclical way, which corresponds to the low-frequency modulations as a physical consequence. This paper describes a method to examine the joint roles of jaw oscillation and ENV in realizing speech rhythm using spectral coherence. Relative powers in the frequency bands corresponding to the δ- and θ-oscillations in the coherence (notated as %δ and %θ, respectively) were quantified as one possible way of revealing the amount of concomitant foot- and syllable-level rhythmicity carried by both the acoustic and articulatory domains. Two English corpora (mngu0 and MOCHA-TIMIT) were used for the proof of concept. %δ and %θ were regressed on utterance duration for an initial analysis. Results showed that the degrees of foot- and syllable-sized rhythmicity differ and are contingent upon utterance length.
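As a rough illustration of the quantities described in this abstract, the sketch below computes a Welch-averaged magnitude-squared coherence between two signals and then the share of that coherence falling in a δ band and a θ band. It is not the authors' implementation: the band edges (1–3 Hz for δ, 3–7 Hz for θ), the normalization range, the segment length, and the synthetic stand-ins for jaw displacement and ENV are all assumptions made for the example. It uses only the Python standard library; in practice a routine such as `scipy.signal.coherence` would likely replace the hand-written loop.

```python
import cmath
import math
import random

# Assumed band edges in Hz (illustrative only, not taken from the paper).
DELTA = (1.0, 3.0)
THETA = (3.0, 7.0)


def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]


def one_sided_dft(segment):
    """DFT of a real sequence, keeping bins 0..n//2."""
    n = len(segment)
    return [
        sum(v * cmath.exp(-2j * math.pi * k * i / n) for i, v in enumerate(segment))
        for k in range(n // 2 + 1)
    ]


def coherence(x, y, fs, nperseg):
    """Magnitude-squared coherence, Welch averaging with 50% overlap."""
    win = hann(nperseg)
    hop = nperseg // 2
    nbins = nperseg // 2 + 1
    pxx = [0.0] * nbins
    pyy = [0.0] * nbins
    pxy = [0j] * nbins
    n_samples = min(len(x), len(y))
    for start in range(0, n_samples - nperseg + 1, hop):
        xs = x[start:start + nperseg]
        ys = y[start:start + nperseg]
        mx = sum(xs) / nperseg
        my = sum(ys) / nperseg
        # Remove the segment mean, apply the window, transform.
        fx = one_sided_dft([(a - mx) * w for a, w in zip(xs, win)])
        fy = one_sided_dft([(b - my) * w for b, w in zip(ys, win)])
        for k in range(nbins):
            pxx[k] += abs(fx[k]) ** 2
            pyy[k] += abs(fy[k]) ** 2
            pxy[k] += fx[k].conjugate() * fy[k]
    freqs = [k * fs / nperseg for k in range(nbins)]
    coh = [
        abs(pxy[k]) ** 2 / (pxx[k] * pyy[k]) if pxx[k] * pyy[k] > 0 else 0.0
        for k in range(nbins)
    ]
    return freqs, coh


def relative_band_power(freqs, coh, band, total=(DELTA[0], THETA[1])):
    """Percentage of summed coherence in `band`, relative to the `total` range."""
    in_band = sum(c for f, c in zip(freqs, coh) if band[0] <= f < band[1])
    in_total = sum(c for f, c in zip(freqs, coh) if total[0] <= f < total[1])
    return 100.0 * in_band / in_total if in_total else 0.0


# Synthetic stand-ins: both signals share a 5 Hz component plus independent noise.
fs = 50.0
rng = random.Random(0)
t = [i / fs for i in range(400)]
jaw = [math.sin(2 * math.pi * 5 * s) + rng.gauss(0, 0.5) for s in t]
env = [math.sin(2 * math.pi * 5 * s + 0.8) + rng.gauss(0, 0.5) for s in t]

freqs, coh = coherence(jaw, env, fs, nperseg=100)
pct_delta = relative_band_power(freqs, coh, DELTA)
pct_theta = relative_band_power(freqs, coh, THETA)
print(f"%delta = {pct_delta:.1f}, %theta = {pct_theta:.1f}")
```

Because the shared component sits at 5 Hz, most of the coherence in this toy case lands in the assumed θ band. With real recordings, the inputs would be the measured jaw trajectory and an envelope extracted from the audio, resampled to a common rate.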

2019 ◽  
Author(s):  
Lei He

Lower modulation rates in the temporal envelope (ENV) constitute the rhythmic frames in speech and are crucial for speech comprehension in terms of neuronal entrainments at δ- and θ-rates. The jaw plays the role of a carrier articulator producing the low-frequency modulations in this process. This paper introduces a method to examine the joint roles of jaw oscillation and ENV in formulating rhythmic frames using spectral coherence. Relative powers in the frequency bands corresponding to the δ- and θ-oscillations were quantified and regressed on utterance duration. Two English corpora were analyzed for the proof of concept.


2010 ◽  
Vol 104 (5) ◽  
pp. 2500-2511 ◽  
Author(s):  
Mary F. Howard ◽  
David Poeppel

Speech stimuli give rise to neural activity in the listener that can be observed as waveforms using magnetoencephalography. Although waveforms vary greatly from trial to trial due to activity unrelated to the stimulus, it has been demonstrated that spoken sentences can be discriminated based on theta-band (3–7 Hz) phase patterns in single-trial response waveforms. Furthermore, manipulations of the speech signal envelope and fine structure that reduced intelligibility were found to produce correlated reductions in discrimination performance, suggesting a relationship between theta-band phase patterns and speech comprehension. This study investigates the nature of this relationship, hypothesizing that theta-band phase patterns primarily reflect cortical processing of low-frequency (<40 Hz) modulations present in the acoustic signal and required for intelligibility, rather than processing exclusively related to comprehension (e.g., lexical, syntactic, semantic). Using stimuli that are quite similar to normal spoken sentences in terms of low-frequency modulation characteristics but are unintelligible (i.e., their time-inverted counterparts), we find that discrimination performance based on theta-band phase patterns is equal for both types of stimuli. Consistent with earlier findings, we also observe that whereas theta-band phase patterns differ across stimuli, power patterns do not. We use a simulation model of the single-trial response to spoken sentence stimuli to demonstrate that phase-locked responses to low-frequency modulations of the acoustic signal can account not only for the phase but also for the power results. The simulation offers insight into the interpretation of the empirical results with respect to phase-resetting and power-enhancement models of the evoked response.


2019 ◽  
Vol 85 (1(I)) ◽  
pp. 64-71 ◽  
Author(s):  
M. M. Gadenin

The cycle configuration under two-frequency loading regimes depends on a number of parameters, including the absolute values of the frequencies and amplitudes of the low-frequency and high-frequency loads superimposed in this mode, the ratio of their frequencies and amplitudes, and the phase shift between these harmonic components, the latter having a significant effect only at a small ratio of frequencies. The presence of such two-frequency regimes, or of service loading conditions schematized by them, can significantly reduce the endurance of machine parts and structures. Using the results of experimental studies of changes in endurance under two-frequency loading of specimens of cyclically stable, cyclically softened and cyclically hardened steels under rigid conditions, we have shown that the decrease in endurance under these conditions depends on the ratios of the frequencies and amplitudes of the operational low-frequency low-cycle and high-frequency vibration stresses; moreover, the higher the ratios of the amplitudes and frequencies of the superimposed harmonic loading processes, the greater the effect. It is shown that this decrease in endurance, compared to single-frequency loading of equal total stress (strain) amplitude, can be estimated using an exponential expression coupling those endurances through a parameter (reduction factor) containing the ratio of frequencies and amplitudes of operation cyclic loads and characteristic of the material. The reduction is illustrated by a set of calculation-experimental curves on the corresponding diagrams for each of the considered types of materials and compared with the experimental data.


2021 ◽  
Vol 25 ◽  
pp. 233121652110101
Author(s):  
Dmitry I. Nechaev ◽  
Olga N. Milekhina ◽  
Marina S. Tomozova ◽  
Alexander Y. Supin

The goal of the study was to investigate the role of combination products in the higher ripple-density resolution estimates obtained by discrimination between a spectrally rippled and a nonrippled noise signal than that obtained by discrimination between two rippled signals. To attain this goal, a noise band was used to mask the frequency band of expected low-frequency combination products. A three-alternative forced-choice procedure with adaptive ripple-density variation was used. The mean background (unmasked) ripple-density resolution was 9.8 ripples/oct for rippled reference signals and 21.8 ripples/oct for nonrippled reference signals. Low-frequency maskers reduced the ripple-density resolution. For masker levels from −10 to 10 dB re. signal, the ripple-density resolution for nonrippled reference signals was approximately twice as high as that for rippled reference signals. At a masker level as high as 20 dB re. signal, the ripple-density resolution decreased in both discrimination tasks. This result leads to the conclusion that low-frequency combination products are not responsible for the task-dependent difference in ripple-density resolution estimates.


Cancers ◽  
2021 ◽  
Vol 13 (6) ◽  
pp. 1443
Author(s):  
Leonie D. H. Gossel ◽  
Catrin Heim ◽  
Lisa-Marie Pfeffermann ◽  
Laura M. Moser ◽  
Halvard B. Bönig ◽  
...  

The dismal prognosis of pediatric and young adult patients with high-risk rhabdomyosarcoma (RMS) underscores the need for novel treatment options for this patient group. In previous studies, the tumor-associated surface antigen ERBB2 (HER2/neu) was identified as targetable in high-risk RMS. As a proof of concept, in this study, a novel treatment approach against RMS tumors using a genetically modified natural killer (NK)-92 cell line (NK-92/5.28.z) as an off-the-shelf ERBB2-chimeric antigen receptor (CAR)-engineered cell product was preclinically explored. In cytotoxicity assays, NK-92/5.28.z cells specifically recognized and efficiently eliminated RMS cell suspensions, tumor cell monolayers, and 3D tumor spheroids via the ERBB2-CAR even at effector-to-target ratios as low as 1:1. In contrast to unmodified parental NK-92 cells, which failed to lyse RMS cells, NK-92/5.28.z cells proliferated and became further activated through contact with ERBB2-positive tumor cells. Furthermore, high amounts of effector molecules, such as proinflammatory and antitumoral cytokines, were found in cocultures of NK-92/5.28.z cells with tumor cells. Taken together, our data suggest the enormous potential of this approach for improving the immunotherapy of treatment-resistant tumors, revealing the dual role of NK-92/5.28.z cells as CAR-targeted killers and modulators of endogenous adaptive immunity even in the inhibitory tumor microenvironment of high-risk RMS.


2021 ◽  
Vol 11 (5) ◽  
pp. 1990
Author(s):  
Vinod Devaraj ◽  
Philipp Aichinger

The characterization of voice quality is important for the diagnosis of a voice disorder. Vocal fry is a voice quality which is traditionally characterized by a low frequency and a long closed phase of the glottis. However, we also observed amplitude modulated vocal fry glottal area waveforms (GAWs) without long closed phases (positive group), which we modelled using an analysis-by-synthesis approach. Natural and synthetic GAWs are modelled. The negative group consists of euphonic, i.e., normophonic GAWs. The analysis-by-synthesis approach fits two modelled GAWs for each input GAW. One modelled GAW is modulated to replicate the amplitude and frequency modulations of the input GAW and the other modelled GAW is unmodulated. The modelling errors of the two modelled GAWs are determined to classify the GAWs into the positive and the negative groups using a simple support vector machine (SVM) classifier with a linear kernel. The modelling errors of all vocal fry GAWs obtained using the modulating model are smaller than the modelling errors obtained using the unmodulated model. Using the two modelling errors as predictors for classification, no false positives or false negatives are obtained. To further distinguish the subtypes of amplitude modulated vocal fry GAWs, the entropy of the modulator’s power spectral density and the modulator-to-carrier frequency ratio are obtained.
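The two descriptors named at the end of this abstract can be illustrated with a small sketch. It assumes a Shannon entropy over the normalized power spectral density and a plain ratio of modulator to carrier frequency; the abstract does not say which entropy definition or estimation procedure the authors used, and the function names and sample values here are hypothetical.

```python
import math


def spectral_entropy(psd):
    """Shannon entropy (bits) of a PSD treated as a probability distribution.

    Assumption: the PSD is normalized to sum to 1 and zero bins are skipped.
    """
    total = sum(psd)
    if total <= 0:
        raise ValueError("PSD must contain positive power")
    probs = [p / total for p in psd if p > 0]
    return -sum(p * math.log2(p) for p in probs)


def modulator_to_carrier_ratio(f_modulator, f_carrier):
    """Ratio of modulator frequency to carrier frequency (both in Hz)."""
    return f_modulator / f_carrier


# Hypothetical values: a flat four-bin spectrum and a single-peak spectrum.
flat_entropy = spectral_entropy([1.0, 1.0, 1.0, 1.0])
peaked_entropy = spectral_entropy([0.0, 5.0, 0.0, 0.0])
ratio = modulator_to_carrier_ratio(35.0, 70.0)
print(flat_entropy, peaked_entropy, ratio)
```

A flat spectrum gives the maximum entropy for its number of bins, while a spectrum concentrated in one bin gives zero, so under this assumed definition the measure separates noise-like modulators from nearly periodic ones.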


Animals ◽  
2021 ◽  
Vol 11 (8) ◽  
pp. 2156
Author(s):  
Jo White ◽  
Ruth Sims

This paper explores the potential for interventions to develop pro-animal welfare habitual behaviours (PAWHBs) in people to improve the lives of animals. Human behavioural research indicates that opportunities exist to deliver lasting change through developing positive habitual behaviours. The routine nature of many equine care and management practices lends itself to habit formation and maintenance. This proof-of-concept paper aims to evaluate a theory-based intervention of developing and maintaining a PAWHB in people caring for equines. Qualitative research methods were used. A 30 day PAWHB intervention (PAWHBInt) of providing enrichment to an equine by scratching them in a consistent context linked to an existing routine behaviour was undertaken. Participants (n = 9) then engaged in semi-structured interviews that were analysed using thematic analysis, where the participants self-reported the outcomes they observed during the intervention. The study findings suggest that the PAWHBInt had a positive impact on human behaviour and habit formation. The research helps to address the dearth of evidence regarding the application of habit theory to equine welfare interventions and emphasises linking a desired new behaviour to an existing routine behaviour when developing PAWHBs. The research also highlights the role of mutual benefit for human and equine, and emotion in providing feedback and potential reward, supporting the link to the cue-routine-reward principle of habit theory.


2012 ◽  
Vol 2012 ◽  
pp. 1-11 ◽  
Author(s):  
Xinyan Gao ◽  
Yongfa Qiao ◽  
Baohui Jia ◽  
Xianghong Jing ◽  
Bin Cheng ◽  
...  

Previous studies have demonstrated the efficacy of electroacupuncture at ST36 for patients with gastrointestinal motility disorders. While several lines of evidence suggest that the effect may involve vagal reflex, the precise molecular mechanism underlying this process remains unclear. Here we report that the intragastric pressure increase induced by low frequency electric stimulation at ST36 was blocked by AP-5, an antagonist of N-methyl-D-aspartate receptors (NMDARs). Indeed, stimulating ST36 enhanced NMDAR-mediated, but not 2-amino-3-(5-methyl-3-oxo-1,2-oxazol-4-yl)propanoic-acid-(AMPA-) receptor-(AMPAR-) mediated synaptic transmission in gastric-projecting neurons of the dorsal motor nucleus of the vagus (DMV). We also identified that suppression of presynaptic μ-opioid receptors may contribute to upregulation of NMDAR-mediated synaptic transmission induced by electroacupuncture at ST36. Furthermore, we determined that the glutamate-receptor-2a-(NR2A-) containing NMDARs are essential for NMDAR-mediated enhancement of gastric motility caused by stimulating ST36. Taken together, our results reveal an important role of NMDA receptors in mediating enhancement of gastric motility induced by stimulating ST36.


Neuron ◽  
2007 ◽  
Vol 53 (1) ◽  
pp. 135-145 ◽  
Author(s):  
Angela D. Friederici ◽  
D. Yves von Cramon ◽  
Sonja A. Kotz
