Perception of Mixture of Musical Instruments with Spectral Overlap Removed

2012 ◽  
Vol 37 (3) ◽  
pp. 355-363 ◽  
Author(s):  
Piotr Kleczkowski

Abstract The issue of auditory segregation of simultaneous sound sources has been addressed in speech research but was given less attention in musical acoustics. In perception of concurrent speech, or speech with noise, the operation of time-frequency masking was often used as a research tool. In this work, an ex- tension of time-frequency masking, leading to the removal of spectro-temporal overlap between sound sources, was applied to musical instruments playing together. The perception of the original mixture was compared with the perception of the same mixture with all spectral overlap electronically removed. Ex- periments differed in the method of listening (headphones or a loudspeaker), sets of instruments mixed, and populations of participants. The main findings were: (i) in one of the experimental conditions the removal of spectro-temporal overlap was imperceptible, (ii) perception of the effect increased when removal of spectro-temporal overlap was performed in larger time-frequency regions rather than in small ones, (iii) perception of the effect decreased in loudspeaker listening. The results support both the multiple looks hypothesis and the “glimpsing” hypothesis known from speech perception.

2009 ◽  
Vol 125 (6) ◽  
pp. 4006-4022 ◽  
Author(s):  
Douglas S. Brungart ◽  
Peter S. Chang ◽  
Brian D. Simpson ◽  
DeLiang Wang

2010 ◽  
pp. 447-473
Author(s):  
Pedro Gómez-Vilda ◽  
José Manuel Ferrández-Vicente ◽  
Victoria Rodellar-Biarge ◽  
Rafael Martínez-Olalla ◽  
Víctor Nieto-Lluis ◽  
...  

Current trends in the search for improvements in well-established technologies imitating human abilities, as speech perception, try to find inspiration in the explanation of certain capabilities hidden in the natural system which are not yet well understood. A typical case is that of speech recognition, where the semantic gap going from spectral time-frequency representations to the symbolic translation into phonemes and words, and the construction of morpho-syntactic and semantic structures find many hidden phenomena not well understood yet. The present chapter is intended to explore some of these facts at a simplifying level under two points of view: that of top-down analysis provided from speech perception, and the symmetric from bottom-up synthesis provided by the biological architecture of auditory pathways. An application-driven design of a Neuromorphic Speech Processing Architecture is presented and its performance analyzed. Simulation details provided by a parallel implementation of the architecture in a supercomputer will be also shown and discussed.


1998 ◽  
Vol 21 (2) ◽  
pp. 275-275 ◽  
Author(s):  
Dominic W. Massaro

Sussman et al. describe an ecological property of the speech signal that is putatively functional in perception. An important issue, however, is whether their putative cue is an emerging feature or whether the second formant (F2) onset and the F2 vowel actually provide independent cues to perceptual categorization. Regardless of the outcome of this issue, an important goal of speech research is to understand how multiple cues are evaluated and integrated to achieve categorization.


SIAM Review ◽  
2002 ◽  
Vol 44 (3) ◽  
pp. 457-476 ◽  
Author(s):  
Jeremy F. Alm ◽  
James S. Walker

1998 ◽  
Vol 53 (1-3) ◽  
pp. 1-13 ◽  
Author(s):  
F. Poisson ◽  
J.C. Valiere ◽  
P. Herzog

2016 ◽  
Vol 1 (1) ◽  
Author(s):  
Meifal Rusli

<p class="TTPParagraphothers"><em>The paper discusses means to predict sound source position emitted by fault machine components based on a single microphone moving in a linear track with constant speed.</em> The position of sound source that consists of some frequency spectrum is detected by time-frequency distribution of the sound signal through Short Time Fourier Transform (STFT) and Continues Wavelet Transform (CWT). <em>As the amplitude of sound pressure increases when the microphone moves closer, the source position and frequency are predicted from the peaks of time-frequency contour map</em><em>. </em>Firstly, numerical simulation is conducted using two sound sources that generate four different frequencies of sound. The second case is experimental analysis using rotating machine being monitored with unbalanced, misalignment and bearing defect. The result shows that application of both STFT and CWT are able to detect multiple sound sources position with multiple frequency peaks caused by machine fault. The STFT can indicate the frequency very clearly, but not for the peak position. On the other hand, the CWT is able to predict the position of sound at low frequency very clearly. However, it is failed to detect the exact frequency because of overlapping.</p>


2020 ◽  
Vol 43 (2-3) ◽  
pp. 31-47 ◽  
Author(s):  
Stefan Bilbao ◽  
James Perry ◽  
Paul Graham ◽  
Alan Gray ◽  
Kostas Kavoussanakis ◽  
...  

Sound synthesis using physical modeling, emulating systems of a complexity approaching and even exceeding that of real-world acoustic musical instruments, is becoming possible, thanks to recent theoretical developments in musical acoustics and algorithm design. Severe practical difficulties remain, both at the level of the raw computational resources required, and at the level of user control. An approach to the first difficulty is through the use of large-scale parallelization, and results for a variety of physical modeling systems are presented here. Any progress with regard to the second difficulty requires, necessarily, the experience and advice of professional musicians. A basic interface to a parallelized large-scale physical modeling synthesis system is presented here, accompanied by first-hand descriptions of the working methods of five composers, each of whom generated complete multichannel pieces using the system.


2006 ◽  
Vol 95 (6) ◽  
pp. 3571-3584 ◽  
Author(s):  
Matthew W. Spitzer ◽  
Terry T. Takahashi

We examined the accuracy and precision with which the barn owl ( Tyto alba) turns its head toward sound sources under conditions that evoke the precedence effect (PE) in humans. Stimuli consisted of 25-ms noise bursts emitted from two sources, separated horizontally by 40°, and temporally by 3–50 ms. At delays from 3 to 10 ms, head turns were always directed at the leading source, and were nearly as accurate and precise as turns toward single sources, indicating that the leading source dominates perception. This lead dominance is particularly remarkable, first, because on some trials, the lagging source was significantly higher in amplitude than the lead, arising from the directionality of the owl's ears, and second, because the temporal overlap of the two sounds can degrade the binaural cues with which the owl localizes sounds. With increasing delays, the influence of the lagging source became apparent as the head saccades became increasingly biased toward the lagging source. Furthermore, on some of the trials at delays ≥20 ms, the owl turned its head, first, in the direction of one source, and then the other, suggesting that it was able to resolve two separately localizable sources. At all delays <50 ms, response latencies were longer for paired sources than for single sources. With the possible exception of response latency, these findings demonstrate that the owl exhibits precedence phenomena in sound localization similar to those in humans and cats, and provide a basis for comparison with neurophysiological data.


2009 ◽  
Vol 72 (4-6) ◽  
pp. 820-830 ◽  
Author(s):  
Pedro Gómez-Vilda ◽  
José M. Ferrández-Vicente ◽  
Victoria Rodellar-Biarge ◽  
Roberto Fernández-Baíllo

Chemosensors ◽  
2020 ◽  
Vol 8 (2) ◽  
pp. 25 ◽  
Author(s):  
Ersin Demir ◽  
Hulya Silah

A novel highly selective, sensitive and simple analytical technique was recommended for the investigation of anthelmintic veterinary drug oxyclozanide based on square wave anodic stripping voltammetry (SWASV) by using a carbon paste electrode (CPE). According to the cyclic voltammetric data, the oxidation and electron transfer processes of oxyclozanide were found as irreversible and adsorption-controlled, respectively. The voltammetric anodic peak response was characterized with respect to pH, accumulation potential, accumulation time, frequency and pulse amplitude, etc. Under these optimized experimental conditions, the anodic peak density of oxyclozanide was linear to oxyclozanide concentrations in the range from 0.058 to 4.00 mg/L. The described electrochemical method was successfully carried out for the oxyclozanide in pharmaceutical formulation and tap water with mean percentage recovery of 101.5 % and 102.2 %, respectively. The results of pharmaceutical formulation studies were statistically compared to the high-performance liquid chromatographic method.


Sign in / Sign up

Export Citation Format

Share Document