psychoacoustic model Latest Research Papers

The recent development and deployment of Wireless Acoustic Sensor Networks (WASN) present new ways to address urban acoustic challenges in a smart city context. A focus on improving quality of life forms the core of smart-city design paradigms and cannot be limited to simply measuring objective environmental factors, but should also consider the perceptual, psychological and health impacts on citizens. This study therefore makes use of short (1–2.7 s) recordings sourced from a WASN in Milan which were grouped into various environmental sound source types and given an annoyance rating via an online survey with N=100 participants. A multilevel psychoacoustic model was found to achieve an overall R2=0.64 which incorporates Sharpness as a fixed effect regardless of the sound source type and Roughness, Impulsiveness and Tonality as random effects whose coefficients vary depending on the sound source. These results present a promising step toward implementing an on-sensor annoyance model which incorporates psychoacoustic features and sound source type, and is ultimately not dependent on sound level.

Download Full-text

Audio Information Hiding in Sub-signals by deploying Singular Spectrum Analysis and Psychoacoustic Model

2021 18th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON) ◽

10.1109/ecti-con51831.2021.9454804 ◽

2021 ◽

Author(s):

Phondanai Khanti ◽

Ekachai Phaisangittisagul ◽

Takahiro Shinozaki ◽

Jessada Kamjana

Keyword(s):

Spectrum Analysis ◽

Information Hiding ◽

Singular Spectrum Analysis ◽

Psychoacoustic Model ◽

Singular Spectrum ◽

Audio Information

Download Full-text

Validating a psychoacoustic model of voice quality

The Journal of the Acoustical Society of America ◽

10.1121/10.0003331 ◽

2021 ◽

Vol 149 (1) ◽

pp. 457-465

Author(s):

Jody Kreiman ◽

Yoonjeong Lee ◽

Marc Garellek ◽

Robin Samlan ◽

Bruce R. Gerratt

Keyword(s):

Voice Quality ◽

Psychoacoustic Model

Download Full-text

Auditory Alarms Design Tool: Spectral Masking Estimation Based on a Psychoacoustic Model

Springer Series in Design and Innovation - Advances in Design, Music and Arts ◽

10.1007/978-3-030-55700-3_43 ◽

2020 ◽

pp. 621-639

Author(s):

Frederico Pereira ◽

Rui Marques ◽

Joana Vieria

Keyword(s):

Design Tool ◽

Psychoacoustic Model ◽

Spectral Masking

Download Full-text

Forensic Investigation of MP3 Audio Recordings

Theory and Practice of Forensic Science ◽

10.30764//1819-2785-2019-14-4-125-136 ◽

2020 ◽

Vol 14 (4) ◽

pp. 125-136

Author(s):

A. G. Boyarov ◽

I. S. Siparov

Keyword(s):

Group Identification ◽

Detection Methods ◽

Signal Spectrum ◽

Spectral Amplitude ◽

Forensic Investigation ◽

Psychoacoustic Model ◽

Audio Recordings ◽

Spectral Components ◽

Exact Position ◽

Audio Information

Special aspects of MP3-recordings technical investigation are addressed. The following features of formation and research of MP3 phonograms are explained: traces of MP3 coding in time and spectral domain, special aspects of MP3-files structure analysis, detection methods of re-coding of MP3-recordings, methods of group identification of MP3-recorders and MP3-codecs.MP3 coding leaves certain traces of its usage. Due to the psychoacoustic model inaudible spectral components are deleted from the signal spectrum. Traces of psychoacoustic codecs usage are also clearly seen via dynamic spectrogram as rectangular areas of zero spectral amplitude. The methods discussed in this paper enable the investigating expert to detect the exact position of the MP3 frame in the signal by its properties even without any information from the file header. This method reveals the coding itself, multiple coding and also audio editing by the investigation of the periodicity of the extracted frames’ positions.MP3 file format specifies the structure of the frame header providing a perfect instrument to detect any periodicity of any peculiarities of MP3 frames. The tool based on this approach reveals MP3 frames disorder caused by editing in the “digital” domain – manual deletion of audio information using HEX editor.

Download Full-text

New Speech Compression Technique based on Filter Bank Design and Psychoacoustic Model

10.20855/ijav.2019.24.41455 ◽

2019 ◽

Vol 24 (4) ◽

pp. 728-735

Author(s):

Mourad Talbi ◽

Med Salim Bouhlel

Keyword(s):

Compression Ratio ◽

Filter Bank ◽

Signal To Noise Ratio ◽

Speech Signals ◽

Speech Compression ◽

Signal To Noise ◽

Compression Technique ◽

Psychoacoustic Model ◽

Perceptual Evaluation ◽

Noise Ratio

In this paper, a new speech compression technique is proposed. This technique applies a Psychoacoustic Model and a general approach for Filter Bank Design using optimization. It is evaluated and compared with a compression technique using a MDCT (Modified Discrete Cosine Transform) Filter Bank of 32 Filters and a Psychoacoustic Model. This evaluation and comparison is performed by calculating bits before and after compression, PSNR (Peak Signal to Noise Ratio), NRMSE (Normalized Root Mean Square Error), SNR (Signal to Noise Ratio) and PESQ (Perceptual evaluation of speech quality) computations. The two techniques are tested and applied to a number of speech signals that are sampled at 8 kHz. The results obtained from this evaluation show that the proposed technique outperforms the second compression technique (based on a Psychoacoustic Model and MDCT filter Bank) in terms of Bits after compression and compression ratio. In fact, the proposed technique yields higher values for the compression ratio than the second compression technique. Moreover, the proposed compression technique presents reconstructed speech signals with acceptable perceptual qualities. This is justified by the values of SNR, PSNR and NRMSE and PESQ.

Download Full-text