Low complexity decomposition for the characteristic waveform of speech signal

2021 ◽

Vol 263 (2) ◽

pp. 4570-4580

Author(s):

Liu Ting ◽

Luo Xinwei

Keyword(s):

Neural Network ◽

Speech Signal ◽

Signal To Noise Ratio ◽

Threshold Value ◽

Low Complexity ◽

Noise Signal ◽

Sequential Decision ◽

Signal To Noise ◽

Speech Segment ◽

Double Threshold

The recognition accuracy of speech signal and noise signal is greatly affected under low signal-to-noise ratio. The neural network with parameters obtained from the training set can achieve good results in the existing data, but is poor for the samples with different the environmental noises. This method firstly extracts the features based on the physical characteristics of the speech signal, which have good robustness. It takes the 3-second data as samples, judges whether there is speech component in the data under low signal-to-noise ratios, and gives a decision tag for the data. If a reasonable trajectory which is like the trajectory of speech is found, it is judged that there is a speech segment in the 3-second data. Then, the dynamic double threshold processing is used for preliminary detection, and then the global double threshold value is obtained by K-means clustering. Finally, the detection results are obtained by sequential decision. This method has the advantages of low complexity, strong robustness, and adaptability to multi-national languages. The experimental results show that the performance of the method is better than that of traditional methods under various signal-to-noise ratios, and it has good adaptability to multi language.

Download Full-text

Perceptual Learning of Vocoded Speech With and Without Contralateral Hearing: Implications for Cochlear Implant Rehabilitation

Journal of Speech Language and Hearing Research ◽

10.1044/2020_jslhr-20-00385 ◽

2020 ◽

pp. 1-10

Author(s):

Martin Chavant ◽

Alexis Hervais-Adelman ◽

Olivier Macherey

Keyword(s):

Cochlear Implant ◽

Perceptual Learning ◽

Speech Signal ◽

Training Phase ◽

Monosyllabic Words ◽

Low Pass ◽

Contralateral Ear ◽

Number Of Individuals ◽

Insight Into ◽

Vocoded Speech

Purpose An increasing number of individuals with residual or even normal contralateral hearing are being considered for cochlear implantation. It remains unknown whether the presence of contralateral hearing is beneficial or detrimental to their perceptual learning of cochlear implant (CI)–processed speech. The aim of this experiment was to provide a first insight into this question using acoustic simulations of CI processing. Method Sixty normal-hearing listeners took part in an auditory perceptual learning experiment. Each subject was randomly assigned to one of three groups of 20 referred to as NORMAL, LOWPASS, and NOTHING. The experiment consisted of two test phases separated by a training phase. In the test phases, all subjects were tested on recognition of monosyllabic words passed through a six-channel “PSHC” vocoder presented to a single ear. In the training phase, which consisted of listening to a 25-min audio book, all subjects were also presented with the same vocoded speech in one ear but the signal they received in their other ear differed across groups. The NORMAL group was presented with the unprocessed speech signal, the LOWPASS group with a low-pass filtered version of the speech signal, and the NOTHING group with no sound at all. Results The improvement in speech scores following training was significantly smaller for the NORMAL than for the LOWPASS and NOTHING groups. Conclusions This study suggests that the presentation of normal speech in the contralateral ear reduces or slows down perceptual learning of vocoded speech but that an unintelligible low-pass filtered contralateral signal does not have this effect. Potential implications for the rehabilitation of CI patients with partial or full contralateral hearing are discussed.

Download Full-text

“I Can See What You’re Saying”: Clinical Utility of Spectral Moment Analysis

Perspectives on Speech Science and Orofacial Disorders ◽

10.1044/ssod21.2.44 ◽

2011 ◽

Vol 21 (2) ◽

pp. 44-54

Author(s):

Kerry Callahan Mandulak

Keyword(s):

Speech Production ◽

Speech Signal ◽

Clinical Utility ◽

Acoustic Analysis ◽

Moment Analysis ◽

Analysis Tool ◽

Spectral Moment ◽

Clinical Measure ◽

Perceptual Analysis ◽

Disordered Speech

Spectral moment analysis (SMA) is an acoustic analysis tool that shows promise for enhancing our understanding of normal and disordered speech production. It can augment auditory-perceptual analysis used to investigate differences across speakers and groups and can provide unique information regarding specific aspects of the speech signal. The purpose of this paper is to illustrate the utility of SMA as a clinical measure for both clinical speech production assessment and research applications documenting speech outcome measurements. Although acoustic analysis has become more readily available and accessible, clinicians need training with, and exposure to, acoustic analysis methods in order to integrate them into traditional methods used to assess speech production.

Download Full-text

Low-complexity synthesis of incompletely specified multiple-output mod-2 sums

IEE Proceedings E Computers and Digital Techniques ◽

10.1049/ip-e.1992.0051 ◽

1992 ◽

Vol 139 (4) ◽

pp. 355 ◽

Cited By ~ 6

Author(s):

M.W. Riege ◽

Ph.W. Besslich

Keyword(s):

Low Complexity

Download Full-text

Robust low-complexity backward adaptive pitch predictor for low-delay speech coding

IEE Proceedings I Communications Speech and Vision ◽

10.1049/ip-i-2.1991.0044 ◽

1991 ◽

Vol 138 (4) ◽

pp. 338 ◽

Cited By ~ 6

Author(s):

V. Cuperman ◽

R. Pettigrew

Keyword(s):

Speech Coding ◽

Low Complexity ◽

Low Delay ◽

Delay Speech

Download Full-text

Method for measuring distortions of a speech signal during its transmission over a communication channel to a biometric identification system

Izmeritel`naya Tekhnika ◽

10.32446/0368-1025it.2020-11-65-72 ◽

2020 ◽

pp. 65-72

Author(s):

V. V. Savchenko ◽

A. V. Savchenko

Keyword(s):

Speech Signal ◽

Communication Channel ◽

Speaker Identification ◽

A Priori ◽

Small Samples ◽

Practical Implementation ◽

Identification System ◽

New Information ◽

Discrimination Criterion ◽

A Priori Uncertainty

This paper is devoted to the presence of distortions in a speech signal transmitted over a communication channel to a biometric system during voice-based remote identification. We propose to preliminary correct the frequency spectrum of the received signal based on the pre-distortion principle. Taking into account a priori uncertainty, a new information indicator of speech signal distortions and a method for measuring it in conditions of small samples of observations are proposed. An example of fast practical implementation of the method based on a parametric spectral analysis algorithm is considered. Experimental results of our approach are provided for three different versions of communication channel. It is shown that the usage of the proposed method makes it possible to transform the initially distorted speech signal into compliance on the registered voice template by using acceptable information discrimination criterion. It is demonstrated that our approach may be used in existing biometric systems and technologies of speaker identification.

Download Full-text