Method and apparatus to compensate for fundamental frequency changes and artifacts and reduce sensitivity to pitch information in a frame-based speech processing system

2010 ◽  
Vol 127 (6) ◽  
pp. 3870
Author(s):  
Jiri Navratil
1984 ◽  
Vol 27 (2) ◽  
pp. 311-317 ◽  
Author(s):  
B. J. Guillemi ◽  
D. T. Nguyen

Durational measurements of frication, aspiration, prevoicing, and voice onset are often difficult to perform from the spectrogram, and the resolution is limited to about 5 ms. In many instances, a higher resolution can be obtained from a study of waveforms than from a study of spectrum. We present a microprocessor-based speech acquisition and processing system which uses waveform analysis techniques to extract measurements from the acoustic signal. The system is low cost and portable; it operates in "real time" and employs noninvasive data-capturing techniques. The usefulness of the system is demonstrated in the VOT measurement of CV clusters and in the measurement of fundamental frequency.


2021 ◽  
pp. 2150022
Author(s):  
Caio Cesar Enside de Abreu ◽  
Marco Aparecido Queiroz Duarte ◽  
Bruno Rodrigues de Oliveira ◽  
Jozue Vieira Filho ◽  
Francisco Villarreal

Speech processing systems are very important in different applications involving speech and voice quality such as automatic speech recognition, forensic phonetics and speech enhancement, among others. In most of them, the acoustic environmental noise is added to the original signal, decreasing the signal-to-noise ratio (SNR) and the speech quality by consequence. Therefore, estimating noise is one of the most important steps in speech processing whether to reduce it before processing or to design robust algorithms. In this paper, a new approach to estimate noise from speech signals is presented and its effectiveness is tested in the speech enhancement context. For this purpose, partial least squares (PLS) regression is used to model the acoustic environment (AE) and a Wiener filter based on a priori SNR estimation is implemented to evaluate the proposed approach. Six noise types are used to create seven acoustically modeled noises. The basic idea is to consider the AE model to identify the noise type and estimate its power to be used in a speech processing system. Speech signals processed using the proposed method and classical noise estimators are evaluated through objective measures. Results show that the proposed method produces better speech quality than state-of-the-art noise estimators, enabling it to be used in real-time applications in the field of robotic, telecommunications and acoustic analysis.


Author(s):  
Margaret M. Kehoe ◽  
Emilie Cretton

Purpose This study examines intraword variability in 40 typically developing French-speaking monolingual and bilingual children, aged 2;6–4;8 (years;months). Specifically, it measures rate of intraword variability and investigates which factors best account for it. They include child-specific ones such as age, expressive vocabulary, gender, bilingual status, and speech sound production ability, and word-specific factors, such as phonological complexity (including number of syllables), phonological neighborhood density (PND), and word frequency. Method A variability test was developed, consisting of 25 words, which differed in terms of phonological complexity, PND, and word frequency. Children produced three exemplars of each word during a single session, and productions of words were coded as variable or not variable. In addition, children were administered an expressive vocabulary test and two tests tapping speech motor ability (oral motor assessment and diadochokinetic test). Speech sound ability was also assessed by measuring percent consonants correct on all words produced by the children during the session. Data were entered into a binomial logistic regression. Results Average intraword variability was 29% across all children. Several factors were found to predict intraword variability including age, gender, bilingual status, speech sound production ability, phonological complexity, and PND. Conclusions Intraword variability was found to be lower in French than what has been reported in English, consistent with phonological differences between French and English. Our findings support those of other investigators in indicating that the factors influencing intraword variability are multiple and reflect sources at various levels in the speech processing system.


2018 ◽  
Vol 8 (5) ◽  
pp. 27
Author(s):  
Abdul Abbasi ◽  
Mansoor Channa ◽  
Masood Memon ◽  
Stephen John ◽  
Irtaza Ahmed ◽  
...  

The purpose of this investigation was to document acoustic characteristics of Pakistani English (PaKE) vowel sounds. The experiment was designed to examine the properties of ten vowels produced by Pakistani ESL learners. The analysis is based on the voice samples of recorded 50 CVC words. Total 5000 (10  10  50=5000) voiced samples were analyzed. The data consisted of 50 words of ten English vowel sounds [i: ɪ e ɔ: æ ə ɑ: u: ɒ ʊ]. Ten ESL speakers recorded their voice samples on Praat speech processing tool installed on laptop. Three parameters were considered i.e., fundamental frequency (F0), vowel quality (F1-F2) and duration. Formant patterns were judged manually by visual inspection on Praat Speech Processing Tool. Analysis of formant frequency shows numerous differences between male and female of F1 and F2, fundamental frequency and duration of English vowels. The voice samples provide evidence for higher and lower frequency of vowel sounds. Additionally, the data analysis illustrates that there were statistical differences in the values of short and long vowels coupled with vowel space plot showing explicit differences in locating the production of vowels of male & female vowel space acoustic realizations.


Sign in / Sign up

Export Citation Format

Share Document