Fundamental Frequency Extraction of Noisy Speech Signals

In this paper, we propose a correlation-based method for accurate fundamental frequency extraction in which the autocorrelation function is weighted by the reciprocal of the YIN function. Both the autocorrelation function and YIN are popular time-domain measures for estimating fundamental frequency. Instead of the original signal, the proposed method uses its center-clipped version to obtain the autocorrelation function, which is then weighted by the reciprocal of YIN for fundamental frequency detection. Comparative results on female and male voices in white and exhibition noise show that the proposed method detects fundamental frequency more accurately, in terms of gross pitch errors, than other related methods.
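
The weighting idea in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several details are assumptions that the abstract does not state: the YIN term is taken to be the squared-difference function computed on the original signal, the clipping threshold is 30% of the peak amplitude, `eps` is a small constant that prevents division by zero, and the search range is 60 to 400 Hz. All function names are hypothetical.

```python
import math

def center_clip(x, ratio=0.3):
    """Zero samples within +/- ratio*peak and shift the rest toward zero (assumed threshold)."""
    c = ratio * max(abs(v) for v in x)
    return [v - c if v > c else v + c if v < -c else 0.0 for v in x]

def acf(x, lag):
    """Autocorrelation of x at the given lag (unnormalized)."""
    return sum(x[j] * x[j + lag] for j in range(len(x) - lag))

def yin_difference(x, lag):
    """YIN-style squared-difference function at the given lag."""
    return sum((x[j] - x[j + lag]) ** 2 for j in range(len(x) - lag))

def estimate_f0(x, fs, fmin=60.0, fmax=400.0, eps=1e-3):
    """Pick the lag that maximizes ACF(clipped signal) / (difference function + eps)."""
    clipped = center_clip(x)
    lo = max(1, int(fs / fmax))
    hi = min(int(fs / fmin), len(x) - 1)

    def weighted(lag):
        # Assumption: the difference function is computed on the unclipped signal.
        return acf(clipped, lag) / (yin_difference(x, lag) + eps)

    best_lag = max(range(lo, hi + 1), key=weighted)
    return fs / best_lag
```

At the true period the difference function is close to zero while the autocorrelation peaks, so the ratio sharpens the peak that a plain autocorrelation would show.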

Auto-Associative Initialization of LSTM Neural Networks for Fundamental Frequency Detection in Noisy Speech Signals

2018 Seventeenth Mexican International Conference on Artificial Intelligence (MICAI) ◽

10.1109/micai46078.2018.00011 ◽

2018 ◽

Author(s):

Marvin Coto-Jimenez

Keyword(s):

Neural Networks ◽

Fundamental Frequency ◽

Speech Signals ◽

Noisy Speech ◽

Frequency Detection

Pitch Detection Method for Noisy Speech Signals Based on Wavelet Transform and Autocorrelation Function

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing ◽

10.1109/iih-msp.2013.47 ◽

2013 ◽

Cited By ~ 1

Author(s):

Li Ru-Wei ◽

Cao Long-Tao ◽

Li Yang

Keyword(s):

Wavelet Transform ◽

Autocorrelation Function ◽

Detection Method ◽

Speech Signals ◽

Pitch Detection ◽

Noisy Speech

On the Estimation of Fundamental Frequency From Nonstationary Noisy Speech Signals Based on the Hilbert–Huang Transform

IEEE Signal Processing Letters ◽

10.1109/lsp.2017.2782267 ◽

2018 ◽

Vol 25 (2) ◽

pp. 248-252 ◽

Cited By ~ 1

Author(s):

L. Zao ◽

R. Coelho

Keyword(s):

Fundamental Frequency ◽

Speech Signals ◽

Noisy Speech ◽

Hilbert Huang Transform

Estimation and tracking of pitch for noisy speech signals using EMD based autocorrelation function algorithm

2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT) ◽

10.1109/rteict.2017.8256964 ◽

2017 ◽

Author(s):

K Pratibha ◽

H M Chandrashekar

Keyword(s):

Autocorrelation Function ◽

Speech Signals ◽

Noisy Speech

Estimation of Fundamental Frequency of Noisy Speech Signals using Correlogram based on Subband Filtering

2019 IEEE 6th International Conference on Engineering Technologies and Applied Sciences (ICETAS) ◽

10.1109/icetas48360.2019.9117333 ◽

2019 ◽

Author(s):

Ashishkumar Gudmalwar ◽

Anirban Dutta ◽

V. Rama Rao

Keyword(s):

Fundamental Frequency ◽

Speech Signals ◽

Noisy Speech

Harmonic Differences Method for Robust Fundamental Frequency Detection in Wideband and Narrowband Speech Signals

Mathematical Problems in Engineering ◽

10.1155/2021/6658951 ◽

2021 ◽

Vol 2021 ◽

pp. 1-17

Author(s):

Cevahir Parlak ◽

Yusuf Altun

Keyword(s):

Neural Networks ◽

Fundamental Frequency ◽

Convolutional Neural Networks ◽

Speech Signal ◽

Speech Signals ◽

Pitch Detection ◽

Frequency Detection ◽

Temporal Smoothing ◽

Telephone Speech ◽

Musical Sounds

In this article, a novel pitch determination algorithm based on the harmonic differences method (HDM) is proposed. Most current algorithms rely on autocorrelation, cepstrum, or, more recently, convolutional neural networks, and they suffer from limitations (small datasets, wideband or narrowband, musical sounds, temporal smoothing, etc.) as well as accuracy and speed problems. Very few works exploit the spacing between the harmonics. HDM is designed for both wideband and exclusively narrowband (telephone) speech and tries to find the most frequently repeated difference between the harmonics of the speech signal. We use three vowel databases in our experiments: the Hillenbrand Vowel Database, the Texas Vowel Database, and vowels from the TIMIT corpus. We compare HDM with the autocorrelation, cepstrum, YIN, YAAPT, CREPE, and FCN algorithms. Results show that harmonic differences are a reliable and fast choice for robust pitch detection, and that HDM is superior to the other algorithms in most cases.
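
The core idea of finding the most repeated spacing between harmonics can be sketched in a few lines of Python. This is only an illustration of the principle, not the published HDM algorithm. It assumes the spectral peak frequencies have already been extracted by some other step, and the function name and the 5 Hz histogram bin width are hypothetical choices.

```python
from collections import Counter

def most_repeated_difference(peaks_hz, bin_hz=5.0):
    """Return the mean of the most common spacing between adjacent spectral peaks."""
    peaks = sorted(peaks_hz)
    if len(peaks) < 2:
        raise ValueError("need at least two peaks to form a difference")
    diffs = [b - a for a, b in zip(peaks, peaks[1:])]
    # Quantize the spacings so nearly equal differences fall in the same bin.
    bins = Counter(round(d / bin_hz) for d in diffs)
    best_bin, _ = bins.most_common(1)[0]
    members = [d for d in diffs if round(d / bin_hz) == best_bin]
    return sum(members) / len(members)
```

Because only the spacing between peaks is used, the estimate does not require the fundamental itself to be present, which is the situation in narrowband telephone speech where the lowest harmonics are filtered out.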

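PENGUCAPAN MAKHRAJ DARI UNIT BUNYI TERKECIL HURUF HIJAIYAH BERDASARKAN FREKUENSI DASAR DAN FREKUENSI FORMANT UNTUK MEDIA PEMBELAJARAN MEMBACA ALQURAN [Makhraj pronunciation of the smallest sound units of the hijaiyah letters based on fundamental frequency and formant frequency, for Quran reading learning media]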

ALQALAM ◽

10.32678/alqalam.v32i2.552 ◽

2015 ◽

Vol 32 (2) ◽

pp. 284

Author(s):

Muhammad Subali ◽

Miftah Andriansyah ◽

Christanto Sinambela

Keyword(s):

College Student ◽

Fundamental Frequency ◽

Autocorrelation Function ◽

Adult Male ◽

Formant Frequency ◽

Formant Frequencies ◽

Male Speaker ◽

Similarities And Differences

This article examines the similarities and differences in fundamental frequency and formant frequencies, obtained using the autocorrelation function and the LPC function in a MATLAB 2012b GUI, for the sounds of hijaiyah letters produced with makhraj pronunciation by beginner and expert adult male speakers. The two groups of speakers are also compared by the matching distance of their sounds, computed using the DTW method on the cepstrum. The subjects for beginner makhraj pronunciation are college students from Universitas Gunadarma and SITC, aged 22 years. Their speech was recorded using a MATLAB algorithm in the GUI. The subjects for expert makhraj pronunciation are taken from previous research; they were 20-30 years old at the time the data were collected. The fundamental frequency and formant frequencies are extracted from each sound, and these values are then used to analyze the similarities and differences between beginner and expert speech and to compute the matching distance between the two. The results show that beginner and expert speech have different values of fundamental frequency and formant frequency for all sounds. The DTW matching distance between beginner and expert speech ranges from 28.9746 to 136.4. Keywords: fundamental frequency, formant frequency, hijaiyah letters, makhraj
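
The matching distance mentioned in this abstract is a dynamic time warping (DTW) cost between two feature sequences. The sketch below shows the classic DTW recursion in Python. The abstract does not state the local distance or the step pattern, so Euclidean distance between frames and the standard three-way step are assumptions, and the function name is hypothetical. Each input is a sequence of equal-length feature vectors, for example cepstral frames.

```python
import math

def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of feature vectors."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] is the best accumulated cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])  # Euclidean frame distance (assumed)
            cost[i][j] = local + min(cost[i - 1][j],      # a advances, b frame repeats
                                     cost[i][j - 1],      # b advances, a frame repeats
                                     cost[i - 1][j - 1])  # both advance
    return cost[n][m]
```

Two utterances that differ only in speaking rate align with a small cost, so the distance mainly reflects differences in the spectral content of the frames.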

High-Accuracy Fundamental Frequency Detection in Power System

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.383-390.4962 ◽

2011 ◽

Vol 383-390 ◽

pp. 4962-4966

Author(s):

Ling Li ◽

Guo Bin Jin ◽

Shao Ping Huang ◽

Xiao Peng

Keyword(s):

Power System ◽

Fundamental Frequency ◽

Frequency Measurement ◽

Least Square ◽

Signal Model ◽

Least Square Estimation ◽

Signal Process ◽

Frequency Detection ◽

Novel Method ◽

Subspace Estimation

A novel frequency measurement method based on an improved TLS-ESPRIT (total least squares estimation of signal parameters via rotational invariance techniques) is proposed in this paper for fundamental frequency measurement in power systems. TLS-ESPRIT belongs to the family of subspace estimation methods in modern signal processing. Because noise is included in the signal model, the method is insensitive to noise. However, identical multiple poles cannot be obtained when TLS-ESPRIT is applied to a noisy signal. Multiple-pole restoration is therefore presented to recover the true poles accurately. Simulation results show that the fundamental frequency is detected accurately in the presence of harmonics, interharmonics, noise, and frequency fluctuations, with better noise immunity and, in particular, better adaptability to signals whose amplitude varies with time.

The Time-Domain Compensation Contour Integral Method (TD-C2IM) - A New Approach to the Analysis of Irregularly-Shaped Parallel-Plane Circuits

2018 IEEE Symposium on Electromagnetic Compatibility, Signal Integrity and Power Integrity (EMC, SI & PI) ◽

10.1109/emcsi.2018.8495443 ◽

2018 ◽

Author(s):

Martin Stumpf

Keyword(s):

Time Domain ◽

Integral Method ◽

Parallel Plane ◽

Contour Integral ◽

New Approach ◽

The Time Domain

Comparison of fundamental frequency and formants frequency measurements in two speech tasks

Revista CEFAC ◽

10.1590/1982-0216/201921612819 ◽

2019 ◽

Vol 21 (6) ◽

Author(s):

Flávia Viegas ◽

Danieli Viegas ◽

Glaucio Serra Guimarães ◽

Margareth Maria Gomes de Souza ◽

Ronir Raggio Luiz ◽

...

Keyword(s):

Fundamental Frequency ◽

Effect Size ◽

Oral Communication ◽

Speech Disorders ◽

Brazilian Portuguese ◽

T Test ◽

Speech Signals ◽

Frequency Measurements ◽

Age Range ◽

Voice And Speech

ABSTRACT Purpose: to compare the measurements of fundamental frequency (F0) and the frequencies of the first two formants (F1 and F2) of the seven oral vowels of Brazilian Portuguese in two speech tasks, in adults without voice and speech disorders. Methods: eighty participants aged between 18 and 40 years, paired by gender, were selected after orofacial, orthodontic and auditory-perceptual assessments of voice and speech. The speech signals were obtained from carrier phrases and sustained vowels, and the values of F0, F1 and F2 were estimated. The differences were verified through the t-test, and the effect size was calculated. Results: differences were found in the F0 measurements between the two speech tasks in two vowels in males and in five vowels in females. In the F1 frequencies, differences were noted in six vowels in men and in two in women. In the F2 frequencies, there was a difference in four vowels in men and in three in women. Conclusion: based on the differences found, it is concluded that the speech task used to evaluate fundamental frequency and formant frequencies in Brazilian Portuguese can yield distinct results in both glottal and supraglottal measures in the production of different oral vowels of this language. Thus, it is suggested that clinicians and researchers consider both forms of emission for a more accurate interpretation of the implications of these data in the evaluation of oral communication and in therapeutic approaches.
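
The statistical comparison described in the Methods can be sketched as follows. The abstract does not say which form of the t-test or which effect size measure was used. This sketch assumes a paired design, because the same participants produced both speech tasks, and reports Cohen's d for paired samples (mean difference divided by the standard deviation of the differences). The function name is hypothetical.

```python
import math
import statistics

def paired_t_and_effect_size(task_a, task_b):
    """Return (t statistic, Cohen's d_z) for paired measurements from two tasks."""
    if len(task_a) != len(task_b) or len(task_a) < 2:
        raise ValueError("need two equal-length samples with at least two pairs")
    diffs = [a - b for a, b in zip(task_a, task_b)]
    mean_d = statistics.mean(diffs)
    sd_d = statistics.stdev(diffs)  # sample standard deviation (n - 1)
    t = mean_d / (sd_d / math.sqrt(len(diffs)))
    d_z = mean_d / sd_d  # effect size under the paired-design assumption
    return t, d_z
```

The t statistic is compared against a t distribution with n - 1 degrees of freedom, while the effect size indicates how large the task difference is relative to its variability across participants.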
