Fundamental frequency modeling using wavelets for emotional voice conversion

Voice Conversion Using Pitch Shifting Algorithm by Time Stretching with PSOLA and Re-SamplingVoice changing has many applications in the industry and commercial filed. This paper emphasizes voice conversion using a pitch shifting method which depends on detecting the pitch of the signal (fundamental frequency) using Simplified Inverse Filter Tracking (SIFT) and changing it according to the target pitch period using time stretching with Pitch Synchronous Over Lap Add Algorithm (PSOLA), then resampling the signal in order to have the same play rate. The same study was performed to see the effect of voice conversion when some Arabic speech signal is considered. Treatment of certain Arabic voiced vowels and the conversion between male and female speech has shown some expansion or compression in the resulting speech. Comparison in terms of pitch shifting is presented here. Analysis was performed for a single frame and a full segmentation of speech.

Download Full-text

Method for analyzing fundamental frequency information and voice conversion method and system implementing said analysis method

The Journal of the Acoustical Society of America ◽

10.1121/1.3457088 ◽

2010 ◽

Vol 127 (6) ◽

pp. 3869

Author(s):

Taoufik En-Najjary ◽

Olivier Rosec

Keyword(s):

Fundamental Frequency ◽

Voice Conversion ◽

Analysis Method ◽

Frequency Information ◽

Conversion Method

Download Full-text

Complex Cepstrum Based Voice Conversion Using Radial Basis Function

ISRN Signal Processing ◽

10.1155/2014/357048 ◽

2014 ◽

Vol 2014 ◽

pp. 1-13 ◽

Cited By ~ 10

Author(s):

Jagannath Nirmal ◽

Suprava Patnaik ◽

Mukesh Zaveri ◽

Pramod Kachare

Keyword(s):

Radial Basis Function ◽

Fundamental Frequency ◽

Basis Function ◽

Speech Signal ◽

Vocal Tract ◽

Voice Conversion ◽

Radial Basis ◽

Complex Cepstrum ◽

Source Excitation ◽

Mel Cepstrum

The complex cepstrum vocoder is used to modify the speaker specific characteristics of the source speaker speech to that of the target speaker speech. The low time and high time liftering are used to split the calculated cepstrum into the vocal tract and the source excitation parameters. The obtained mixed phase vocal tract and source excitation parameters with finite impulse response preserve the phase properties of the resynthesized speech frame. The radial basis function is explored to capture the nonlinear mapping function for modifying the complex cepstrum based real and imaginary components of the vocal tract and source excitation of the speech signal. The state-of-the-art Mel cepstrum envelope and the fundamental frequency (F0) are considered to represent the vocal tract and the source excitation of the speech frame, respectively. Radial basis function is used to capture and formulate the nonlinear relations between the Mel cepstrum envelope of the source and target speakers. Mean and standard deviation approach is employed to modify the fundamental frequency (F0). The Mel log spectral approximation filter is used to reconstruct the speech signal from the modified Mel cepstrum envelope and fundamental frequency. A comparison of the proposed complex cepstrum based model has been made with the state-of-the-art Mel Cepstrum Envelope based voice conversion model with objective and subjective evaluations. The evaluation measures reveal that the proposed complex cepstrum based voice conversion system approximate the converted speech signal with better accuracy than the model based on the Mel cepstrum envelope based voice conversion.

Download Full-text

Speaker-independent HMM-based voice conversion using adaptive quantization of the fundamental frequency

Speech Communication ◽

10.1016/j.specom.2011.05.001 ◽

2011 ◽

Vol 53 (7) ◽

pp. 973-985 ◽

Cited By ~ 8

Author(s):

Takashi Nose ◽

Takao Kobayashi

Keyword(s):

Fundamental Frequency ◽

Voice Conversion ◽

Adaptive Quantization ◽

Speaker Independent

Download Full-text

Fundamental Frequency (F0) Fusion Transformation-Based on BLSTM for Voice Conversion

Science Discovery ◽

10.11648/j.sd.20180604.21 ◽

2018 ◽

Vol 6 (4) ◽

pp. 298

Author(s):

Miao Xiaokong

Keyword(s):

Fundamental Frequency ◽

Voice Conversion

Download Full-text

A Tape Striation Counting Method for Determining Fundamental Frequency

Language Speech and Hearing Services in Schools ◽

10.1044/0161-1461.1004.246 ◽

1979 ◽

Vol 10 (4) ◽

pp. 246-248 ◽

Cited By ~ 2

Author(s):

Peter B. Mueller ◽

Marla Adams ◽

Jean Baehr-Rouse ◽

Debbie Boos

Keyword(s):

Fundamental Frequency ◽

Counting Method ◽

Male And Female ◽

Fundamental Frequencies ◽

Counting Procedure ◽

Female Subjects

Mean fundamental frequencies of male and female subjects obtained with FLORIDA I and a tape striation counting procedure were compared. The fundamental frequencies obtained with these two methods were similar and it appears that the tape striation counting procedure is a viable, simple, and inexpensive alternative to more costly and complicated procedures and instrumentation.

Download Full-text

The Application of Laboratory Formulas to Clinical Voice Management

American Journal of Speech-Language Pathology ◽

10.1044/1058-0360.0402.62 ◽

1995 ◽

Vol 4 (2) ◽

pp. 62-69 ◽

Cited By ~ 12

Author(s):

Katherine Verdolini ◽

Ingo R. Titze

Keyword(s):

Fundamental Frequency ◽

Voice Disorders ◽

Clinical Interventions ◽

Case Examples ◽

Interactive Nature ◽

Mathematical Formulas

In this paper, we discuss the application of mathematical formulas to guide the development of clinical interventions in voice disorders. Discussion of case examples includes fundamental frequency and intensity deviations, pitch and loudness abnormalities, laryngeal hyperand hypoadduction, and phonatory effort. The paper illustrates the interactive nature of theoretical and applied work in vocology

Download Full-text