A 1.8kbps vocoder based on Mixed Excitation Linear Prediction

The paper presents a generative speech model that efficiently generates a speech waveform from its amplitude spectral envelopes. The model is based on a hybrid speech representation that includes deterministic (harmonic) and stochastic (noise) components. The main idea behind the approach is that a speech signal has a definite spectral structure that is statistically related to the distribution of deterministic and stochastic energy across the spectrum. The performance of the model is evaluated using an experimental low-bitrate wide-band speech coder. The quality of the reconstructed speech is evaluated using objective and subjective methods. Two objective quality measures were calculated: Modified Bark Spectral Distortion (MBSD) and Perceptual Evaluation of Speech Quality (PESQ). Narrow-band and wide-band versions of the proposed solution were compared with the MELP (Mixed Excitation Linear Prediction) speech coder and the AMR (Adaptive Multi-Rate) speech coder, respectively. Speech recordings from two female and two male speakers were used for testing. The tests show that the overall performance of the proposed approach is speaker-dependent and is better for male voices. This difference presumably indicates that pitch height influences separation accuracy. Used in an experimental speech compression system, the proposed approach provides decent MBSD values and PESQ values comparable with those of the AMR speech coder at 6.6 kbit/s. Additional subjective listening tests demonstrate that the implemented coding system retains phonetic content and the speaker’s identity, which supports the consistency of the proposed approach.

An improved mixed excitation linear prediction (MELP) coder

1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258) ◽ 10.1109/icassp.1999.758108 ◽ 1999 ◽ Cited By ~ 10

Author(s): T. Unno ◽ T.P. Barnwell ◽ Kwan Truong

Keyword(s): Linear Prediction ◽ Mixed Excitation Linear Prediction

A low bit rate speech codec using mixed excitation linear prediction for private mobile radio

Electronics and Communications in Japan (Part II Electronics) ◽ 10.1002/ecjb.20096 ◽ 2004 ◽ Vol 87 (6) ◽ pp. 69-81

Author(s): Seishi Sasaki ◽ Teruo Fumoto

Keyword(s): Linear Prediction ◽ Mobile Radio ◽ Bit Rate ◽ Low Bit Rate ◽ Speech Codec ◽ Mixed Excitation Linear Prediction ◽ Private Mobile Radio

A fractional bit allocation algorithm based on Mixed Excitation Linear Prediction

IEEE 10th International Conference on Signal Processing Proceedings ◽ 10.1109/icosp.2010.5656151 ◽ 2010

Author(s): Xu Jingde ◽ Wei Xuan ◽ Ji Zhe ◽ Cui Huijuan ◽ Tang Kun

Keyword(s): Linear Prediction ◽ Bit Allocation ◽ Allocation Algorithm ◽ Mixed Excitation Linear Prediction

Speech Coding Algorithm with Dynamic Weighted Inter-Frame Linear Prediction

Advanced Materials Research ◽ 10.4028/www.scientific.net/amr.798-799.769 ◽ 2013 ◽ Vol 798-799 ◽ pp. 769-772

Author(s): Zhao An Su ◽ Chun Di Xiu

Keyword(s): Speech Coding ◽ Linear Prediction ◽ The Other ◽ Dimensional Vector ◽ Bit Rate ◽ Reconstruction Accuracy ◽ Mixed Excitation Linear Prediction ◽ Super Frame ◽ Inter Frame

This paper proposes a new multi-frame joint quantization algorithm with dynamic weighted inter-frame linear prediction, based on mixed excitation linear prediction (MELP). In the encoding stage, a super-frame consists of three adjacent single frames. The Fourier magnitudes and the aperiodic jitter flag are dropped, and the remaining parameters are jointly quantized. The line spectral frequencies (LSFs) of the first and third frames are quantized together as a 20-dimensional vector. Pitch is quantized with codebooks whose size varies according to the band-pass voicing decisions (BPVs) of the super-frame. In the decoding stage, parameters are looked up in the corresponding codebooks. The LSFs of the middle frame are predicted from those of the first and third frames. The weighting factors change in accordance with the BPVs of five adjacent frames. Results show that dynamic weighted inter-frame linear prediction significantly improves the reconstruction accuracy of the LSFs, while the coding bit rate is reduced to 0.6 kbps.
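The decoder-side prediction of the middle frame described in this abstract can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not give the weighting rule, so the function `voicing_weight` below is a hypothetical stand-in that derives one scalar weight from how closely the middle frame's BPVs match those of its two neighbours (the paper uses five adjacent frames; three are used here for brevity). The function names and the example values are assumptions made for illustration only.

```python
from typing import List, Sequence


def voicing_weight(bpv_first: Sequence[int],
                   bpv_middle: Sequence[int],
                   bpv_third: Sequence[int]) -> float:
    """Hypothetical rule: weight the first frame by how many band-pass
    voicing decisions it shares with the middle frame, relative to the
    third frame. Returns a value in [0, 1]."""
    sim_first = sum(a == m for a, m in zip(bpv_first, bpv_middle))
    sim_third = sum(c == m for c, m in zip(bpv_third, bpv_middle))
    total = sim_first + sim_third
    # With no shared decisions on either side, fall back to plain averaging.
    return 0.5 if total == 0 else sim_first / total


def predict_middle_lsf(lsf_first: Sequence[float],
                       lsf_third: Sequence[float],
                       weight: float) -> List[float]:
    """Predict the middle frame's LSFs as a weighted combination of the
    decoded first-frame and third-frame LSFs."""
    if len(lsf_first) != len(lsf_third):
        raise ValueError("LSF vectors must have the same length")
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return [weight * a + (1.0 - weight) * b
            for a, b in zip(lsf_first, lsf_third)]


if __name__ == "__main__":
    # Illustrative values only: two LSFs per frame instead of ten.
    w = voicing_weight([1, 1, 1, 0, 0], [1, 1, 1, 0, 0], [1, 0, 0, 0, 0])
    print(w, predict_middle_lsf([300.0, 600.0], [340.0, 680.0], w))
```

Because the weight lies between 0 and 1, the prediction is a convex combination of two ascending LSF vectors and therefore stays in ascending order, which LSF-based synthesis filters require for stability.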

A 1.8kbps vocoder based on Mixed Excitation Linear Prediction

A variable-bit-rate speech coding algorithm based on enhanced mixed excitation linear prediction

Hidden data transmission in mixed excitation linear prediction coded speech using quantisation index modulation

Bit stream based wireless speech recognition using mixed excitation linear prediction (MELP) vocoder

An intelligibility enhancement for the mixed excitation linear prediction speech coder

An extended Levinson-Durbin algorithm and its application in mixed excitation linear prediction

AN EFFICIENT SPEECH GENERATIVE MODEL BASED ON DETERMINISTIC/STOCHASTIC SEPARATION OF SPECTRAL ENVELOPES

An improved mixed excitation linear prediction (MELP) coder

A low bit rate speech codec using mixed excitation linear prediction for private mobile radio

A fractional bit allocation algorithm based on Mixed Excitation Linear Prediction

Speech Coding Algorithm with Dynamic Weighted Inter-Frame Linear Prediction
