New Speech Compression Technique based on Filter Bank Design and Psychoacoustic Model

In this paper, a new speech compression technique is proposed. This technique applies a Psychoacoustic Model and a general approach for Filter Bank Design using optimization. It is evaluated and compared with a compression technique using a MDCT (Modified Discrete Cosine Transform) Filter Bank of 32 Filters and a Psychoacoustic Model. This evaluation and comparison is performed by calculating bits before and after compression, PSNR (Peak Signal to Noise Ratio), NRMSE (Normalized Root Mean Square Error), SNR (Signal to Noise Ratio) and PESQ (Perceptual evaluation of speech quality) computations. The two techniques are tested and applied to a number of speech signals that are sampled at 8 kHz. The results obtained from this evaluation show that the proposed technique outperforms the second compression technique (based on a Psychoacoustic Model and MDCT filter Bank) in terms of Bits after compression and compression ratio. In fact, the proposed technique yields higher values for the compression ratio than the second compression technique. Moreover, the proposed compression technique presents reconstructed speech signals with acceptable perceptual qualities. This is justified by the values of SNR, PSNR and NRMSE and PESQ.

Download Full-text

Method and system for adaptive filtering of speech signals using signal-to-noise ratio to choose subband filter bank

The Journal of the Acoustical Society of America ◽

10.1121/1.426926 ◽

1999 ◽

Vol 105 (5) ◽

pp. 2554

Author(s):

Marvin L. Vis ◽

Aruna Bayya

Keyword(s):

Adaptive Filtering ◽

Filter Bank ◽

Signal To Noise Ratio ◽

Speech Signals ◽

Signal To Noise ◽

Noise Ratio

Download Full-text

Singular Values Decomposition and Lifting Wavelet Transform for Speech Signal Embedding into Digital Image

Recent Advances in Electrical & Electronic Engineering (Formerly Recent Patents on Electrical & Electronic Engineering) ◽

10.2174/2352096511666180511151646 ◽

2019 ◽

Vol 12 (2) ◽

pp. 138-151

Author(s):

Mourad Talbi ◽

Med Salim Bouhlel

Keyword(s):

Wavelet Transform ◽

Speech Signal ◽

Signal To Noise Ratio ◽

Perceptual Quality ◽

Lifting Wavelet Transform ◽

Signal To Noise ◽

Perceptual Evaluation ◽

Lifting Wavelet ◽

Noise Ratio

Background: In this paper, we propose a secure image watermarking technique which is applied to grayscale and color images. It consists in applying the SVD (Singular Value Decomposition) in the Lifting Wavelet Transform domain for embedding a speech image (the watermark) into the host image. Methods: It also uses signature in the embedding and extraction steps. Its performance is justified by the computation of PSNR (Pick Signal to Noise Ratio), SSIM (Structural Similarity), SNR (Signal to Noise Ratio), SegSNR (Segmental SNR) and PESQ (Perceptual Evaluation Speech Quality). Results: The PSNR and SSIM are used for evaluating the perceptual quality of the watermarked image compared to the original image. The SNR, SegSNR and PESQ are used for evaluating the perceptual quality of the reconstructed or extracted speech signal compared to the original speech signal. Conclusion: The Results obtained from computation of PSNR, SSIM, SNR, SegSNR and PESQ show the performance of the proposed technique.

Download Full-text

A Pitch Detection Method for Speech Signals with Low Signal-to-Noise Ratio

2007 International Symposium on Signals, Systems and Electronics ◽

10.1109/issse.2007.4294497 ◽

2007 ◽

Author(s):

C. Shahnaz ◽

W.-P. Zhu ◽

M. O. Ahmad

Keyword(s):

Detection Method ◽

Signal To Noise Ratio ◽

Speech Signals ◽

Pitch Detection ◽

Signal To Noise ◽

Noise Ratio

Download Full-text

A supervised signal-to-noise ratio estimation of speech signals

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2014.6855207 ◽

2014 ◽

Cited By ~ 4

Author(s):

Pavlos Papadopoulos ◽

Andreas Tsiartas ◽

James Gibson ◽

Shrikanth Narayanan

Keyword(s):

Signal To Noise Ratio ◽

Speech Signals ◽

Ratio Estimation ◽

Signal To Noise ◽

Noise Ratio

Download Full-text

Blind Determination of the Signal to Noise Ratio of Speech Signals Based on Estimation Combination of Multiple Features

APCCAS 2006 - 2006 IEEE Asia Pacific Conference on Circuits and Systems ◽

10.1109/apccas.2006.342229 ◽

2006 ◽

Cited By ~ 2

Author(s):

Russell Ondusko ◽

Matthew Marbach ◽

Andrew McClellan ◽

Ravi P. Ramachandran ◽

Linda M. Head ◽

...

Keyword(s):

Signal To Noise Ratio ◽

Speech Signals ◽

Signal To Noise ◽

Multiple Features ◽

Noise Ratio

Download Full-text

The effects of compression ratio, signal-to-noise ratio, and level on speech recognition in normal-hearing listeners

The Journal of the Acoustical Society of America ◽

10.1121/1.1369105 ◽

2001 ◽

Vol 109 (6) ◽

pp. 2964-2973 ◽

Cited By ~ 25

Author(s):

Benjamin W. Y. Hornsby ◽

Todd A. Ricketts

Keyword(s):

Speech Recognition ◽

Compression Ratio ◽

Signal To Noise Ratio ◽

Normal Hearing ◽

Signal To Noise ◽

Noise Ratio

Download Full-text

Effects of Using Static Methods with Contourlet Transformation on Speech Compression

Iraqi Journal of Science ◽

10.24996/ijs.2021.62.8.31 ◽

2021 ◽

pp. 2784-2795

Author(s):

Esraa Abd Alsalam ◽

Shaymaa Ahmed Razoqi ◽

Eman Fathi Ahmed

Keyword(s):

Speech Signal ◽

Cross Correlation ◽

Signal To Noise Ratio ◽

Speech Compression ◽

Signal To Noise ◽

Absolute Deviation ◽

Limited Bandwidth ◽

Normalized Cross Correlation ◽

Noise Ratio ◽

And Storage

Compression of speech signal is an essential field in signal processing. Speech compression is very important in today’s world, due to the limited bandwidth transmission and storage capacity. This paper explores a Contourlet transformation based methodology for the compression of the speech signal. In this methodology, the speech signal is analysed using Contourlet transformation coefficients with statistic methods as threshold values, such as Interquartile Filter (IQR), Average Absolute Deviation (AAD), Median Absolute Deviation (MAD) and standard deviation (STD), followed by the application of (Run length encoding) They are exploited for recording speech in different times (5, 30, and 120 seconds). A comparative study of performance of different transforms is made in terms of (Signal to Noise Ratio,Peak Signal to Noise Ratio,Normalized Cross-Correlation, Normalized Cross-Correlation) and the compression ratio (CR). The best stable result of implementing our algorithm for compressing speech is at level1 with AAD or MAD, adopting Matlab 2013a language.

Download Full-text

Performance on FBMC-OQAM and OFDM Multicarrier Systems

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.a1036.0881019 ◽

2019 ◽

Vol 8 (10) ◽

pp. 1564-1568

Keyword(s):

Orthogonal Frequency Division Multiplexing ◽

Filter Bank ◽

Signal To Noise Ratio ◽

Cyclic Prefix ◽

Frequency Division Multiplexing ◽

Frequency Division ◽

Signal To Noise ◽

Multicarrier Systems ◽

Noise Ratio ◽

Power Equality

Filter Bank Multi Carrier (FBMC) offers best detestable properties took a gander at over orthogonal frequency division multiplexing (OFDM) to the attack of nonexistent hindrance. FBMC system is a multicarrier structure, particularly sensible for 5G remote correspondences. FBMC beats OFDM as a result of proficient use of the open information move limit and without usage of cyclic prefix (CP). In this paper, we address the issue of remarkable enrollment at the pilot territory and used to audit the channels with pilot picture, in like way consider the fundamental conditions for utilization of the assistant pilot pictures. First and two partner pictures for each pilot plans with power equality uses instead of one picture; it can attainable inspirations driving necessity of OFDM and FBMC depending upon signal to noise ratio (SNR) what's relentlessly possible to improve the introduction of one frivolity pictures by using multiple associate pictures. Finally autonomous the BER execution reenactment results and adornment pilot pictures

Download Full-text

Improving the Compression Ratio and Peak Signal to Noise Ratio of Medical Image Sequence by using SPIHT, STW, and Block Matching Algorithms

i-manager’s Journal on Image Processing ◽

10.26634/jip.4.1.13518 ◽

2017 ◽

Vol 4 (1) ◽

pp. 1

Author(s):

Rai Jayant Kumar ◽

Keyword(s):

Compression Ratio ◽

Medical Image ◽

Signal To Noise Ratio ◽

Image Sequence ◽

Block Matching ◽

Signal To Noise ◽

Noise Ratio

Download Full-text

Optimization Before Biomedical Image Compression Using CLAHE and DCS

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i3.27.17884 ◽

2018 ◽

Vol 7 (3.27) ◽

pp. 236

Author(s):

Satyawati S. Magar ◽

Bhavani Sridharan

Keyword(s):

Medical Imaging ◽

Image Compression ◽

Image Enhancement ◽

Compression Ratio ◽

Signal To Noise Ratio ◽

Histogram Equalization ◽

Signal To Noise ◽

Biomedical Image ◽

Adaptive Histogram Equalization ◽

Noise Ratio

In current years, improving the Compression Ratio (CR) in medical imaging is essential and becomes big challenge in the field of biomedical. In that direction we have done optimization before biomedical image compression. For the same we have used the image enhancement techniques. For the enhancement of an image we have used Contrast Limited Adaptive Histogram Equalization (CLAHE) and Decorrelation Stretch (DCS) algorithms. By optimizing an image before compression we have achieved better Compression Ratio (CR) and Peak Signal to Noise Ratio (PSNR) than existing methods of an image compression. Mainly results are compared with Oscillation Concept method of an image compression with and without optimization.

Download Full-text