scholarly journals Content Based Audio Watermarking and Retrieval Using Time-Frequency Analysis

Author(s):  
Shahrzad Esmaili

This research focuses on the application of joint time-frequency (TF) analysis for watermarking and classifying different audio signals. Time frequency analysis which originated in the 1930s has often been used to model the non-stationary behaviour of speech and audio signals. By taking into consideration the human auditory system which has many non-linear effects and its masking properties, we can extract efficient features from the TF domain to watermark or classify signals. This novel audio watermarking scheme is based on spread spectrum techniques and uses content-based analysis to detect the instananeous mean frequency (IMF) of the input signal. The watermark is embedded in this perceptually significant region such that it will resist attacks. Audio watermarking offers a solution to data privacy and helps to protect the rights of the artists and copyright holders. Using the IMF, we aim to keep the watermark imperceptible while maximizing its robustness. In this case, 25 bits are embedded and recovered witin a 5 s sample of an audio signal. This scheme has shown to be robust against various signal processing attacks including filtering, MP3 compression, additive moise and resampling with a bit error rate in the range of 0-13%. In addition content-based classification is performed using TF analysis to classify sounds into 6 music groups consisting of rock, classical, folk, jazz and pop. The features that are extracted include entropy, centroid, centroid ratio, bandwidth, silence ratio, energy ratio, frequency location of minimum and maximum energy. Using a database of 143 signals, a set of 10 time-frequncy features are extracted and an accuracy of classification of around 93.0% using regular linear discriminant analysis or 92.3% using leave one out method is achieved.

2021 ◽  
Author(s):  
Shahrzad Esmaili

This research focuses on the application of joint time-frequency (TF) analysis for watermarking and classifying different audio signals. Time frequency analysis which originated in the 1930s has often been used to model the non-stationary behaviour of speech and audio signals. By taking into consideration the human auditory system which has many non-linear effects and its masking properties, we can extract efficient features from the TF domain to watermark or classify signals. This novel audio watermarking scheme is based on spread spectrum techniques and uses content-based analysis to detect the instananeous mean frequency (IMF) of the input signal. The watermark is embedded in this perceptually significant region such that it will resist attacks. Audio watermarking offers a solution to data privacy and helps to protect the rights of the artists and copyright holders. Using the IMF, we aim to keep the watermark imperceptible while maximizing its robustness. In this case, 25 bits are embedded and recovered witin a 5 s sample of an audio signal. This scheme has shown to be robust against various signal processing attacks including filtering, MP3 compression, additive moise and resampling with a bit error rate in the range of 0-13%. In addition content-based classification is performed using TF analysis to classify sounds into 6 music groups consisting of rock, classical, folk, jazz and pop. The features that are extracted include entropy, centroid, centroid ratio, bandwidth, silence ratio, energy ratio, frequency location of minimum and maximum energy. Using a database of 143 signals, a set of 10 time-frequncy features are extracted and an accuracy of classification of around 93.0% using regular linear discriminant analysis or 92.3% using leave one out method is achieved.


2021 ◽  
Author(s):  
Serhat Erküçük

In this study, we present novel applications of time-frequency analysis to spread spectrum based communication and audio watermarking systems. Our objective is to detect and estimate non-stationary signals, such as chirps, that are characterized by directional elements in the time-frequency plane. Towards this goal, we model non-stationary signals using the matching pursuit decomposition algorithm, generate a positive time-frequency representation of the signal model using the Wigner-Ville distribution and estimate the energy varying directional elements using a line detection algorithm based on the Hough-Radon transform. Spread spectrum communication systems frequently encounter nonstationary signals with energy varying directional elements as hostile jamming signals. In this thesis, we develop a new interference excision algorithm for spread spectrum communication systems based on the directional element estimation algorithm. At the receiver, we first excise the interference from the spread spectrum signal before despreading and data symbol detection. The new algorithm can excise single and multicomponent interferences such that the spread spectrum system can reliably detect the transmitted message symbols even, when the interference power exceeds the jamming margin of the system. We verify the effectiveness of the interference excision algorithm using simulation studies. Watermarking is the process of embedding imperceptible data into the host signal for marking the copyright ownership. The embedded data should be extractable to prove ownership. Watermarking systems face problems similar to those in spread spectrum communication systems, namely, intentional attacks by the adversaries. In watermarking, the adversaries try to obliterate the embedded watermark in order to prevent its detection by authorized parties. In this thesis, we develop a spread spectrum audio watermarking scheme, where we embed perceptually shaped linear chirps as watermark messages. The directional elements of the chirp signals represent different watermark messages. We extract the watermark by first detecting the transmitted message symbols in the spread spectrum signal. We then use the directional element estimation algorithm based on the time-frequency analysis as a post-processing tool to minimize the effects of hostile attacks on the extractability of the embedded watermark. We demonstrate the robustness of the algorithm by extracting the watermark correctly after common signal processing operations representing hostile attacks by adversaries.


2021 ◽  
Author(s):  
Serhat Erküçük

In this study, we present novel applications of time-frequency analysis to spread spectrum based communication and audio watermarking systems. Our objective is to detect and estimate non-stationary signals, such as chirps, that are characterized by directional elements in the time-frequency plane. Towards this goal, we model non-stationary signals using the matching pursuit decomposition algorithm, generate a positive time-frequency representation of the signal model using the Wigner-Ville distribution and estimate the energy varying directional elements using a line detection algorithm based on the Hough-Radon transform. Spread spectrum communication systems frequently encounter nonstationary signals with energy varying directional elements as hostile jamming signals. In this thesis, we develop a new interference excision algorithm for spread spectrum communication systems based on the directional element estimation algorithm. At the receiver, we first excise the interference from the spread spectrum signal before despreading and data symbol detection. The new algorithm can excise single and multicomponent interferences such that the spread spectrum system can reliably detect the transmitted message symbols even, when the interference power exceeds the jamming margin of the system. We verify the effectiveness of the interference excision algorithm using simulation studies. Watermarking is the process of embedding imperceptible data into the host signal for marking the copyright ownership. The embedded data should be extractable to prove ownership. Watermarking systems face problems similar to those in spread spectrum communication systems, namely, intentional attacks by the adversaries. In watermarking, the adversaries try to obliterate the embedded watermark in order to prevent its detection by authorized parties. In this thesis, we develop a spread spectrum audio watermarking scheme, where we embed perceptually shaped linear chirps as watermark messages. The directional elements of the chirp signals represent different watermark messages. We extract the watermark by first detecting the transmitted message symbols in the spread spectrum signal. We then use the directional element estimation algorithm based on the time-frequency analysis as a post-processing tool to minimize the effects of hostile attacks on the extractability of the embedded watermark. We demonstrate the robustness of the algorithm by extracting the watermark correctly after common signal processing operations representing hostile attacks by adversaries.


Author(s):  
Annachiara Strazza ◽  
Federica Verdini ◽  
Laura Burattini ◽  
Sandro Fioretti ◽  
Francesco Di Nardo

2014 ◽  
Vol 543-547 ◽  
pp. 2551-2554
Author(s):  
Gang Fu ◽  
Dong Xu Zhu ◽  
Yue Feng

In recent decades, Hybrid Spread Spectrum (DS/FH) signals has been rapidly developed and extensively used in the field of both military communications and aerospace monitoring. Through the study of hybrid spread spectrum signal, firstly use time-frequency analysis to get the time-frequency diagram of hybrid spread spectrum signal, and then wipe off the DS pseudo-code, lastly use the high-resolution frequency to estimate the parameters of hopping frequency signal, in order to provide a kind of ideas and reference to test hopping frequency parameter of hybrid spread spectrum signal.


2017 ◽  
Vol 57 ◽  
pp. 9-10 ◽  
Author(s):  
A. Strazza ◽  
F. Verdini ◽  
L. Burattini ◽  
S. Fioretti ◽  
F. Di Nardo

1997 ◽  
Vol 117 (3) ◽  
pp. 338-345 ◽  
Author(s):  
Masatake Kawada ◽  
Masakazu Wada ◽  
Zen-Ichiro Kawasaki ◽  
Kenji Matsu-ura ◽  
Makoto Kawasaki

Sign in / Sign up

Export Citation Format

Share Document