Dictionary learning and shift-invariant sparse coding denoising for CSEM data combined with CEEMD
Controlled-source electromagnetic (CSEM) data recorded in industrialized areas are inevitably contaminated by strong cultural noise. Traditional noise attenuation methods are often ineffective for intricate aperiodic noise. To address the problem mentioned above, we propose a novel noise isolation method based on fast Fourier transform (FFT), complementary ensemble empirical mode decomposition (CEEMD) and shift-invariant sparse coding (SISC, an unsupervised machine learning algorithm under a data-driven framework). First, large powerline noise is accurately subtracted in the frequency domain. Then, the CEEMD based algorithm is used to correct the large baseline drift. Finally, taking advantage of the sparsity of periodic signals, SISC is applied to autonomously learn a feature atom (the useful signal with a length of one period) from the detrended signal and recover CSEM signal with high accuracy. We demonstrate the performance of the SISC by comparing with other three promising signal processing methods, including the mathematic morphology filtering (MMF), soft-threshold wavelet filtering, and K-SVD (another dictionary learning method) sparse decomposition. Experimental results illustrate that SISC provides the best performance. Robustness test results show that SISC can increase the signal-to-noise ratio (SNR) of noisy signal from 0 dB to more than 15 dB. Case studies of synthetic and real data collected in the Chinese Provinces Sichuan and Yunnan show that the proposed method is capable of effectively recovering the useful signal from the observed data contaminated with different kinds of strong ambient noise. The curves of U/I and apparent resistivity after applying the proposed method improved greatly. Moreover, the proposed method performs better than the robust estimation method based on correlation analysis.