scholarly journals DoPAMINE: Double-Sided Masked CNN for Pixel Adaptive Multiplicative Noise Despeckling

Author(s):  
Sunghwan Joo ◽  
Sungmin Cha ◽  
Taesup Moon

We propose DoPAMINE, a new neural network based multiplicative noise despeckling algorithm. Our algorithm is inspired by Neural AIDE (N-AIDE), which is a recently proposed neural adaptive image denoiser. While the original NAIDE was designed for the additive noise case, we show that the same framework, i.e., adaptively learning a network for pixel-wise affine denoisers by minimizing an unbiased estimate of MSE, can be applied to the multiplicative noise case as well. Moreover, we derive a double-sided masked CNN architecture which can control the variance of the activation values in each layer and converge fast to high denoising performance during supervised training. In the experimental results, we show our DoPAMINE possesses high adaptivity via fine-tuning the network parameters based on the given noisy image and achieves significantly better despeckling results compared to SAR-DRN, a state-of-the-art CNN-based algorithm.

2021 ◽  
Vol 2083 (4) ◽  
pp. 042017
Author(s):  
Yingdong Ru

Abstract Music symbol recognition is an important part of Optical Music Recognition (OMR), Chord recognition is one of the most important research contents in the field of music information retrieval. It plays an important role in information processing, music structure analysis, and recommendation systems. Aiming at the problem of low chord recognition accuracy in the OMR recognition model, the article proposes a chord recognition method based on the YOLOV4 neural network model. First, the YOLOV4 network model is used to train single-voice scores to obtain the best training model. Then, the scores containing chords are trained through neural network fine-tuning technology. The experimental results show that the method recognizes the chords with great results, the model was tested on the test set generated by MuseScore. The experimental results show that the accuracy of note recognition is high, which can reach the accuracy of duration value of 0.96 which is higher than the accuracy of note recognition of other score recognition models.


2021 ◽  
Author(s):  
Noor Ahmad ◽  
Muhammad Aminu ◽  
Mohd Halim Mohd Noor

Deep learning approaches have attracted a lot of attention in the automatic detection of Covid-19 and transfer learning is the most common approach. However, majority of the pre-trained models are trained on color images, which can cause inefficiencies when fine-tuning the models on Covid-19 images which are often grayscale. To address this issue, we propose a deep learning architecture called CovidNet which requires a relatively smaller number of parameters. CovidNet accepts grayscale images as inputs and is suitable for training with limited training dataset. Experimental results show that CovidNet outperforms other state-of-the-art deep learning models for Covid-19 detection.


2011 ◽  
Vol 317-319 ◽  
pp. 897-900
Author(s):  
Zhen Hua Zhao ◽  
Xiao Hong Hao

A novel illumination compensate method is proposed in this paper to improve recognition performance. A modified lighting model called Lambertin which includes additive noise and multiplicative noise are presented firstly. Then, additive noise is removed by using wavelet packet transformation. Next, the processed image is transformed into logarithm domain and the multiplicative noise, which has been named additive noise, is removed by means of the same above algorithm. Finally, a compensated face image is obtained. We examine the proposed method on Yale extended B database compared with other methods. Experimental results show that our algorithm improves by 3%~12% recognition rate. It can effectively adjust the facial images for varying illumination conditions and also improve the recognition performance and robustness.


2021 ◽  
pp. 1-10
Author(s):  
Gayatri Pattnaik ◽  
Vimal K. Shrivastava ◽  
K. Parvathi

Pests are major threat to economic growth of a country. Application of pesticide is the easiest way to control the pest infection. However, excessive utilization of pesticide is hazardous to environment. The recent advances in deep learning have paved the way for early detection and improved classification of pest in tomato plants which will benefit the farmers. This paper presents a comprehensive analysis of 11 state-of-the-art deep convolutional neural network (CNN) models with three configurations: transfers learning, fine-tuning and scratch learning. The training in transfer learning and fine tuning initiates from pre-trained weights whereas random weights are used in case of scratch learning. In addition, the concept of data augmentation has been explored to improve the performance. Our dataset consists of 859 tomato pest images from 10 categories. The results demonstrate that the highest classification accuracy of 94.87% has been achieved in the transfer learning approach by DenseNet201 model with data augmentation.


2021 ◽  
Author(s):  
Noor Ahmad ◽  
Muhammad Aminu ◽  
Mohd Halim Mohd Noor

Deep learning approaches have attracted a lot of attention in the automatic detection of Covid-19 and transfer learning is the most common approach. However, majority of the pre-trained models are trained on color images, which can cause inefficiencies when fine-tuning the models on Covid-19 images which are often grayscale. To address this issue, we propose a deep learning architecture called CovidNet which requires a relatively smaller number of parameters. CovidNet accepts grayscale images as inputs and is suitable for training with limited training dataset. Experimental results show that CovidNet outperforms other state-of-the-art deep learning models for Covid-19 detection.


2020 ◽  
Author(s):  
Somdip Dey ◽  
Amit Singh ◽  
Dilip Kumar Prasad ◽  
Klaus D. Mcdonald-Maier

<div><div><div><p>This paper proposes a novel human-inspired methodology called IRON-MAN (Integrated RatiONal prediction and Motionless ANalysis of videos) on mobile multi-processor systems-on-chips (MPSoCs). The methodology integrates analysis of the previous image frames of the video to represent the analysis of the current frame in order to perform Temporal Motionless Analysis of the Video (TMAV). This is the first work on TMAV using Convolutional Neural Network (CNN) for scene prediction in MPSoCs. Experimental results show that our methodology outperforms state-of-the-art. We also introduce a metric named, Energy Consumption per Training Image (ECTI) to assess the suitability of using a CNN model in mobile MPSoCs with a focus on energy consumption of the device.</p></div></div></div>


2019 ◽  
Vol 11 (20) ◽  
pp. 2379 ◽  
Author(s):  
Ting Pan ◽  
Dong Peng ◽  
Wen Yang ◽  
Heng-Chao Li

Despeckling is a longstanding topic in synthetic aperture radar (SAR) images. Recently, many convolutional neural network (CNN) based methods have been proposed and shown state-of-the-art performance for SAR despeckling problem. However, these CNN based methods always need many training data or can only deal with specific noise level. To solve these problems, we directly embed an efficient CNN pre-trained model for additive white Gaussian noise (AWGN) with Multi-channel Logarithm with Gaussian denoising (MuLoG) algorithm to deal with the multiplicative noise in SAR images. This flexible pre-trained CNN model takes the noise level as input, thus only a single pre-trained model is needed to deal with different noise levels. We also use a detector to find the homogeneous region automatically to estimate the noise level of image as input. Embedded with MuLoG, our proposed filter can despeckle not only single channel but also multi-channel SAR images. Finally, both simulated and real (Pol)SAR images were tested in experiments, and the results show that the proposed method has better and more robust performance than others.


2018 ◽  
Vol 226 ◽  
pp. 04048
Author(s):  
Nikolay V. Gapon ◽  
Evgenii A. Semenishchev ◽  
Oxana S. Balabaeva ◽  
Arina A. Skorikova ◽  
Olga A. Tokareva ◽  
...  

This article examines the method of image reconstruction, which aims to restore the exposed areas on MRI images. The algorithm is based on a geometric model for patch synthesis. The lost pixels are recovered by copying pixel values from the source using a similarity criterion. We used a trained neural network to choose the “best similar” patch. Experimental results show that the proposed method outperforms widely used state-of-the-art methods.


Sensors ◽  
2020 ◽  
Vol 20 (11) ◽  
pp. 3204
Author(s):  
S. M. Nadim Uddin ◽  
Yong Ju Jung

Deep-learning-based image inpainting methods have shown significant promise in both rectangular and irregular holes. However, the inpainting of irregular holes presents numerous challenges owing to uncertainties in their shapes and locations. When depending solely on convolutional neural network (CNN) or adversarial supervision, plausible inpainting results cannot be guaranteed because irregular holes need attention-based guidance for retrieving information for content generation. In this paper, we propose two new attention mechanisms, namely a mask pruning-based global attention module and a global and local attention module to obtain global dependency information and the local similarity information among the features for refined results. The proposed method is evaluated using state-of-the-art methods, and the experimental results show that our method outperforms the existing methods in both quantitative and qualitative measures.


Electronics ◽  
2021 ◽  
Vol 10 (13) ◽  
pp. 1518
Author(s):  
António S. Pinto ◽  
Sebastian Böck ◽  
Jaime S. Cardoso ◽  
Matthew E. P. Davies

The extraction of the beat from musical audio signals represents a foundational task in the field of music information retrieval. While great advances in performance have been achieved due the use of deep neural networks, significant shortcomings still remain. In particular, performance is generally much lower on musical content that differs from that which is contained in existing annotated datasets used for neural network training, as well as in the presence of challenging musical conditions such as rubato. In this paper, we positioned our approach to beat tracking from a real-world perspective where an end-user targets very high accuracy on specific music pieces and for which the current state of the art is not effective. To this end, we explored the use of targeted fine-tuning of a state-of-the-art deep neural network based on a very limited temporal region of annotated beat locations. We demonstrated the success of our approach via improved performance across existing annotated datasets and a new annotation-correction approach for evaluation. Furthermore, we highlighted the ability of content-specific fine-tuning to learn both what is and what is not the beat in challenging musical conditions.


Sign in / Sign up

Export Citation Format

Share Document