Identifying natural images and computer-generated graphics based on convolutional neural network

Author(s):  
Min Long ◽  
Sai Long ◽  
Fei Peng ◽  
Xiaohua Hu


2019 ◽  
Vol 12 (1) ◽  
pp. 86 ◽  
Author(s):  
Rafael Pires de Lima ◽  
Kurt Marfurt

Remote-sensing image scene classification can provide significant value, ranging from forest fire monitoring to land-use and land-cover classification. From the first aerial photographs of the early 20th century to the satellite imagery of today, the amount of remote-sensing data has grown geometrically, and at ever higher resolution. The need to analyze these modern digital data motivated research to accelerate remote-sensing image classification. Fortunately, great advances have been made by the computer vision community in classifying natural images, i.e., photographs taken with an ordinary camera. Natural-image datasets can range up to millions of samples and are therefore amenable to deep-learning techniques. Many fields of science, remote sensing included, have been able to exploit the success of natural-image classification by convolutional neural network models through a technique commonly called transfer learning. We provide a systematic review of transfer-learning applications for scene classification using different datasets and different deep-learning models. We evaluate how the specialization of convolutional neural network models affects the transfer-learning process by splitting the original models at different points. As expected, we find that the choice of hyperparameters used to train the model has a significant influence on the final performance of the models. Curiously, we find that transfer learning from models trained on larger, more generic natural-image datasets outperforms transfer learning from models trained directly on smaller remotely sensed datasets. Nonetheless, the results show that transfer learning provides a powerful tool for remote-sensing scene classification.
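The split-and-transfer procedure the abstract describes can be illustrated with a short sketch. Below is a minimal, hypothetical TensorFlow/Keras example that cuts an ImageNet-pretrained VGG16 at the block4_pool layer and attaches a new scene-classification head; the cut point, the 45-class head (e.g., NWPU-RESISC45-sized), and the hyperparameters are illustrative assumptions, not the authors' exact configuration:

```python
# Sketch: transfer learning for remote-sensing scene classification,
# cutting an ImageNet-pretrained CNN at an intermediate "split point".
import tensorflow as tf

NUM_CLASSES = 45  # assumption: a 45-class scene dataset for illustration

base = tf.keras.applications.VGG16(weights="imagenet", include_top=False,
                                   input_shape=(224, 224, 3))

# Split the original model at an earlier, less specialized layer
# ("block4_pool" is one illustrative choice of split point).
cut = base.get_layer("block4_pool").output

x = tf.keras.layers.GlobalAveragePooling2D()(cut)
x = tf.keras.layers.Dense(256, activation="relu")(x)
outputs = tf.keras.layers.Dense(NUM_CLASSES, activation="softmax")(x)
model = tf.keras.Model(base.input, outputs)

# Freeze the transferred convolutional layers; train only the new head.
for layer in base.layers:
    layer.trainable = False

model.compile(optimizer=tf.keras.optimizers.Adam(1e-4),  # hyperparameters matter
              loss="sparse_categorical_crossentropy", metrics=["accuracy"])
```

Moving the cut earlier in the network keeps only the more generic low-level filters, which is how a study of this kind can probe how the specialization of the later layers affects transfer.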


2021 ◽  
Vol 7 ◽  
pp. e715
Author(s):  
Laith Alzubaidi ◽  
Ye Duan ◽  
Ayad Al-Dujaili ◽  
Ibraheem Kasim Ibraheem ◽  
Ahmed H. Alkenani ◽  
...  

Transfer learning (TL) has been widely utilized to address the lack of training data for deep learning models. Specifically, one of the most popular uses of TL has been models pre-trained on the ImageNet dataset. Nevertheless, although these pre-trained models have shown effective performance in several application domains, they may not offer significant benefits in every medical imaging scenario. Such models were designed to classify a thousand classes of natural images, and the features they learn differ fundamentally from those needed for medical imaging tasks. Most medical imaging applications involve only two to ten classes, so we suspect that such deep models are not necessary. This paper investigates this hypothesis through an experimental study. A lightweight convolutional neural network (CNN) model and the pre-trained models are evaluated on three different medical imaging datasets, each trained under two scenarios: once with a small number of images and once with a large number of images. Surprisingly, the lightweight model trained from scratch achieved more competitive performance than the pre-trained models. More importantly, the lightweight CNN model can be successfully trained and tested using basic computational tools and still provide high-quality results, specifically on medical imaging datasets.
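As a rough illustration of what "lightweight" means here, the sketch below builds a small few-class CNN in TensorFlow/Keras; the layer widths, the 128x128 grayscale input, and the 3-class head are assumptions for illustration, not the paper's exact architecture:

```python
# Sketch: a lightweight CNN trained from scratch, as an alternative to
# large ImageNet-pretrained models for few-class medical imaging tasks.
import tensorflow as tf

def lightweight_cnn(input_shape=(128, 128, 1), num_classes=3):
    # Three small convolutional stages followed by a tiny classifier head;
    # sizes are illustrative assumptions, not the paper's configuration.
    return tf.keras.Sequential([
        tf.keras.layers.Conv2D(16, 3, activation="relu", input_shape=input_shape),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Conv2D(32, 3, activation="relu"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Conv2D(64, 3, activation="relu"),
        tf.keras.layers.GlobalAveragePooling2D(),
        tf.keras.layers.Dense(num_classes, activation="softmax"),
    ])

model = lightweight_cnn()
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
# With only a handful of classes and parameters, a model like this can be
# trained on a CPU, consistent with the "basic computational tools" claim.
```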


2015 ◽  
Author(s):  
Md Mushfiqul Alam ◽  
Tuan D. Nguyen ◽  
Martin T. Hagan ◽  
Damon M. Chandler

Electronics ◽  
2019 ◽  
Vol 8 (9) ◽  
pp. 971 ◽  
Author(s):  
Min Zhang ◽  
Yujin Yan ◽  
Hai Wang ◽  
Wei Zhao

Irregular text appears in a wide range of applications. Unlike regular text, irregular text is difficult to recognize because of its varied shapes and distorted patterns. In this paper, we develop a multidirectional convolutional neural network (MCN) that extracts features in four directions to fully describe the textual information, while the character-placement possibility is extracted as the weight of the four direction features. Building on these components, we propose an encoder that fuses the four direction features into a feature code used to predict the character sequence. The whole network is end-to-end trainable, requiring only images and word-level labels. Experiments on standard benchmarks, including the IIIT-5K, SVT, CUTE80, and ICDAR datasets, demonstrate the superiority of the proposed method on both regular and irregular datasets. The developed method shows an accuracy increase of 1.2% on the CUTE80 dataset and 1.5% on the SVT dataset, and has fewer parameters than most existing methods.
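The weighted fusion of direction features can be sketched as follows. This is a hedged TensorFlow illustration of the general idea, not the paper's exact MCN encoder; the tensor shapes and the Dense-based placement weighting are assumptions:

```python
# Sketch of the weighted-fusion idea: four direction features combined
# using learned character-placement weights.
import tensorflow as tf

def fuse_direction_features(direction_feats):
    """direction_feats: list of 4 tensors, each (batch, steps, channels)."""
    stacked = tf.stack(direction_feats, axis=1)       # (batch, 4, steps, ch)
    # Character-placement possibility: one scalar weight per direction and
    # step (a fresh Dense layer here for brevity; a real model would create
    # it once and reuse it).
    logits = tf.keras.layers.Dense(1)(stacked)        # (batch, 4, steps, 1)
    weights = tf.nn.softmax(logits, axis=1)           # normalize over directions
    return tf.reduce_sum(weights * stacked, axis=1)   # (batch, steps, ch)

# Dummy features standing in for the four direction branches of the CNN.
feats = [tf.random.normal((2, 25, 256)) for _ in range(4)]
code = fuse_direction_features(feats)  # fused feature code for decoding
```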

