A survey of deep learning methods and software tools for image classification and object detection

Semantic image classification as practised in Earth Observation is poorly suited to mapping fluvial landforms which are often composed of multiple landcover types such as water, riparian vegetation and exposed sediment. Deep learning methods developed in the field of computer vision for the purpose of image classification (ie the attribution of a single label to an image such as cat/dog/etc) are in fact more suited to such landform mapping tasks. Notably, Convolutional Neural Networks (CNN) have excelled at the task of labelling images. However, CNN are notorious for requiring very large training sets that are laborious and costly to assemble. Similarity learning is a sub-field of deep learning and is better known for one-shot and few-shot learning methods. These approaches aim to reduce the need for large training sets by using CNN architectures to compare a single, or few, known examples of an instance to a new image and determining if the new image is similar to the provided examples. Similarity learning rests on the concept of image embeddings which are condensed higher-dimension vector representations of an image generated by a CNN. Ideally, and if a CNN is suitably trained, image embeddings will form clusters according to image classes, even if some of these classes were never used in the initial CNN training.&#160;In this paper, we use similarity learning for the purpose of fluvial landform mapping from Sentinel-2 imagery. We use the True Color Image product with a spatial resolution of 10 meters and begin by manually extracting tiles of 128x128 pixels for 4 classes: non-river, meandering reaches, anastomosing reaches and braiding reaches. We use the DenseNet121 CNN topped with a densely connected layer of 8 nodes which will produce embeddings as 8-dimension vectors. We then train this network with only 3 classes (non-river, meandering and anastomosing) using a categorical cross-entropy loss function. Our first result is that when applied to our image tiles, the embeddings produced by the trained CNN deliver 4 clusters. Despite not being used in the network training, the braiding river reach tiles have produced embeddings that form a distinct cluster. We then use this CNN to perform few-shot learning with a Siamese triplet architecture that will classify a new tile based on only 3 examples of each class. Here we find that tiles from the non-river, meandering and anastomising class were classified with F1 scores of 72%, 87% and 84%, respectively. The braiding river tiles were classified to an F1 score of 80%. Whilst these performances are lesser than the 90%+ performances expected from conventional CNN, the prediction of a new class of objects (braiding reaches) with only 3 samples to 80% F1 is unprecedented in river remote sensing. We will conclude the paper by extending the method to mapping fluvial landforms on entire Sentinel-2 tiles and we will show how we can use advanced cluster analyses of image embeddings to identify landform classes in an image without making a priori decisions on the classes that are present in the image.

Download Full-text

Autonomous Martian rock image classification based on transfer deep learning methods

Earth Science Informatics ◽

10.1007/s12145-019-00433-9 ◽

2020 ◽

Vol 13 (3) ◽

pp. 951-963 ◽

Cited By ~ 2

Author(s):

Jialun Li ◽

Li Zhang ◽

Zhongchen Wu ◽

Zongcheng Ling ◽

Xueqiang Cao ◽

...

Keyword(s):

Deep Learning ◽

Image Classification ◽

Learning Methods ◽

Rock Image

Download Full-text

Video Object Detection through Traditional and Deep Learning Methods

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.d6833.049420 ◽

2020 ◽

Vol 9 (4) ◽

pp. 1822-1826

Keyword(s):

Deep Learning ◽

Object Detection ◽

Sparse Matrix ◽

Image Understanding ◽

Low Rank ◽

Detection Methods ◽

Video Object ◽

Learning Methods ◽

Primary Focus ◽

High Level

Object detection in videos is gaining more attention recently as it is related to video analytics and facilitates image understanding and applicable to . The video object detection methods can be divided into traditional and deep learning based methods. Trajectory classification, low rank sparse matrix, background subtraction and object tracking are considered as traditional object detection methods as they primary focus is informative feature collection, region selection and classification. The deep learning methods are more popular now days as they facilitate high-level features and problem solving in object detection algorithms. We have discussed various object detection methods and challenges in this paper.

Download Full-text

Multiple Object Detection Based on Clustering and Deep Learning Methods

Sensors ◽

10.3390/s20164424 ◽

2020 ◽

Vol 20 (16) ◽

pp. 4424

Author(s):

Huu Thu Nguyen ◽

Eon-Ho Lee ◽

Chul Hee Bae ◽

Sejin Lee

Keyword(s):

Deep Learning ◽

Object Detection ◽

Spatial Clustering ◽

Clustering Algorithms ◽

Three Dimensional ◽

Autonomous Driving ◽

Learning Methods ◽

Multiple Object ◽

Multiple Object Detection ◽

Negative Effect

Multiple object detection is challenging yet crucial in computer vision. In This study, owing to the negative effect of noise on multiple object detection, two clustering algorithms are used on both underwater sonar images and three-dimensional point cloud LiDAR data to study and improve the performance result. The outputs from using deep learning methods on both types of data are treated with K-Means clustering and density-based spatial clustering of applications with noise (DBSCAN) algorithms to remove outliers, detect and cluster meaningful data, and improve the result of multiple object detections. Results indicate the potential application of the proposed method in the fields of object detection, autonomous driving system, and so forth.

Download Full-text

Mapping Center Pivot Irrigation Systems in the Southern Amazon from Sentinel-2 Images

Water ◽

10.3390/w13030298 ◽

2021 ◽

Vol 13 (3) ◽

pp. 298

Author(s):

Jiwen Tang ◽

Damien Arvor ◽

Thomas Corpetti ◽

Ping Tang

Keyword(s):

Deep Learning ◽

Object Detection ◽

Image Classification ◽

Water Consumption ◽

Hough Transform ◽

Classification Model ◽

Irrigation Systems ◽

Mato Grosso ◽

Center Pivot ◽

Sentinel 2

Irrigation systems play an important role in agriculture. Center pivot irrigation systems are popular in many countries as they are labor-saving and water consumption efficient. Monitoring the distribution of center pivot irrigation systems can provide important information for agricultural production, water consumption and land use. Deep learning has become an effective method for image classification and object detection. In this paper, a new method to detect the precise shape of center pivot irrigation systems is proposed. The proposed method combines a lightweight real-time object detection network (PVANET) based on deep learning, an image classification model (GoogLeNet) and accurate shape detection (Hough transform) to detect and accurately delineate center pivot irrigation systems and their associated circular shape. PVANET is lightweight and fast and GoogLeNet can reduce the false detections associated with PVANET, while Hough transform can accurately detect the shape of center pivot irrigation systems. Experiments with Sentinel-2 images in Mato Grosso achieved a precision of 95% and a recall of 95.5%, which demonstrated the effectiveness of the proposed method. Finally, with the accurate shape of center pivot irrigation systems detected, the area of irrigation in the region was estimated.

Download Full-text

Hair Fescue and Sheep Sorrel Identification Using Deep Learning in Wild Blueberry Production

Remote Sensing ◽

10.3390/rs13050943 ◽

2021 ◽

Vol 13 (5) ◽

pp. 943

Author(s):

Patrick J. Hennessy ◽

Travis J. Esau ◽

Aitazaz A. Farooque ◽

Arnold W. Schumann ◽

Qamar U. Zaman ◽

...

Keyword(s):

Deep Learning ◽

Object Detection ◽

Image Classification ◽

Cost Savings ◽

Training Dataset ◽

Specific Information ◽

Agricultural Efficiency ◽

Dataset Size ◽

Future Work ◽

Wild Blueberry

Deep learning convolutional neural networks (CNNs) are an emerging technology that provide an opportunity to increase agricultural efficiency through remote sensing and automatic inferencing of field conditions. This paper examined the novel use of CNNs to identify two weeds, hair fescue and sheep sorrel, in images of wild blueberry fields. Commercial herbicide sprayers provide a uniform application of agrochemicals to manage patches of these weeds. Three object-detection and three image-classification CNNs were trained to identify hair fescue and sheep sorrel using images from 58 wild blueberry fields. The CNNs were trained using 1280x720 images and were tested at four different internal resolutions. The CNNs were retrained with progressively smaller training datasets ranging from 3780 to 472 images to determine the effect of dataset size on accuracy. YOLOv3-Tiny was the best object-detection CNN, detecting at least one target weed per image with F1-scores of 0.97 for hair fescue and 0.90 for sheep sorrel at 1280 × 736 resolution. Darknet Reference was the most accurate image-classification CNN, classifying images containing hair fescue and sheep sorrel with F1-scores of 0.96 and 0.95, respectively at 1280 × 736. MobileNetV2 achieved comparable results at the lowest resolution, 864 × 480, with F1-scores of 0.95 for both weeds. Training dataset size had minimal effect on accuracy for all CNNs except Darknet Reference. This technology can be used in a smart sprayer to control target specific spray applications, reducing herbicide use. Future work will involve testing the CNNs for use on a smart sprayer and the development of an application to provide growers with field-specific information. Using CNNs to improve agricultural efficiency will create major cost-savings for wild blueberry producers.

Download Full-text

NOISY IMAGE CLASSIFICATION USING HYBRID DEEP LEARNING METHODS

Journal of Information and Communication Technology ◽

10.32890/jict2018.17.2.4 ◽

2018 ◽

Vol 17 (2) ◽

Cited By ~ 1

Author(s):

Sudipta Singha Roy ◽

Mahtab Ahmed ◽

Muhammad Aminul Haque Akhand

Keyword(s):

Deep Learning ◽

Image Classification ◽

Noisy Image ◽

Learning Methods

Download Full-text