Deep Learning in Data-Driven Pavement Image Analysis and Automated Distress Detection: A Review

Deep learning, more specifically deep convolutional neural networks, is fast becoming a popular choice for computer vision-based automated pavement distress detection. While pavement image analysis has been extensively researched over the past three decades or so, recent ground-breaking achievements of deep learning algorithms in the areas of machine translation, speech recognition, and computer vision has sparked interest in the application of deep learning to automated detection of distresses in pavement images. This paper provides a narrative review of recently published studies in this field, highlighting the current achievements and challenges. A comparison of the deep learning software frameworks, network architecture, hyper-parameters employed by each study, and crack detection performance is provided, which is expected to provide a good foundation for driving further research on this important topic in the context of smart pavement or asset management systems. The review concludes with potential avenues for future research; especially in the application of deep learning to not only detect, but also characterize the type, extent, and severity of distresses from 2D and 3D pavement images.

Download Full-text

Maize-IAS: a maize image analysis software using deep learning for high-throughput plant phenotyping

Plant Methods ◽

10.1186/s13007-021-00747-0 ◽

2021 ◽

Vol 17 (1) ◽

Author(s):

Shuo Zhou ◽

Xiujuan Chai ◽

Zixuan Yang ◽

Hongwu Wang ◽

Chenxue Yang ◽

...

Keyword(s):

Computer Vision ◽

Image Analysis ◽

Deep Learning ◽

High Throughput ◽

Batch Processing ◽

Plant Phenotyping ◽

Plant Science ◽

Analysis Software ◽

Image Analysis Software ◽

Maize Growth

Abstract Background Maize (Zea mays L.) is one of the most important food sources in the world and has been one of the main targets of plant genetics and phenotypic research for centuries. Observation and analysis of various morphological phenotypic traits during maize growth are essential for genetic and breeding study. The generally huge number of samples produce an enormous amount of high-resolution image data. While high throughput plant phenotyping platforms are increasingly used in maize breeding trials, there is a reasonable need for software tools that can automatically identify visual phenotypic features of maize plants and implement batch processing on image datasets. Results On the boundary between computer vision and plant science, we utilize advanced deep learning methods based on convolutional neural networks to empower the workflow of maize phenotyping analysis. This paper presents Maize-IAS (Maize Image Analysis Software), an integrated application supporting one-click analysis of maize phenotype, embedding multiple functions: (I) Projection, (II) Color Analysis, (III) Internode length, (IV) Height, (V) Stem Diameter and (VI) Leaves Counting. Taking the RGB image of maize as input, the software provides a user-friendly graphical interaction interface and rapid calculation of multiple important phenotypic characteristics, including leaf sheath points detection and leaves segmentation. In function Leaves Counting, the mean and standard deviation of difference between prediction and ground truth are 1.60 and 1.625. Conclusion The Maize-IAS is easy-to-use and demands neither professional knowledge of computer vision nor deep learning. All functions for batch processing are incorporated, enabling automated and labor-reduced tasks of recording, measurement and quantitative analysis of maize growth traits on a large dataset. We prove the efficiency and potential capability of our techniques and software to image-based plant research, which also demonstrates the feasibility and capability of AI technology implemented in agriculture and plant science.

Download Full-text

Aerial Imagery Paddy Seedlings Inspection Using Deep Learning

Remote Sensing ◽

10.3390/rs14020274 ◽

2022 ◽

Vol 14 (2) ◽

pp. 274

Author(s):

Mohamed Marzhar Anuar ◽

Alfian Abdul Halin ◽

Thinagaran Perumal ◽

Bahareh Kalantar

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Production Costs ◽

Fine Tuning ◽

Aerial Imagery ◽

Planting Density ◽

Deep Convolutional Neural Networks ◽

Security Issues ◽

Agriculture Industry ◽

Paddy Cultivation

In recent years complex food security issues caused by climatic changes, limitations in human labour, and increasing production costs require a strategic approach in addressing problems. The emergence of artificial intelligence due to the capability of recent advances in computing architectures could become a new alternative to existing solutions. Deep learning algorithms in computer vision for image classification and object detection can facilitate the agriculture industry, especially in paddy cultivation, to alleviate human efforts in laborious, burdensome, and repetitive tasks. Optimal planting density is a crucial factor for paddy cultivation as it will influence the quality and quantity of production. There have been several studies involving planting density using computer vision and remote sensing approaches. While most of the studies have shown promising results, they have disadvantages and show room for improvement. One of the disadvantages is that the studies aim to detect and count all the paddy seedlings to determine planting density. The defective paddy seedlings’ locations are not pointed out to help farmers during the sowing process. In this work we aimed to explore several deep convolutional neural networks (DCNN) models to determine which one performs the best for defective paddy seedling detection using aerial imagery. Thus, we evaluated the accuracy, robustness, and inference latency of one- and two-stage pretrained object detectors combined with state-of-the-art feature extractors such as EfficientNet, ResNet50, and MobilenetV2 as a backbone. We also investigated the effect of transfer learning with fine-tuning on the performance of the aforementioned pretrained models. Experimental results showed that our proposed methods were capable of detecting the defective paddy rice seedlings with the highest precision and an F1-Score of 0.83 and 0.77, respectively, using a one-stage pretrained object detector called EfficientDet-D1 EficientNet.

Download Full-text

Geographic Object-Based Image Analysis: A Primer and Future Directions

Remote Sensing ◽

10.3390/rs12122012 ◽

2020 ◽

Vol 12 (12) ◽

pp. 2012 ◽

Cited By ~ 3

Author(s):

Maja Kucharczyk ◽

Geoffrey J. Hay ◽

Salar Ghaffarian ◽

Chris H. Hugenholtz

Keyword(s):

Image Analysis ◽

Current Status ◽

Future Research ◽

Deep Convolutional Neural Networks ◽

Comprehensive Overview ◽

Image Objects ◽

Object Based Image Analysis ◽

Object Based ◽

Geographic Object ◽

Methodological Considerations

Geographic object-based image analysis (GEOBIA) is a remote sensing image analysis paradigm that defines and examines image-objects: groups of neighboring pixels that represent real-world geographic objects. Recent reviews have examined methodological considerations and highlighted how GEOBIA improves upon the 30+ year pixel-based approach, particularly for H-resolution imagery. However, the literature also exposes an opportunity to improve guidance on the application of GEOBIA for novice practitioners. In this paper, we describe the theoretical foundations of GEOBIA and provide a comprehensive overview of the methodological workflow, including: (i) software-specific approaches (open-source and commercial); (ii) best practices informed by research; and (iii) the current status of methodological research. Building on this foundation, we then review recent research on the convergence of GEOBIA with deep convolutional neural networks, which we suggest is a new form of GEOBIA. Specifically, we discuss general integrative approaches and offer recommendations for future research. Overall, this paper describes the past, present, and anticipated future of GEOBIA in a novice-accessible format, while providing innovation and depth to experienced practitioners.

Download Full-text

Deep Learning Based Switching Filter for Impulsive Noise Removal in Color Images

Sensors ◽

10.3390/s20102782 ◽

2020 ◽

Vol 20 (10) ◽

pp. 2782

Author(s):

Krystian Radlak ◽

Lukasz Malinski ◽

Bogdan Smolka

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Network Architecture ◽

Noise Suppression ◽

Impulsive Noise ◽

Noise Removal ◽

Vision Systems ◽

Mean Filter ◽

Computer Vision Systems ◽

Active Research

Noise reduction is one of the most important and still active research topics in low-level image processing due to its high impact on object detection and scene understanding for computer vision systems. Recently, we observed a substantially increased interest in the application of deep learning algorithms. Many computer vision systems use them, due to their impressive capability of feature extraction and classification. While these methods have also been successfully applied in image denoising, significantly improving its performance, most of the proposed approaches were designed for Gaussian noise suppression. In this paper, we present a switching filtering technique intended for impulsive noise removal using deep learning. In the proposed method, the distorted pixels are detected using a deep neural network architecture and restored with the fast adaptive mean filter. The performed experiments show that the proposed approach is superior to the state-of-the-art filters designed for impulsive noise removal in color digital images.

Download Full-text

Deep Convolutional Neural Networks with transfer learning for computer vision-based data-driven pavement distress detection

Construction and Building Materials ◽

10.1016/j.conbuildmat.2017.09.110 ◽

2017 ◽

Vol 157 ◽

pp. 322-330 ◽

Cited By ~ 204

Author(s):

Kasthurirangan Gopalakrishnan ◽

Siddhartha K. Khaitan ◽

Alok Choudhary ◽

Ankit Agrawal

Keyword(s):

Neural Networks ◽

Computer Vision ◽

Transfer Learning ◽

Convolutional Neural Networks ◽

Data Driven ◽

Deep Convolutional Neural Networks ◽

Pavement Distress ◽

Pavement Distress Detection

Download Full-text

Salient Object Detection Techniques in Computer Vision—A Survey

Entropy ◽

10.3390/e22101174 ◽

2020 ◽

Vol 22 (10) ◽

pp. 1174

Author(s):

Ashish Kumar Gupta ◽

Ayan Seal ◽

Mukesh Prasad ◽

Pritee Khanna

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Object Detection ◽

Large Scale ◽

Research Work ◽

Salient Object Detection ◽

Future Research ◽

Automatic Identification ◽

Salient Object ◽

Detection Techniques

Detection and localization of regions of images that attract immediate human visual attention is currently an intensive area of research in computer vision. The capability of automatic identification and segmentation of such salient image regions has immediate consequences for applications in the field of computer vision, computer graphics, and multimedia. A large number of salient object detection (SOD) methods have been devised to effectively mimic the capability of the human visual system to detect the salient regions in images. These methods can be broadly categorized into two categories based on their feature engineering mechanism: conventional or deep learning-based. In this survey, most of the influential advances in image-based SOD from both conventional as well as deep learning-based categories have been reviewed in detail. Relevant saliency modeling trends with key issues, core techniques, and the scope for future research work have been discussed in the context of difficulties often faced in salient object detection. Results are presented for various challenging cases for some large-scale public datasets. Different metrics considered for assessment of the performance of state-of-the-art salient object detection models are also covered. Some future directions for SOD are presented towards end.

Download Full-text

Deep Learning-based Image Analysis for High-content Screening

10.26686/wgtn.16892482 ◽

2021 ◽

Author(s):

◽

Dylon Zeng

Keyword(s):

Image Analysis ◽

Deep Learning ◽

Network Architecture ◽

Estimating Equation ◽

Time Frame ◽

High Content Screening ◽

Similarity Analysis ◽

Analysis Pipeline ◽

Data Set ◽

A Cell

High-content screening is an empirical strategy in drug discovery toidentify substances capable of altering cellular phenotype — the set ofobservable characteristics of a cell — in a desired way. Throughout thepast two decades, high-content screening has gathered significant attentionfrom academia and the pharmaceutical industry. However, imageanalysis remains a considerable hindrance to the widespread applicationof high-content screening. Standard image analysis relies on feature engineeringand suffers from inherent drawbacks such as the dependence onannotated inputs. There is an urging need for reliable and more efficientmethods to cope with increasingly large amounts of data produced. This thesis centres around the design and implementation of a deeplearning-based image analysis pipeline for high-content screening. Theend goal is to identify and cluster hit compounds that significantly alterthe phenotype of a cell. The proposed pipeline replaces feature engineeringwith a k-nearest neighbour-based similarity analysis. In addition, featureextraction using convolutional autoencoders is applied to reduce thenegative effects of noise on hit selection. As a result, the feature engineeringprocess is circumvented. A novel similarity measure is developed tofacilitate similarity analysis. Moreover, we combine deep learning withstatistical modelling to achieve optimal results. Preliminary explorationssuggest that the choice of hyperparameters have a direct impact on neuralnetwork performance. Generalised estimating equation models are usedto predict the most suitable neural network architecture for the input data. Using the proposed pipeline, we analyse an extensive set of images acquiredfrom a series of cell-based assays examining the effect of 282 FDAapproved drugs. The analysis of this data set produces a shortlist of drugsthat can significantly alter a cell’s phenotype, then further identifies fiveclusters of the shortlisted drugs. The clustering results present groups ofexisting drugs that have the potential to be repurposed for new therapeuticuses. Furthermore, our findings align with published studies. Comparedwith other neural networks, the image analysis pipeline proposedin this thesis provides reliable and better results in a shorter time frame.

Download Full-text

Artificial Neural Network-Based Automated Crack Detection and Analysis for the Inspection of Concrete Structures

Applied Sciences ◽

10.3390/app10228105 ◽

2020 ◽

Vol 10 (22) ◽

pp. 8105

Author(s):

Jung Jin Kim ◽

Ah-Ram Kim ◽

Seong-Won Lee

Keyword(s):

Neural Network ◽

Image Analysis ◽

Deep Learning ◽

Crack Detection ◽

Feature Learning ◽

Small Scale ◽

Image Analysis Technique ◽

Analysis Technique ◽

Third Stage ◽

Deep Learning Network

The damage investigation and inspection methods for infrastructures performed in small-scale (type III) facilities usually involve a visual examination by an inspector using surveying tools (e.g., cracking, crack microscope, etc.) in the field. These methods can interfere with the subjectivity of the inspector, which may reduce the objectivity and reliability of the record. Therefore, a new image analysis technique is needed to automatically detect cracks and analyze the characteristics of the cracks objectively. In this study, an image analysis technique using deep learning is developed to detect cracks and analyze characteristics (e.g., length, and width) in images for small-scale facilities. Three stages of image processing pipeline are proposed to obtain crack detection and its characteristics. In the first and second stages, two-dimensional convolutional neural networks are used for crack image detection (e.g., classification and segmentation). Based on convolution neural network for the detection, hierarchical feature learning architecture is applied into our deep learning network. After deep learning-based detection, in the third stage, thinning and tracking algorithms are applied to analyze length and width of crack in the image. The performance of the proposed method was tested using various crack images with label and the results showed good performance of crack detection and its measurement.

Download Full-text

Learning Cartographic Building Generalization with Deep Convolutional Neural Networks

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi8060258 ◽

2019 ◽

Vol 8 (6) ◽

pp. 258 ◽

Cited By ~ 13

Author(s):

Yu Feng ◽

Frank Thiemann ◽

Monika Sester

Keyword(s):

Neural Networks ◽

Computer Vision ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Physical Reality ◽

Learning Approaches ◽

Deep Convolutional Neural Networks ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Cartographic Generalization

Cartographic generalization is a problem, which poses interesting challenges to automation. Whereas plenty of algorithms have been developed for the different sub-problems of generalization (e.g., simplification, displacement, aggregation), there are still cases, which are not generalized adequately or in a satisfactory way. The main problem is the interplay between different operators. In those cases the human operator is the benchmark, who is able to design an aesthetic and correct representation of the physical reality. Deep learning methods have shown tremendous success for interpretation problems for which algorithmic methods have deficits. A prominent example is the classification and interpretation of images, where deep learning approaches outperform traditional computer vision methods. In both domains-computer vision and cartography-humans are able to produce good solutions. A prerequisite for the application of deep learning is the availability of many representative training examples for the situation to be learned. As this is given in cartography (there are many existing map series), the idea in this paper is to employ deep convolutional neural networks (DCNNs) for cartographic generalizations tasks, especially for the task of building generalization. Three network architectures, namely U-net, residual U-net and generative adversarial network (GAN), are evaluated both quantitatively and qualitatively in this paper. They are compared based on their performance on this task at target map scales 1:10,000, 1:15,000 and 1:25,000, respectively. The results indicate that deep learning models can successfully learn cartographic generalization operations in one single model in an implicit way. The residual U-net outperforms the others and achieved the best generalization performance.

Download Full-text

Computer Vision and Deep Learning for Real-Time Pavement Distress Detection

Advances in Informatics and Computing in Civil and Construction Engineering ◽

10.1007/978-3-030-00220-6_72 ◽

2018 ◽

pp. 601-607 ◽

Cited By ~ 2

Author(s):

Kristina Doycheva ◽

Christian Koch ◽

Markus König

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Real Time ◽

Pavement Distress ◽

Pavement Distress Detection

Download Full-text