Inspecting Buildings Using Drones and Computer Vision: A Machine Learning Approach to Detect Cracks and Damages

Manual inspection of infrastructure damages such as building cracks is difficult due to the objectivity and reliability of assessment and high demands of time and costs. This can be automated using unmanned aerial vehicles (UAVs) for aerial imagery of damages. Numerous computer vision-based approaches have been applied to address the limitations of crack detection but they have their limitations that can be overcome by using various hybrid approaches based on artificial intelligence (AI) and machine learning (ML) techniques. The convolutional neural networks (CNNs), an application of the deep learning (DL) method, display remarkable potential for automatically detecting image features such as damages and are less sensitive to image noise. A modified deep hierarchical CNN architecture has been used in this study for crack detection and damage assessment in civil infrastructures. The proposed architecture is based on 16 convolution layers and a cycle generative adversarial network (CycleGAN). For this study, the crack images were collected using UAVs and open-source images of mid to high rise buildings (five stories and above) constructed during 2000 in Sydney, Australia. Conventionally, a CNN network only utilizes the last layer of convolution. However, our proposed network is based on the utility of multiple layers. Another important component of the proposed CNN architecture is the application of guided filtering (GF) and conditional random fields (CRFs) to refine the predicted outputs to get reliable results. Benchmarking data (600 images) of Sydney-based buildings damages was used to test the proposed architecture. The proposed deep hierarchical CNN architecture produced superior performance when evaluated using five methods: GF method, Baseline (BN) method, Deep-Crack BN, Deep-Crack GF, and SegNet. Overall, the GF method outperformed all other methods as indicated by the global accuracy (0.990), class average accuracy (0.939), mean intersection of the union overall classes (IoU) (0.879), precision (0.838), recall (0.879), and F-score (0.8581) values. Overall, the proposed CNN architecture provides the advantages of reduced noise, highly integrated supervision of features, adequate learning, and aggregation of both multi-scale and multilevel features during the training procedure along with the refinement of the overall output predictions.

Download Full-text

Subjectively Measured Streetscape Qualities for Shanghai with Large-Scale Application of Computer Vision and Machine Learning

10.1007/978-981-16-5983-6_23 ◽

2021 ◽

pp. 242-251

Author(s):

Waishan Qiu ◽

Wenjing Li ◽

Xun Liu ◽

Xiaokai Huang

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Urban Design ◽

Design Theory ◽

Large Scale ◽

Image Features ◽

Subjective Measures ◽

Visual Elements ◽

Urban Amenities ◽

Human Perceptions

AbstractRecently, many new studies emerged to apply computer vision (CV) to street view imagery (SVI) dataset to objectively extract the view indices of various streetscape features such as trees to proxy urban scene qualities. However, human perceptions (e.g., imageability) have a subtle relationship to visual elements which cannot be fully captured using view indices. Conversely, subjective measures using survey and interview data explain more human behaviors. However, the effectiveness of integrating subjective measures with SVI dataset has been less discussed. To address this, we integrated crowdsourcing, CV, and machine learning (ML) to subjectively measure four important perceptions suggested by classical urban design theory. We first collected experts’ rating on sample SVIs regarding the four qualities which became the training labels. CV segmentation was applied to SVI samples extracting streetscape view indices as the explanatory variables. We then trained ML models and achieved high accuracy in predicting the scores. We found a strong correlation between predicted complexity score and the density of urban amenities and services Point of Interests (POI), which validates the effectiveness of subjective measures. In addition, to test the generalizability of the proposed framework as well as to inform urban renewal strategies, we compared the measured qualities in Pudong to other five renowned urban cores worldwide. Rather than predicting perceptual scores directly from generic image features using convolution neural network, our approach follows what urban design theory suggested and confirms various streetscape features affecting multi-dimensional human perceptions. Therefore, its result provides more interpretable and actionable implications for policymakers and city planners.

Download Full-text

Computer Vision Classification of Barley Flour Based on Spatial Pyramid Partition Ensemble

Sensors ◽

10.3390/s19132953 ◽

2019 ◽

Vol 19 (13) ◽

pp. 2953 ◽

Cited By ~ 6

Author(s):

Jessica Fernandes Lopes ◽

Leniza Ludwig ◽

Douglas Fernandes Barbin ◽

Maria Victória Eiras Grossmann ◽

Sylvio Barbon

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Image Features ◽

Machine Learning Algorithms ◽

Support Vector ◽

Human Consumption ◽

Barley Flour ◽

Quality Aspects ◽

Spatial Pyramid

Imaging sensors are largely employed in the food processing industry for quality control. Flour from malting barley varieties is a valuable ingredient in the food industry, but its use is restricted due to quality aspects such as color variations and the presence of husk fragments. On the other hand, naked varieties present superior quality with better visual appearance and nutritional composition for human consumption. Computer Vision Systems (CVS) can provide an automatic and precise classification of samples, but identification of grain and flour characteristics require more specialized methods. In this paper, we propose CVS combined with the Spatial Pyramid Partition ensemble (SPPe) technique to distinguish between naked and malting types of twenty-two flour varieties using image features and machine learning. SPPe leverages the analysis of patterns from different spatial regions, providing more reliable classification. Support Vector Machine (SVM), k-Nearest Neighbors (k-NN), J48 decision tree, and Random Forest (RF) were compared for samples’ classification. Machine learning algorithms embedded in the CVS were induced based on 55 image features. The results ranged from 75.00% (k-NN) to 100.00% (J48) accuracy, showing that sample assessment by CVS with SPPe was highly accurate, representing a potential technique for automatic barley flour classification.

Download Full-text

Image Classification and Retrieval with Kernel Methods

Kernel Methods in Bioengineering, Signal and Image Processing ◽

10.4018/978-1-59904-042-4.ch014 ◽

2011 ◽

pp. 325-345

Author(s):

Francesca Odone ◽

Alessandro Verri

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Image Classification ◽

Kernel Methods ◽

Image Features ◽

Kernel Functions ◽

Current Work ◽

Classification Problems ◽

Image Representations ◽

Open Issues

In this chapter we review some kernel methods useful for image classification and retrieval applications. Starting from the problem of constructing appropriate image representations, we describe in depth and comment on the main properties of various kernel engineering approaches that have been recently proposed in the computer vision and machine learning literature for solving a number of image classification problems. We distinguish between kernel functions applied to images as a whole and kernel functions looking at image features. We conclude by presenting some current work and discussing open issues.

Download Full-text

ORGANIC (1).pdf

10.26434/chemrxiv.5309668.v1 ◽

2017 ◽

Author(s):

Benjamin Sanchez-Lengeling ◽

Carlos Outeiral ◽

Gabriel L. Guimaraes ◽

Alan Aspuru-Guzik

Keyword(s):

Machine Learning ◽

Learning Community ◽

Chemical Species ◽

Material Design ◽

Organic Photovoltaic ◽

Generative Adversarial Networks ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks ◽

Photovoltaic Material

Molecular discovery seeks to generate chemical species tailored to very specific needs. In this paper, we present ORGANIC, a framework based on Objective-Reinforced Generative Adversarial Networks (ORGAN), capable of producing a distribution over molecular space that matches with a certain set of desirable metrics. This methodology combines two successful techniques from the machine learning community: a Generative Adversarial Network (GAN), to create non-repetitive sensible molecular species, and Reinforcement Learning (RL), to bias this generative distribution towards certain attributes. We explore several applications, from optimization of random physicochemical properties to candidates for drug discovery and organic photovoltaic material design.

Download Full-text

A Study on Augmented Reality Assisted Navigation App Using Machine Learning and Computer Vision

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v7i2.412415 ◽

2019 ◽

Vol 7 (2) ◽

pp. 412-415

Author(s):

Waheeda Dhokley ◽

Asif Syed ◽

Nitika Tomar ◽

Riya Patil

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Augmented Reality ◽

Assisted Navigation

Download Full-text

Levee-Crack Detection from Satellite or Drone Imagery Using Machine Learning Approaches

IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss39084.2020.9323936 ◽

2020 ◽

Author(s):

Aditi Kuchi ◽

Md Tamjidul Hoque ◽

Mahdi Abdelguerfi ◽

Maik C. Flanagin

Keyword(s):

Machine Learning ◽

Crack Detection ◽

Learning Approaches

Download Full-text

Research on computer vision enhancement in intelligent robot based on machine learning and deep learning

Neural Computing and Applications ◽

10.1007/s00521-021-05898-8 ◽

2021 ◽

Author(s):

Yuhan Ding ◽

Lisha Hua ◽

Shunlei Li

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Deep Learning ◽

Intelligent Robot

Download Full-text

An approach to human iris recognition using quantitative analysis of image features and machine learning

2020 IEEE Global Humanitarian Technology Conference (GHTC) ◽

10.1109/ghtc46280.2020.9342935 ◽

2020 ◽

Author(s):

Abolfazl Zargari Khuzani ◽

Najmeh Mashhadi ◽

Morteza Heidari ◽

Donya Khaledyan

Keyword(s):

Machine Learning ◽

Quantitative Analysis ◽

Iris Recognition ◽

Image Features ◽

Human Iris

Download Full-text

A Systematic Survey of ML Datasets for Prime CV Research Areas—Media and Metadata

Data ◽

10.3390/data6020012 ◽

2021 ◽

Vol 6 (2) ◽

pp. 12

Author(s):

Helder F. Castro ◽

Jaime S. Cardoso ◽

Maria T. Andrade

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Qualitative Analysis ◽

Continuous Growth ◽

Scientific Databases ◽

Extensive Survey ◽

Research Areas ◽

Current State ◽

Global View ◽

Cooperative Production

The ever-growing capabilities of computers have enabled pursuing Computer Vision through Machine Learning (i.e., MLCV). ML tools require large amounts of information to learn from (ML datasets). These are costly to produce but have received reduced attention regarding standardization. This prevents the cooperative production and exploitation of these resources, impedes countless synergies, and hinders ML research. No global view exists of the MLCV dataset tissue. Acquiring it is fundamental to enable standardization. We provide an extensive survey of the evolution and current state of MLCV datasets (1994 to 2019) for a set of specific CV areas as well as a quantitative and qualitative analysis of the results. Data were gathered from online scientific databases (e.g., Google Scholar, CiteSeerX). We reveal the heterogeneous plethora that comprises the MLCV dataset tissue; their continuous growth in volume and complexity; the specificities of the evolution of their media and metadata components regarding a range of aspects; and that MLCV progress requires the construction of a global standardized (structuring, manipulating, and sharing) MLCV “library”. Accordingly, we formulate a novel interpretation of this dataset collective as a global tissue of synthetic cognitive visual memories and define the immediately necessary steps to advance its standardization and integration.

Download Full-text

Automotive OEM Demand Forecasting: A Comparative Study of Forecasting Algorithms and Strategies

Applied Sciences ◽

10.3390/app11156787 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6787

Author(s):

Jože M. Rožanec ◽

Blaž Kažič ◽

Maja Škrjanc ◽

Blaž Fortuna ◽

Dunja Mladenić

Keyword(s):

Machine Learning ◽

Automotive Industry ◽

Demand Management ◽

Demand Forecasting ◽

Machine Learning Algorithms ◽

Superior Performance ◽

Forecast Errors ◽

Manufacturing Companies ◽

Case Scenario ◽

Crucial Component

Demand forecasting is a crucial component of demand management, directly impacting manufacturing companies’ planning, revenues, and actors through the supply chain. We evaluate 21 baseline, statistical, and machine learning algorithms to forecast smooth and erratic demand on a real-world use case scenario. The products’ data were obtained from a European original equipment manufacturer targeting the global automotive industry market. Our research shows that global machine learning models achieve superior performance than local models. We show that forecast errors from global models can be constrained by pooling product data based on the past demand magnitude. We also propose a set of metrics and criteria for a comprehensive understanding of demand forecasting models’ performance.

Download Full-text