Computerized Ultrasonic Imaging Inspection: From Shallow to Deep Learning

For many decades, ultrasonic imaging inspection has been adopted as a principal method to detect multiple defects, e.g., void and corrosion. However, the data interpretation relies on an inspector’s subjective judgment, thus making the results vulnerable to human error. Nowadays, advanced computer vision techniques reveal new perspectives on the high-level visual understanding of universal tasks. This research aims to develop an efficient automatic ultrasonic image analysis system for nondestructive testing (NDT) using the latest visual information processing technique. To this end, we first established an ultrasonic inspection image dataset containing 6849 ultrasonic scan images with full defect/no-defect annotations. Using the dataset, we performed a comprehensive experimental comparison of various computer vision techniques, including both conventional methods using hand-crafted visual features and the most recent convolutional neural networks (CNN) which generate multiple-layer stacking for representation learning. In the computer vision community, the two groups are referred to as shallow and deep learning, respectively. Experimental results make it clear that the deep learning-enabled system outperformed conventional (shallow) learning schemes by a large margin. We believe this benchmarking could be used as a reference for similar research dealing with automatic defect detection in ultrasonic imaging inspection.

Download Full-text

Representation Learning: A Statistical Perspective

Annual Review of Statistics and Its Application ◽

10.1146/annurev-statistics-031219-041131 ◽

2020 ◽

Vol 7 (1) ◽

pp. 303-335 ◽

Cited By ~ 1

Author(s):

Jianwen Xie ◽

Ruiqi Gao ◽

Erik Nijkamp ◽

Song-Chun Zhu ◽

Ying Nian Wu

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Factor Analysis ◽

Deep Learning ◽

Multidimensional Scaling ◽

Computational Neuroscience ◽

Representation Learning ◽

Central Theme ◽

Matrix Representations ◽

Vector Representations

Learning representations of data is an important problem in statistics and machine learning. While the origin of learning representations can be traced back to factor analysis and multidimensional scaling in statistics, it has become a central theme in deep learning with important applications in computer vision and computational neuroscience. In this article, we review recent advances in learning representations from a statistical perspective. In particular, we review the following two themes: ( a) unsupervised learning of vector representations and ( b) learning of both vector and matrix representations.

Download Full-text

Detecting Surface Cracks on Buildings Using Computer Vision: An Experimental Comparison of Digital Image Processing and Deep Learning

Advances in Intelligent Systems and Computing - Soft Computing and Signal Processing ◽

10.1007/978-981-16-1249-7_20 ◽

2021 ◽

pp. 197-210

Author(s):

Ramshankar Yadhunath ◽

Srivenkata Srikanth ◽

Arvind Sudheer ◽

C. Jyotsna ◽

J. Amudha

Keyword(s):

Image Processing ◽

Computer Vision ◽

Deep Learning ◽

Digital Image Processing ◽

Digital Image ◽

Experimental Comparison ◽

Surface Cracks

Download Full-text

Generating Images of Face Poses for Pose Varying Face Recognition

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.f9998.079220 ◽

2020 ◽

Vol 9 (2) ◽

pp. 351-356

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Face Recognition ◽

Representation Learning ◽

Image Synthesis ◽

Face Image ◽

Learning Systems ◽

Face Expression ◽

Face Images ◽

The Face

Deep learning has attracted several researchers in the field of computer vision due to its ability to perform face and object recognition tasks with high accuracy than the traditional shallow learning systems. The convolutional layers present in the deep learning systems help to successfully capture the distinctive features of the face. For biometric authentication, face recognition (FR) has been preferred due to its passive nature. Processing face images are accompanied by a series of complexities, like variation of pose, light, face expression, and make up. Although all aspects are important, the one that impacts the most face-related computer vision applications is pose. In face recognition, it has been long desired to have a method capable of bringing faces to the same pose, usually a frontal view, in order to ease recognition. Synthesizing different views of a face is still a great challenge, mostly because in nonfrontal face images there are loss of information when one side of the face occludes the other. Most solutions for FR fail to perform well in cases involving extreme pose variations as in such scenarios, the convolutional layers of the deep models are unable to find discriminative parts of the face for extracting information. Most of the architectures proposed earlier deal with the scenarios where the face images used for training as well as testing the deep learning models are frontal and nearfrontal. On the contrary, here a limited number of face images at different poses is used to train the model, where a number of separate generator models learn to map a single face image at any arbitrary pose to specific poses and the discriminator performs the task of face recognition along with discriminating a synthetic face from a realworld sample. To this end, this paper proposes a representation learning by rotating the face. Here an encoderdecoder structure of the generator enables to learn a representation that is both generative and discriminative, which can be used for face image synthesis and pose-invariant face recognition. This representation is explicitly disentangled from other face variations such as pose, through the pose code provided to the decoder and pose estimation in the discriminator.

Download Full-text

Tensor Methods in Computer Vision and Deep Learning

Proceedings of the IEEE ◽

10.1109/jproc.2021.3074329 ◽

2021 ◽

Vol 109 (5) ◽

pp. 863-890

Author(s):

Yannis Panagakis ◽

Jean Kossaifi ◽

Grigorios G. Chrysos ◽

James Oldfield ◽

Mihalis A. Nicolaou ◽

...

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Tensor Methods

Download Full-text

Maize-IAS: a maize image analysis software using deep learning for high-throughput plant phenotyping

Plant Methods ◽

10.1186/s13007-021-00747-0 ◽

2021 ◽

Vol 17 (1) ◽

Author(s):

Shuo Zhou ◽

Xiujuan Chai ◽

Zixuan Yang ◽

Hongwu Wang ◽

Chenxue Yang ◽

...

Keyword(s):

Computer Vision ◽

Image Analysis ◽

Deep Learning ◽

High Throughput ◽

Batch Processing ◽

Plant Phenotyping ◽

Plant Science ◽

Analysis Software ◽

Image Analysis Software ◽

Maize Growth

Abstract Background Maize (Zea mays L.) is one of the most important food sources in the world and has been one of the main targets of plant genetics and phenotypic research for centuries. Observation and analysis of various morphological phenotypic traits during maize growth are essential for genetic and breeding study. The generally huge number of samples produce an enormous amount of high-resolution image data. While high throughput plant phenotyping platforms are increasingly used in maize breeding trials, there is a reasonable need for software tools that can automatically identify visual phenotypic features of maize plants and implement batch processing on image datasets. Results On the boundary between computer vision and plant science, we utilize advanced deep learning methods based on convolutional neural networks to empower the workflow of maize phenotyping analysis. This paper presents Maize-IAS (Maize Image Analysis Software), an integrated application supporting one-click analysis of maize phenotype, embedding multiple functions: (I) Projection, (II) Color Analysis, (III) Internode length, (IV) Height, (V) Stem Diameter and (VI) Leaves Counting. Taking the RGB image of maize as input, the software provides a user-friendly graphical interaction interface and rapid calculation of multiple important phenotypic characteristics, including leaf sheath points detection and leaves segmentation. In function Leaves Counting, the mean and standard deviation of difference between prediction and ground truth are 1.60 and 1.625. Conclusion The Maize-IAS is easy-to-use and demands neither professional knowledge of computer vision nor deep learning. All functions for batch processing are incorporated, enabling automated and labor-reduced tasks of recording, measurement and quantitative analysis of maize growth traits on a large dataset. We prove the efficiency and potential capability of our techniques and software to image-based plant research, which also demonstrates the feasibility and capability of AI technology implemented in agriculture and plant science.

Download Full-text

Research on computer vision enhancement in intelligent robot based on machine learning and deep learning

Neural Computing and Applications ◽

10.1007/s00521-021-05898-8 ◽

2021 ◽

Author(s):

Yuhan Ding ◽

Lisha Hua ◽

Shunlei Li

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Deep Learning ◽

Intelligent Robot

Download Full-text

An Automated Light Trap to Monitor Moths (Lepidoptera) Using Computer Vision-Based Tracking and Deep Learning

Sensors ◽

10.3390/s21020343 ◽

2021 ◽

Vol 21 (2) ◽

pp. 343

Author(s):

Kim Bjerge ◽

Jakob Bonde Nielsen ◽

Martin Videbæk Sepstrup ◽

Flemming Helsing-Nielsen ◽

Toke Thomas Høye

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Vision System ◽

Low Cost ◽

Light Trap ◽

Automatic Monitoring ◽

Light Sources ◽

Monitoring Methods ◽

Computer Vision System ◽

Substantial Investment

Insect monitoring methods are typically very time-consuming and involve substantial investment in species identification following manual trapping in the field. Insect traps are often only serviced weekly, resulting in low temporal resolution of the monitoring data, which hampers the ecological interpretation. This paper presents a portable computer vision system capable of attracting and detecting live insects. More specifically, the paper proposes detection and classification of species by recording images of live individuals attracted to a light trap. An Automated Moth Trap (AMT) with multiple light sources and a camera was designed to attract and monitor live insects during twilight and night hours. A computer vision algorithm referred to as Moth Classification and Counting (MCC), based on deep learning analysis of the captured images, tracked and counted the number of insects and identified moth species. Observations over 48 nights resulted in the capture of more than 250,000 images with an average of 5675 images per night. A customized convolutional neural network was trained on 2000 labeled images of live moths represented by eight different classes, achieving a high validation F1-score of 0.93. The algorithm measured an average classification and tracking F1-score of 0.71 and a tracking detection rate of 0.79. Overall, the proposed computer vision system and algorithm showed promising results as a low-cost solution for non-destructive and automatic monitoring of moths.

Download Full-text

Representation Learning for Fine-Grained Change Detection

Sensors ◽

10.3390/s21134486 ◽

2021 ◽

Vol 21 (13) ◽

pp. 4486

Author(s):

Niall O’Mahony ◽

Sean Campbell ◽

Lenka Krpalkova ◽

Anderson Carvalho ◽

Joseph Walsh ◽

...

Keyword(s):

Deep Learning ◽

Change Detection ◽

Model Calibration ◽

State Of The Art ◽

Representation Learning ◽

Machine Intelligence ◽

The State ◽

Sensor Data ◽

Fine Grained ◽

Learning Techniques

Fine-grained change detection in sensor data is very challenging for artificial intelligence though it is critically important in practice. It is the process of identifying differences in the state of an object or phenomenon where the differences are class-specific and are difficult to generalise. As a result, many recent technologies that leverage big data and deep learning struggle with this task. This review focuses on the state-of-the-art methods, applications, and challenges of representation learning for fine-grained change detection. Our research focuses on methods of harnessing the latent metric space of representation learning techniques as an interim output for hybrid human-machine intelligence. We review methods for transforming and projecting embedding space such that significant changes can be communicated more effectively and a more comprehensive interpretation of underlying relationships in sensor data is facilitated. We conduct this research in our work towards developing a method for aligning the axes of latent embedding space with meaningful real-world metrics so that the reasoning behind the detection of change in relation to past observations may be revealed and adjusted. This is an important topic in many fields concerned with producing more meaningful and explainable outputs from deep learning and also for providing means for knowledge injection and model calibration in order to maintain user confidence.

Download Full-text