Automated pill quality inspection using deep learning

International Journal of Modern Physics B ◽

10.1142/s0217979221400506 ◽

2021 ◽

pp. 2140050

Author(s):

Thi Thoa Mac ◽

Nguyen Thanh Hung

Keyword(s):

Neural Network ◽

Computer Vision ◽

Image Analysis ◽

Deep Learning ◽

Internal Crack ◽

Quality Inspection ◽

Statistical Representation ◽

Financial Costs ◽

Average Accuracy ◽

Processing Techniques

The pill manufacturing process accrues substantial financial costs due to quality. Pill quality inspection is laborious, time-consuming and subjective, resulting in poor statistical representation and inconsistent results. In this study, we developed an approach that integrates deep learning algorithms and computer-vision-based processing with an optimization algorithm to fully automate the image analysis of internal crack/contamination detection. This approach exploits the features learned by convolutional neural network using various sub-processing techniques and Adam optimization. It achieves robust quantification of internal pill defects with an average accuracy of 95%.

Download Full-text

Survey of Machine Learning Applications of Convolutional Neural Networks to Medical Image Analysis

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.38947 ◽

2021 ◽

Vol 9 (11) ◽

pp. 1186-1196

Author(s):

Dr. K. Naveen Kumar

Keyword(s):

Neural Network ◽

Machine Learning ◽

Computer Vision ◽

Image Analysis ◽

Deep Learning ◽

Convolutional Neural Network ◽

Medical Image ◽

Medical Image Analysis ◽

Learning Techniques ◽

Before And After

Abstract: Recently, a machine learning (ML) area called deep learning emerged in the computer-vision field and became very popular in many fields. It started from an event in late 2012, when a deep-learning approach based on a convolutional neural network (CNN) won an overwhelming victory in the best-known worldwide computer vision competition, ImageNet Classification. Since then, researchers in many fields, including medical image analysis, have started actively participating in the explosively growing field of deep learning. In this paper, deep learning techniques and their applications to medical image analysis are surveyed. This survey overviewed 1) standard ML techniques in the computer-vision field, 2) what has changed in ML before and after the introduction of deep learning, 3) ML models in deep learning, and 4) applications of deep learning to medical image analysis. The comparisons between MLs before and after deep learning revealed that ML with feature input (or feature-based ML) was dominant before the introduction of deep learning, and that the major and essential difference between ML before and after deep learning is learning image data directly without object segmentation or feature extraction; thus, it is the source of the power of deep learning, although the depth of the model is an important attribute. The survey of deep learningalso revealed that there is a long history of deep-learning techniques in the class of ML with image input, except a new term, “deep learning”. “Deep learning” even before the term existed, namely, the class of ML with image input was applied to various problems in medical image analysis including classification between lesions and nonlesions, classification between lesion types, segmentation of lesions or organs, and detection of lesions. ML with image input including deep learning is a verypowerful, versatile technology with higher performance, which can bring the current state-ofthe-art performance level of medical image analysis to the next level, and it is expected that deep learning will be the mainstream technology in medical image analysis in the next few decades. “Deep learning”, or ML with image input, in medical image analysis is an explosively growing, promising field. It is expected that ML with image input will be the mainstream area in the field of medical image analysis in the next few decades. Keywords: Deep learning, Convolutional neural network, Massive-training artificial neural network, Computer-aided diagnosis, Medical image analysis, Classification (key words)

Download Full-text

Maize-IAS: a maize image analysis software using deep learning for high-throughput plant phenotyping

Plant Methods ◽

10.1186/s13007-021-00747-0 ◽

2021 ◽

Vol 17 (1) ◽

Author(s):

Shuo Zhou ◽

Xiujuan Chai ◽

Zixuan Yang ◽

Hongwu Wang ◽

Chenxue Yang ◽

...

Keyword(s):

Computer Vision ◽

Image Analysis ◽

Deep Learning ◽

High Throughput ◽

Batch Processing ◽

Plant Phenotyping ◽

Plant Science ◽

Analysis Software ◽

Image Analysis Software ◽

Maize Growth

Abstract Background Maize (Zea mays L.) is one of the most important food sources in the world and has been one of the main targets of plant genetics and phenotypic research for centuries. Observation and analysis of various morphological phenotypic traits during maize growth are essential for genetic and breeding study. The generally huge number of samples produce an enormous amount of high-resolution image data. While high throughput plant phenotyping platforms are increasingly used in maize breeding trials, there is a reasonable need for software tools that can automatically identify visual phenotypic features of maize plants and implement batch processing on image datasets. Results On the boundary between computer vision and plant science, we utilize advanced deep learning methods based on convolutional neural networks to empower the workflow of maize phenotyping analysis. This paper presents Maize-IAS (Maize Image Analysis Software), an integrated application supporting one-click analysis of maize phenotype, embedding multiple functions: (I) Projection, (II) Color Analysis, (III) Internode length, (IV) Height, (V) Stem Diameter and (VI) Leaves Counting. Taking the RGB image of maize as input, the software provides a user-friendly graphical interaction interface and rapid calculation of multiple important phenotypic characteristics, including leaf sheath points detection and leaves segmentation. In function Leaves Counting, the mean and standard deviation of difference between prediction and ground truth are 1.60 and 1.625. Conclusion The Maize-IAS is easy-to-use and demands neither professional knowledge of computer vision nor deep learning. All functions for batch processing are incorporated, enabling automated and labor-reduced tasks of recording, measurement and quantitative analysis of maize growth traits on a large dataset. We prove the efficiency and potential capability of our techniques and software to image-based plant research, which also demonstrates the feasibility and capability of AI technology implemented in agriculture and plant science.

Download Full-text

Single-cell conventional pap smear image classification using pre-trained deep neural network architectures

BMC Biomedical Engineering ◽

10.1186/s42490-021-00056-6 ◽

2021 ◽

Vol 3 (1) ◽

Author(s):

Mohammed Aliy Mohammed ◽

Fetulhak Abdurahman ◽

Yodit Abebe Ayalew

Keyword(s):

Neural Network ◽

Cervical Cancer ◽

Computer Vision ◽

Single Cell ◽

Deep Neural Network ◽

Deep Neural Networks ◽

Pap Smear ◽

Experimental Result ◽

Network Architectures ◽

Average Accuracy

Abstract Background Automating cytology-based cervical cancer screening could alleviate the shortage of skilled pathologists in developing countries. Up until now, computer vision experts have attempted numerous semi and fully automated approaches to address the need. Yet, these days, leveraging the astonishing accuracy and reproducibility of deep neural networks has become common among computer vision experts. In this regard, the purpose of this study is to classify single-cell Pap smear (cytology) images using pre-trained deep convolutional neural network (DCNN) image classifiers. We have fine-tuned the top ten pre-trained DCNN image classifiers and evaluated them using five class single-cell Pap smear images from SIPaKMeD dataset. The pre-trained DCNN image classifiers were selected from Keras Applications based on their top 1% accuracy. Results Our experimental result demonstrated that from the selected top-ten pre-trained DCNN image classifiers DenseNet169 outperformed with an average accuracy, precision, recall, and F1-score of 0.990, 0.974, 0.974, and 0.974, respectively. Moreover, it dashed the benchmark accuracy proposed by the creators of the dataset with 3.70%. Conclusions Even though the size of DenseNet169 is small compared to the experimented pre-trained DCNN image classifiers, yet, it is not suitable for mobile or edge devices. Further experimentation with mobile or small-size DCNN image classifiers is required to extend the applicability of the models in real-world demands. In addition, since all experiments used the SIPaKMeD dataset, additional experiments will be needed using new datasets to enhance the generalizability of the models.

Download Full-text

Computer vision based obstacle detection and target tracking for autonomous vehicles

MATEC Web of Conferences ◽

10.1051/matecconf/202133607004 ◽

2021 ◽

Vol 336 ◽

pp. 07004

Author(s):

Ruoyu Fang ◽

Cheng Cai

Keyword(s):

Neural Network ◽

Computer Vision ◽

Deep Learning ◽

Target Tracking ◽

Real Time ◽

Autonomous Vehicles ◽

Autonomous Vehicle ◽

Obstacle Detection ◽

Pid Algorithm ◽

Deep Learning Neural Network

Obstacle detection and target tracking are two major issues for intelligent autonomous vehicles. This paper proposes a new scheme to achieve target tracking and real-time obstacle detection of obstacles based on computer vision. ResNet-18 deep learning neural network is utilized for obstacle detection and Yolo-v3 deep learning neural network is employed for real-time target tracking. These two trained models can be deployed on an autonomous vehicle equipped with an NVIDIA Jetson Nano motherboard. The autonomous vehicle moves to avoid obstacles and follow tracked targets by camera. Adjusting the steering and movement of the autonomous vehicle according to the PID algorithm during the movement, therefore, will help the proposed vehicle achieve stable and precise tracking.

Download Full-text

Deep Learning Framework for Steel Surface Defects Classification

INTERNATIONAL JOURNAL OF ADVANCED PRODUCTION AND INDUSTRIAL ENGINEERING ◽

10.35121/ijapie201901135 ◽

2019 ◽

Vol 4 (1) ◽

pp. 25-32

Author(s):

Ranganath Singari ◽

Karun Singla ◽

Gangesh Chawla

Keyword(s):

Neural Network ◽

Deep Learning ◽

Network Architecture ◽

Steel Surface ◽

Surface Defects ◽

Fixed Number ◽

Quality Level ◽

Acceptance Sampling ◽

Quality Inspection ◽

Defects Classification

Deep learning has offered new avenues in the field of industrial management. Traditional methods of quality inspection such as Acceptance Sampling relies on a probabilistic measure derived from inspecting a sample of finished products. Evaluating a fixed number of products to derive the quality level for the complete batch is not a robust approach. Visual inspection solutions based on deep learning can be employed in the large manufacturing units to improve the quality inspection units for steel surface defect detection. This leads to optimization of the human capital due to reduction in manual intervention and turnaround time in the overall supply chain of the industry. Consequently, the sample size in the Acceptance sampling can be increased with minimal effort vis-à-vis an increase in the overall accuracy of the inspection. The learning curve of this work is supported by Convolutional Neural Network which has been used to extract feature representations from grayscale images to classify theinputs into six types of surface defects. The neural network architecture is compiled in Keras framework using Tensorflow backend with state of the art Adam RMS Prop with Nesterov Momentum (NADAM) optimizer. The proposed classification algorithm holds the potential to identify the dominant flaws in the manufacturing system responsible for leaking costs.

Download Full-text

Deep Learning in Data-Driven Pavement Image Analysis and Automated Distress Detection: A Review

Data ◽

10.3390/data3030028 ◽

2018 ◽

Vol 3 (3) ◽

pp. 28 ◽

Cited By ~ 23

Author(s):

Kasthurirangan Gopalakrishnan

Keyword(s):

Computer Vision ◽

Image Analysis ◽

Deep Learning ◽

Asset Management ◽

Network Architecture ◽

Crack Detection ◽

Future Research ◽

Deep Convolutional Neural Networks ◽

Pavement Distress ◽

Learning Software

Deep learning, more specifically deep convolutional neural networks, is fast becoming a popular choice for computer vision-based automated pavement distress detection. While pavement image analysis has been extensively researched over the past three decades or so, recent ground-breaking achievements of deep learning algorithms in the areas of machine translation, speech recognition, and computer vision has sparked interest in the application of deep learning to automated detection of distresses in pavement images. This paper provides a narrative review of recently published studies in this field, highlighting the current achievements and challenges. A comparison of the deep learning software frameworks, network architecture, hyper-parameters employed by each study, and crack detection performance is provided, which is expected to provide a good foundation for driving further research on this important topic in the context of smart pavement or asset management systems. The review concludes with potential avenues for future research; especially in the application of deep learning to not only detect, but also characterize the type, extent, and severity of distresses from 2D and 3D pavement images.

Download Full-text

Fashion Product Classification through Deep Learning and Computer Vision

Applied Sciences ◽

10.3390/app9071385 ◽

2019 ◽

Vol 9 (7) ◽

pp. 1385 ◽

Cited By ~ 4

Author(s):

Luca Donati ◽

Eleonora Iotti ◽

Giulio Mordonini ◽

Andrea Prati

Keyword(s):

Image Processing ◽

Computer Vision ◽

Feature Extraction ◽

Deep Learning ◽

Template Matching ◽

Learning Approaches ◽

Visual Classification ◽

Product Classification ◽

Processing Techniques

Visual classification of commercial products is a branch of the wider fields of object detection and feature extraction in computer vision, and, in particular, it is an important step in the creative workflow in fashion industries. Automatically classifying garment features makes both designers and data experts aware of their overall production, which is fundamental in order to organize marketing campaigns, avoid duplicates, categorize apparel products for e-commerce purposes, and so on. There are many different techniques for visual classification, ranging from standard image processing to machine learning approaches: this work, made by using and testing the aforementioned approaches in collaboration with Adidas AG™, describes a real-world study aimed at automatically recognizing and classifying logos, stripes, colors, and other features of clothing, solely from final rendering images of their products. Specifically, both deep learning and image processing techniques, such as template matching, were used. The result is a novel system for image recognition and feature extraction that has a high classification accuracy and which is reliable and robust enough to be used by a company like Adidas. This paper shows the main problems and proposed solutions in the development of this system, and the experimental results on the Adidas AG™ dataset.

Download Full-text

CloudA: A Ground-Based Cloud Classification Method with a Convolutional Neural Network

Journal of Atmospheric and Oceanic Technology ◽

10.1175/jtech-d-19-0189.1 ◽

2020 ◽

Vol 37 (9) ◽

pp. 1661-1668

Author(s):

Min Wang ◽

Shudao Zhou ◽

Zhong Yang ◽

Zhanhua Liu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Image Recognition ◽

Learning Ability ◽

Visualization Method ◽

Classification Methods ◽

Recognition Method ◽

Cloud Classification ◽

Average Accuracy

AbstractConventional classification methods are based on artificial experience to extract features, and each link is independent, which is a kind of “shallow learning.” As a result, the scope of the cloud category applied by this method is limited. In this paper, we propose a new convolutional neural network (CNN) with deep learning ability, called CloudA, for the ground-based cloud image recognition method. We use the Singapore Whole-Sky Imaging Categories (SWIMCAT) sample library and total-sky sample library to train and test CloudA. In particular, we visualize the cloud features captured by CloudA using the TensorBoard visualization method, and these features can help us to understand the process of ground-based cloud classification. We compare this method with other commonly used methods to explore the feasibility of using CloudA to classify ground-based cloud images, and the evaluation of a large number of experiments show that the average accuracy of this method is nearly 98.63% for ground-based cloud classification.

Download Full-text

Self-Adaptive Approximate Mobile Deep Learning

Electronics ◽

10.3390/electronics10232958 ◽

2021 ◽

Vol 10 (23) ◽

pp. 2958

Author(s):

Timotej Knez ◽

Octavian Machidon ◽

Veljko Pejović

Keyword(s):

Neural Network ◽

Deep Learning ◽

Real World ◽

Edge Computing ◽

Major Drawback ◽

High Resource ◽

Average Accuracy ◽

Resource Requirements ◽

And Performance ◽

Network Compression

Edge intelligence is currently facing several important challenges hindering its performance, with the major drawback being meeting the high resource requirements of deep learning by the resource-constrained edge computing devices. The most recent adaptive neural network compression techniques demonstrated, in theory, the potential to facilitate the flexible deployment of deep learning models in real-world applications. However, their actual suitability and performance in ubiquitous or edge computing applications has not, to this date, been evaluated. In this context, our work aims to bridge the gap between the theoretical resource savings promised by such approaches and the requirements of a real-world mobile application by introducing algorithms that dynamically guide the compression rate of a neural network according to the continuously changing context in which the mobile computation is taking place. Through an in-depth trace-based investigation, we confirm the feasibility of our adaptation algorithms in offering a scalable trade-off between the inference accuracy and resource usage. We then implement our approach on real-world edge devices and, through a human activity recognition application, confirm that it offers efficient neural network compression adaptation in highly dynamic environments. The results of our experiment with 21 participants show that, compared to using static network compression, our approach uses 2.18× less energy with only a 1.5% drop in the average accuracy of the classification.

Download Full-text

Implementation of Deep Learning for Slump Optimization Based on Concrete Quality Using Convolutional Neural Network in PT. SELO PROGO SAKTI

Jurnal AKSI (Akuntansi dan Sistem Informasi) ◽

10.32486/aksi.v5i2.598 ◽

2020 ◽

Vol 5 (2) ◽

pp. 83-88

Author(s):

Hedi Pandowo

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Experimental Design ◽

Convolutional Neural Network ◽

Processing Unit ◽

Training Process ◽

Average Accuracy ◽

Graphical Processing ◽

Intelligence Science

Deep Learning is part of the scientific field of Machine Learning and Machine Learning is part of Artificial Intelligence science. Deep Learning has extraordinary capabilities by using a hardware Graphical Processing Unit (GPU) so that the artificial requirement network can run faster than using a Personal Computer Unit (CPU). Especially in terms of object classification in images using existing methods in the Convolutional Neural Network (CNN). The method used in this research is Preprocessing and Processing of Input Data, Training Process in which CNN is trained to obtain high accuracy from the classification carried out and the Testing Process which is a classification process using weights and bias from the results of the training process. This type of research is a pre experimental design (pre experimental design). The results of the object image classification test with different levels of confusion in the Concrete database with the Mix Design K-125, K-150, K-250 and K-300 produce an average accuracy value. This is also relevant to measuring the failure rate of concrete or slump

Download Full-text