Implementation of deep learning neural network for real-time object recognition in OpenCL framework

Author(s):  
Yukui Luo ◽  
Shuai Li ◽  
Kuangyuan Sun ◽  
Raul Renteria ◽  
Ken Choi
2021 ◽  
Vol 11 (11) ◽  
pp. 4758
Author(s):  
Ana Malta ◽  
Mateus Mendes ◽  
Torres Farinha

Maintenance professionals and other technical staff regularly need to learn to identify new parts in car engines and other equipment. The present work proposes a model of a task assistant based on a deep learning neural network. A YOLOv5 network is used for recognizing some of the constituent parts of an automobile. A dataset of car engine images was created and eight car parts were marked in the images. Then, the neural network was trained to detect each part. The results show that YOLOv5s is able to successfully detect the parts in real time video streams, with high accuracy, thus being useful as an aid to train professionals learning to deal with new equipment using augmented reality. The architecture of an object recognition system using augmented reality glasses is also designed.


2021 ◽  
Vol 336 ◽  
pp. 07004
Author(s):  
Ruoyu Fang ◽  
Cheng Cai

Obstacle detection and target tracking are two major issues for intelligent autonomous vehicles. This paper proposes a new scheme to achieve target tracking and real-time obstacle detection of obstacles based on computer vision. ResNet-18 deep learning neural network is utilized for obstacle detection and Yolo-v3 deep learning neural network is employed for real-time target tracking. These two trained models can be deployed on an autonomous vehicle equipped with an NVIDIA Jetson Nano motherboard. The autonomous vehicle moves to avoid obstacles and follow tracked targets by camera. Adjusting the steering and movement of the autonomous vehicle according to the PID algorithm during the movement, therefore, will help the proposed vehicle achieve stable and precise tracking.


Detecting vehicle motions are a progressively significant part in road surveillance and Traffic organizing systems. This paper presents a new Deep Gaussian based mixture model that predicts accurate in detecting vehicle motions. Although the existing arrangements based on conventional Gaussian mixture model which is limited in insufficient of many distinct points which eliminate covariance and solutions relative to infinite likelihood. In the proposed scheme, the deep learning neural network is used for including the more points with nested mixture models. To overcome the effects of adding more points the modification achieved in architecture development. The validation of proposed scheme is achieved with real-time videos and process with scikit learn based model.


Author(s):  
K Pandiaraj ◽  
P Sivakumar ◽  
V Nandhini ◽  
S Parkav

In farms we can see that the birds and animals destroying the crops. The movement of birds and animals cannot be controlled by any method. We can only drive away them. To drive away them, humans are used. To reduce the human effort we have introduced a method using image processing. In this method, the real time images are given as input and sound will be derived as output. The image given as input is compared with the trained images and classified into birds and animals. After the identification, birds can be driven away by using cracker sound and animals can be driven away by using a human sound.


2021 ◽  
Vol 35 (5) ◽  
pp. 431-435
Author(s):  
Vijayakumar Ponnusamy ◽  
Diwakar R. Marur ◽  
Deepa Dhanaskodi ◽  
Thangavel Palaniappan

This work proposes deep learning neural network-based X-ray image classification. The X-ray baggage scanning machinery plays an essential role in the safeguard of customs, airports, and other systematically very important landmarks and infrastructures. The technology at present of baggage scanning machines is designed on X-ray attenuation. The detection of threatful objects is built on how different objects attenuate the X-ray beams going through them. In this paper, the deep convolutional neural network of YOLO is utilized in classifying baggage images. Real-time performance of the baggage image classification is an essential one for security scanning. There are many computationally intensive operations in the You Only Look Once (YOLO) architecture. The computational intensive operations are implemented in the Field Programmable Gate Array (FPGA) platform to optimize process delays. The critical issues involved in those implementations include data representation, inner products computation and implementation of activation function and resolving these issues will also be a significant task. The FPGA implementation results show that with less resource occupancy, the YOLO implementation provides maximum accuracy of 98.9% in classifying X-ray baggage images and identifying hazardous materials. This result proves that the proposed implementation is best suited for practical system deployments for real-time Baggage scanning.


Author(s):  
Hoa T. Nguyen ◽  
Jens-Patrick Langstrand ◽  
Michael Hildebrandt

We demonstrate the ReClass system, a real time reading detection classifier. ReClass detects if a user is reading text or not solely based on the user’s eye movements, without considering the content of the screen. We examined two machine learning approaches. In the first approach, we pre-processed the data using feature engineering and trained a Linear Support Vector Machine classifier. The second approach used a Deep Learning Neural Network; instead of feature engineering, we allowed the neural network to extract features from the raw data. The second approach outperformed the first by about 25%, achieving 96% accuracy. Our tool illustrates how Deep Learning can be a new innovative method for teaching machines to understand human behaviour. We discuss the potential applications of ReClass for educational assessment, medical diagnosis and training.


Water ◽  
2021 ◽  
Vol 13 (11) ◽  
pp. 1547
Author(s):  
Jian Sha ◽  
Xue Li ◽  
Man Zhang ◽  
Zhong-Liang Wang

Accurate real-time water quality prediction is of great significance for local environmental managers to deal with upcoming events and emergencies to develop best management practices. In this study, the performances in real-time water quality forecasting based on different deep learning (DL) models with different input data pre-processing methods were compared. There were three popular DL models concerned, including the convolutional neural network (CNN), long short-term memory neural network (LSTM), and hybrid CNN–LSTM. Two types of input data were applied, including the original one-dimensional time series and the two-dimensional grey image based on the complete ensemble empirical mode decomposition algorithm with adaptive noise (CEEMDAN) decomposition. Each type of input data was used in each DL model to forecast the real-time monitoring water quality parameters of dissolved oxygen (DO) and total nitrogen (TN). The results showed that (1) the performances of CNN–LSTM were superior to the standalone model CNN and LSTM; (2) the models used CEEMDAN-based input data performed much better than the models used the original input data, while the improvements for non-periodic parameter TN were much greater than that for periodic parameter DO; and (3) the model accuracies gradually decreased with the increase of prediction steps, while the original input data decayed faster than the CEEMDAN-based input data and the non-periodic parameter TN decayed faster than the periodic parameter DO. Overall, the input data preprocessed by the CEEMDAN method could effectively improve the forecasting performances of deep learning models, and this improvement was especially significant for non-periodic parameters of TN.


Sign in / Sign up

Export Citation Format

Share Document