scholarly journals Real-Time Detection of Cook Assistant Overalls Based on Embedded Reasoning

Sensors ◽  
2021 ◽  
Vol 21 (23) ◽  
pp. 8069
Author(s):  
Qinghua Sheng ◽  
Haixiang Sheng ◽  
Peng Gao ◽  
Zhu Li ◽  
Haibing Yin

Currently, the target detection based on convolutional neural network plays an important role in image recognition, speech recognition and other fields. However, the current network model features a complex structure, a huge number of parameters and resources. These conditions make it difficult to apply in embedded devices with limited computational capabilities and extreme sensitivity to power consumption. In this regard, the application scenarios of deep learning are limited. This paper proposes a real-time detection scheme for cook assistant overalls based on the Hi3559A embedded processor. With YOLOv3 as the benchmark network, this scheme fully mobilizes the hardware acceleration resources through the network model optimization and the parallel processing technology of the processor, and improves the network reasoning speed, so that the embedded device can complete the task of real-time detection on the local device. The experimental results show that through the purposeful cropping, segmentation and in-depth optimization of the neural network according to the specific processor, the neural network can recognize the image accurately. In an application environment where the power consumption is only 5.5 W, the recognition speed of the neural network on the embedded end is increased to about 28 frames (the design requirement was to achieve a recognition speed of 25 frames or more), so that the optimized network can be effectively applied in the back kitchen overalls identification scene.

2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Shoji Morita ◽  
Hitoshi Tabuchi ◽  
Hiroki Masumoto ◽  
Tomofusa Yamauchi ◽  
Naotake Kamiura

Abstract The present study aimed to conduct a real-time automatic analysis of two important surgical phases, which are continuous curvilinear capsulorrhexis (CCC), nuclear extraction, and three other surgical phases of cataract surgery using artificial intelligence technology. A total of 303 cases of cataract surgery registered in the clinical database of the Ophthalmology Department of Tsukazaki Hospital were used as a dataset. Surgical videos were downsampled to a resolution of 299 × 168 at 1 FPS to image each frame. Next, based on the start and end times of each surgical phase recorded by an ophthalmologist, the obtained images were labeled correctly. Using the data, a neural network model, known as InceptionV3, was developed to identify the given surgical phase for each image. Then, the obtained images were processed in chronological order using the neural network model, where the moving average of the output result of five consecutive images was derived. The class with the maximum output value was defined as the surgical phase. For each surgical phase, the time at which a phase was first identified was defined as the start time, and the time at which a phase was last identified was defined as the end time. The performance was evaluated by finding the mean absolute error between the start and end times of each important phase recorded by the ophthalmologist as well as the start and end times determined by the model. The correct response rate of the cataract surgical phase classification was 90.7% for CCC, 94.5% for nuclear extraction, and 97.9% for other phases, with a mean correct response rate of 96.5%. The errors between each phase’s start and end times recorded by the ophthalmologist and those determined by the neural network model were as follows: CCC’s start and end times, 3.34 seconds and 4.43 seconds, respectively and nuclear extraction’s start and end times, 7.21 seconds and 6.04 seconds, respectively, with a mean of 5.25 seconds. The neural network model used in this study was able to perform the classification of the surgical phase by only referring to the last 5 seconds of video images. Therefore, our method has performed like a real-time classification.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Niu Zijie ◽  
Zhang Peng ◽  
Yongjie Cui ◽  
Zhang Jun

Purpose Omnidirectional mobile platforms are still plagued by the problem of heading deviation. In four-Mecanum-wheel systems, this problem arises from the phenomena of dynamic imbalance and slip of the Mecanum wheels while driving. The purpose of this paper is to analyze the mechanism of omnidirectional motion using Mecanum wheels, with the aim of enhancing the heading precision. A proportional-integral-derivative (PID) setting control algorithm based on a radial basis function (RBF) neural network model is introduced. Design/methodology/approach In this study, the mechanism of omnidirectional motion using Mecanum wheels is analyzed, with the aim of enhancing the heading precision. A PID setting control algorithm based on an RBF neural network model is introduced. The algorithm is based on a kinematics model for an omnidirectional mobile platform and corrects the driving heading in real time. In this algorithm, the neural network RBF NN2 is used for identifying the state of the system, calculating the Jacobian information of the system and transmitting information to the neural network RBF NN1. Findings The network RBF NN1 calculates the deviations ?Kp, ?Ki and ?Kd to regulate the three coefficients Kp, Ki and Kd of the heading angle PID controller. This corrects the driving heading in real time, resolving the problems of low heading precision and unstable driving. The experimental data indicate that, for a externally imposed deviation in the heading angle of between 34º and ∼38°, the correction time for an omnidirectional mobile platform applying the algorithm during longitudinal driving is reduced by 1.4 s compared with the traditional PID control algorithm, while the overshoot angle is reduced by 7.4°; for lateral driving, the correction time is reduced by 1.4 s and the overshoot angle is reduced by 4.2°. Originality/value In this study, the mechanism of omnidirectional motion using Mecanum wheels is analyzed, with the aim of enhancing the heading precision. A PID setting control algorithm based on an RBF neural network model is introduced. The algorithm is based on a kinematics model for an omnidirectional mobile platform and corrects the driving heading in real time. In this algorithm, the neural network RBF NN2 is used for identifying the state of the system, calculating the Jacobian information of the system and transmitting information to the neural network RBF NN1. The method is innovative.


2013 ◽  
Vol 846-847 ◽  
pp. 167-171
Author(s):  
Chun Ping Huang

Due to the complex structure, high frequency of occurrence of the fault signal and the short cycle, it is difficult to form long lasting identifiable characteristics for intelligent instruments. The traditional fault detected model is difficult to effectively extract short fault signal, and unable to form meaningful fault recognition model, resulting in difficulties in the detection. In order to solve these problems, this paper presents a mining method for fault symptoms of the intelligent instruments based on the neural network model. The method uses the fault characteristic of the intelligent instrument as its basic data, according which to build neural network model in order to realize fault mining. The experiment results show that the proposed method can instantly detect the failure of the intelligent instruments, according to which the treatment can be carried out rapidly and accurately to ensure the performance of the instruments.


Author(s):  
Muhammad Hanif Ahmad Nizar ◽  
Chow Khuen Chan ◽  
Azira Khalil ◽  
Ahmad Khairuddin Mohamed Yusof ◽  
Khin Wee Lai

Background: Valvular heart disease is a serious disease leading to mortality and increasing medical care cost. The aortic valve is the most common valve affected by this disease. Doctors rely on echocardiogram for diagnosing and evaluating valvular heart disease. However, the images from echocardiogram are poor in comparison to Computerized Tomography and Magnetic Resonance Imaging scan. This study proposes the development of Convolutional Neural Networks (CNN) that can function optimally during a live echocardiographic examination for detection of the aortic valve. An automated detection system in an echocardiogram will improve the accuracy of medical diagnosis and can provide further medical analysis from the resulting detection. Methods: Two detection architectures, Single Shot Multibox Detector (SSD) and Faster Regional based Convolutional Neural Network (R-CNN) with various feature extractors were trained on echocardiography images from 33 patients. Thereafter, the models were tested on 10 echocardiography videos. Results: Faster R-CNN Inception v2 had shown the highest accuracy (98.6%) followed closely by SSD Mobilenet v2. In terms of speed, SSD Mobilenet v2 resulted in a loss of 46.81% in framesper- second (fps) during real-time detection but managed to perform better than the other neural network models. Additionally, SSD Mobilenet v2 used the least amount of Graphic Processing Unit (GPU) but the Central Processing Unit (CPU) usage was relatively similar throughout all models. Conclusion: Our findings provide a foundation for implementing a convolutional detection system to echocardiography for medical purposes.


2021 ◽  
Vol 11 (11) ◽  
pp. 4758
Author(s):  
Ana Malta ◽  
Mateus Mendes ◽  
Torres Farinha

Maintenance professionals and other technical staff regularly need to learn to identify new parts in car engines and other equipment. The present work proposes a model of a task assistant based on a deep learning neural network. A YOLOv5 network is used for recognizing some of the constituent parts of an automobile. A dataset of car engine images was created and eight car parts were marked in the images. Then, the neural network was trained to detect each part. The results show that YOLOv5s is able to successfully detect the parts in real time video streams, with high accuracy, thus being useful as an aid to train professionals learning to deal with new equipment using augmented reality. The architecture of an object recognition system using augmented reality glasses is also designed.


Author(s):  
Mostafa H. Tawfeek ◽  
Karim El-Basyouny

Safety Performance Functions (SPFs) are regression models used to predict the expected number of collisions as a function of various traffic and geometric characteristics. One of the integral components in developing SPFs is the availability of accurate exposure factors, that is, annual average daily traffic (AADT). However, AADTs are not often available for minor roads at rural intersections. This study aims to develop a robust AADT estimation model using a deep neural network. A total of 1,350 rural four-legged, stop-controlled intersections from the Province of Alberta, Canada, were used to train the neural network. The results of the deep neural network model were compared with the traditional estimation method, which uses linear regression. The results indicated that the deep neural network model improved the estimation of minor roads’ AADT by 35% when compared with the traditional method. Furthermore, SPFs developed using linear regression resulted in models with statistically insignificant AADTs on minor roads. Conversely, the SPF developed using the neural network provided a better fit to the data with both AADTs on minor and major roads being statistically significant variables. The findings indicated that the proposed model could enhance the predictive power of the SPF and therefore improve the decision-making process since SPFs are used in all parts of the safety management process.


Sign in / Sign up

Export Citation Format

Share Document