scholarly journals Implementation of an Obstacle Recognition System for the Blind

2019 ◽  
Vol 10 (1) ◽  
pp. 282 ◽  
Author(s):  
Soobin Ou ◽  
Huijin Park ◽  
Jongwoo Lee

The blind encounter commuting risks, such as failing to recognize and avoid obstacles while walking, but protective support systems are lacking. Acoustic signals at crosswalk lights are activated by button or remote control; however, these signals are difficult to operate and not always available (i.e., broken). Bollards are posts installed for pedestrian safety, but they can create dangerous situations in that the blind cannot see them. Therefore, we proposed an obstacle recognition system to assist the blind in walking safely outdoors; this system can recognize and guide the blind through two obstacles (crosswalk lights and bollards) with image training from the Google Object Detection application program interface (API) based on TensorFlow. The recognized results notify the blind through voice guidance playback in real time. The single shot multibox detector (SSD) MobileNet and faster region-convolutional neural network (R-CNN) models were applied to evaluate the obstacle recognition system; the latter model demonstrated better performance. Crosswalk lights were evaluated and found to perform better during the day than night. They were also analyzed to determine if a client could cross at a crosswalk, while the locations of bollards were analyzed by algorithms to guide the client by voice guidance.

2020 ◽  
Vol 12 (1) ◽  
pp. 1
Author(s):  
Vivian Alfionita Sutama ◽  
Suryo Adhi Wibowo ◽  
Rissa Rahmania

Nowadays, Artificial Intelligence is one of the most developing technology, especially on Augmented Reality (AR). AR is a technology which connected between real world and virtual in a real time that allows user to interact directly and display it in 3D. AR technology has two methods, that are AR based on marker and AR based on markerless. However, AR based on marker need an object detection system which has high performance as an interaction tools between user and the device. Single shot multibox detector (SSD) is an object detection algorithm that has fast learning computation and good performance. This method is affected by some parameters like number of epoch, learning rate, batch size, step training, etc. However, to create a good system it took a long process such as taking dataset, labelling process, then training and testing models to gain the best performance. In this experiment, we analyze SSD method in AR technology using inception architecture as pre-trained Convolutional neural network (CNN), and then do transfer learning to minimize amount training time. The configuration that used is the number of step training. The result of this experiment gets the best accuracy in 70.17%. Then, the best performance is used as an object detection model for marker’s AR technology.Abstrak Saat ini, Artificial intelligence merupakan teknologi yang sedang berkembang pesat. Salah satunya adalah teknologi Augmented Reality (AR). AR adalah teknologi yang menggabungkan dunia nyata dengan virtual secara real-time dengan interaksi pengguna secara langsung dan menampilkannya dalam bentuk 3D. Teknologi AR ini memiliki dua metode yaitu dengan marker dan markerless. Dalam perkembangannya, AR berbasis marker membutuhkan sistem deteksi objek yang memiliki performa tinggi sebagai alat interaksi antara pengguna dengan perangkatnya. Single shot multibox detector (SSD) merupakan algoritma deteksi objek yang memiliki komputasi pembelajaran dan kinerja yang baik. Metode ini dipengaruhi oleh beberapa parameter seperti jumlah lapisan konvolusi, epoch, learning rate, jumlah batch, step training, dll. Namun, dalam mengimplementasikannya diperlukan proses yang cukup panjang seperti, pengambilan dataset, proses pelabelan, proses pelatihan menggunakan metode SSD, dan melakukan pengujian terhadap beberapa model untuk mencari perfomansi paling baik. Dalam percobaan ini, kami melakukan analisis terhadap metode SSD pada teknologi AR menggunakan arsitektur Inception sebagai pre-trained Convolutional neural network (CNN), kemudian dilakukan transfer learning untuk memperkecil jumlah kelas data pelatihan dan waktu pelatihan data. Konfigurasi yang digunakan berupa jumlah step pada pelatihan. Hasil dari penilitian ini menunjukan akurasi terbaik sebesar 70,17%. Kemudian, perfomansi terbaik digunakan sebagai model deteksi objek untuk marker pada teknologi AR.


Author(s):  
Zhiyong Gao ◽  
Jianhong Xiang

Background: While detecting the object directly from the 3D point cloud, the natural 3D patterns and invariance of 3D data are often obscure. Objective: In this work, we aimed at studying the 3D object detection from discrete, disordered and sparse 3D point clouds. Methods: The CNN is composed of the frustum sequence module, 3D instance segmentation module S-NET, 3D point cloud transformation module T-NET, and 3D boundary box estimation module E-NET. The search space of the object is determined by the frustum sequence module. The instance segmentation of the point cloud is performed by the 3D instance segmentation module. The 3D coordinates of the object are confirmed by the transformation module and the 3D bounding box estimation module. Results: Evaluated on KITTI benchmark dataset, our method outperforms the state of the art by remarkable margins while having real-time capability. Conclusion: We achieve real-time 3D object detection by proposing an improved convolutional neural network (CNN) based on image-driven point clouds.


Sensors ◽  
2020 ◽  
Vol 20 (23) ◽  
pp. 6779
Author(s):  
Byung-Gil Han ◽  
Joon-Goo Lee ◽  
Kil-Taek Lim ◽  
Doo-Hyun Choi

With the increase in research cases of the application of a convolutional neural network (CNN)-based object detection technology, studies on the light-weight CNN models that can be performed in real time on the edge-computing devices are also increasing. This paper proposed scalable convolutional blocks that can be easily designed CNN networks of You Only Look Once (YOLO) detector which have the balanced processing speed and accuracy of the target edge-computing devices considering different performances by exchanging the proposed blocks simply. The maximum number of kernels of the convolutional layer was determined through simple but intuitive speed comparison tests for three edge-computing devices to be considered. The scalable convolutional blocks were designed in consideration of the limited maximum number of kernels to detect objects in real time on these edge-computing devices. Three scalable and fast YOLO detectors (SF-YOLO) which designed using the proposed scalable convolutional blocks compared the processing speed and accuracy with several conventional light-weight YOLO detectors on the edge-computing devices. When compared with YOLOv3-tiny, SF-YOLO was seen to be 2 times faster than the previous processing speed but with the same accuracy as YOLOv3-tiny, and also, a 48% improved processing speed than the YOLOv3-tiny-PRN which is the processing speed improvement model. Also, even in the large SF-YOLO model that focuses on the accuracy performance, it achieved a 10% faster processing speed with better accuracy of 40.4% [email protected] in the MS COCO dataset than YOLOv4-tiny model.


Author(s):  
Mochammad Langgeng Prasetyo ◽  
Achmad Teguh Wibowo ◽  
Mujib Ridwan ◽  
Mohammad Khusnu Milad ◽  
Sirajul Arifin ◽  
...  

The implementation of face recognition technique using CCTV is able to prevent unauthorized person enter the gate. Face recognition can be used for authentication, which can be implemented for preventing of criminal incidents. This re-search proposed a face recognition system using convolutional neural network to open and close the real-time barrier gate. The process consists of a convolutional layer, pooling layer, max pooling, flattening, and fully connected layer for detecting a face. The information was sent to the microcontroller using Internet of Thing (IoT) for controlling the barrier gate. The face recognition results are used to open or close the gate in the real time. The experimental results obtained average error rate of 0.320 and the accuracy of success rate is about 93.3%. The average response time required by microcontroller is about 0.562ms. The simulation result show that the face recognition technique using CNN is highly recommended to be implemented in barrier gate system.


Sign in / Sign up

Export Citation Format

Share Document