Real-time adaptive object localization and tracking for autonomous vehicles

Author(s):  
Max Mauro Dias Santos ◽  
Joao Eduardo Hoffmann ◽  
Hilkija Gaius Tosso ◽  
Asad Waqar Malik ◽  
Anis Ur Rahman ◽  
...  
2014 ◽  
Vol 1077 ◽  
pp. 221-226
Author(s):  
Dan Popescu ◽  
Loretta Ichim ◽  
Radu Fratila ◽  
Diana Gornea

Tracking the road or a mobile object and also obstacle avoidance are very important components that must be considered in the process of developing a robotic system. In this paper we propose a mobile platform for indoor navigation, based on a cheap computing hardware, which is able to be configured in two scenarios: the first refers to the movement of the robot on a predetermined path and to avoidance the obstacles, while maintaining the final target, and the second refers to the possibility of identifying and tracking a target. The robotic system aggregates the information acquired from different sensors and combines the computing resources from the mobile platform with those from the central unit. MATLAB is used for all the implementations and tests, to develop algorithms and to create models and applications. The robot's communication with central unit is wireless. Experimental results show that the mobile platform is able to perform, in real time, the following tasks in indoor environment: the recognition of the object, localization and tracking and also the obstacles avoidance.


2021 ◽  
Vol 7 (8) ◽  
pp. 145
Author(s):  
Antoine Mauri ◽  
Redouane Khemmar ◽  
Benoit Decoux ◽  
Madjid Haddad ◽  
Rémi Boutteau

For smart mobility, autonomous vehicles, and advanced driver-assistance systems (ADASs), perception of the environment is an important task in scene analysis and understanding. Better perception of the environment allows for enhanced decision making, which, in turn, enables very high-precision actions. To this end, we introduce in this work a new real-time deep learning approach for 3D multi-object detection for smart mobility not only on roads, but also on railways. To obtain the 3D bounding boxes of the objects, we modified a proven real-time 2D detector, YOLOv3, to predict 3D object localization, object dimensions, and object orientation. Our method has been evaluated on KITTI’s road dataset as well as on our own hybrid virtual road/rail dataset acquired from the video game Grand Theft Auto (GTA) V. The evaluation of our method on these two datasets shows good accuracy, but more importantly that it can be used in real-time conditions, in road and rail traffic environments. Through our experimental results, we also show the importance of the accuracy of prediction of the regions of interest (RoIs) used in the estimation of 3D bounding box parameters.


2020 ◽  
Vol 13 (1) ◽  
pp. 89
Author(s):  
Manuel Carranza-García ◽  
Jesús Torres-Mateo ◽  
Pedro Lara-Benítez ◽  
Jorge García-Gutiérrez

Object detection using remote sensing data is a key task of the perception systems of self-driving vehicles. While many generic deep learning architectures have been proposed for this problem, there is little guidance on their suitability when using them in a particular scenario such as autonomous driving. In this work, we aim to assess the performance of existing 2D detection systems on a multi-class problem (vehicles, pedestrians, and cyclists) with images obtained from the on-board camera sensors of a car. We evaluate several one-stage (RetinaNet, FCOS, and YOLOv3) and two-stage (Faster R-CNN) deep learning meta-architectures under different image resolutions and feature extractors (ResNet, ResNeXt, Res2Net, DarkNet, and MobileNet). These models are trained using transfer learning and compared in terms of both precision and efficiency, with special attention to the real-time requirements of this context. For the experimental study, we use the Waymo Open Dataset, which is the largest existing benchmark. Despite the rising popularity of one-stage detectors, our findings show that two-stage detectors still provide the most robust performance. Faster R-CNN models outperform one-stage detectors in accuracy, being also more reliable in the detection of minority classes. Faster R-CNN Res2Net-101 achieves the best speed/accuracy tradeoff but needs lower resolution images to reach real-time speed. Furthermore, the anchor-free FCOS detector is a slightly faster alternative to RetinaNet, with similar precision and lower memory usage.


Sign in / Sign up

Export Citation Format

Share Document