scholarly journals Semantic-aware Active Perception for UAVs using Deep Reinforcement Learning

Author(s):  
Luca Bartolomei ◽  
Lucas Teixeira ◽  
Margarita Chli
2019 ◽  
Vol 4 (30) ◽  
pp. eaaw6326 ◽  
Author(s):  
Santhosh K. Ramakrishnan ◽  
Dinesh Jayaraman ◽  
Kristen Grauman

Standard computer vision systems assume access to intelligently captured inputs (e.g., photos from a human photographer), yet autonomously capturing good observations is a major challenge in itself. We address the problem of learning to look around: How can an agent learn to acquire informative visual observations? We propose a reinforcement learning solution, where the agent is rewarded for reducing its uncertainty about the unobserved portions of its environment. Specifically, the agent is trained to select a short sequence of glimpses, after which it must infer the appearance of its full environment. To address the challenge of sparse rewards, we further introduce sidekick policy learning, which exploits the asymmetry in observability between training and test time. The proposed methods learned observation policies that not only performed the completion task for which they were trained but also generalized to exhibit useful “look-around” behavior for a range of active perception tasks.


1990 ◽  
Vol 2 (4) ◽  
pp. 409-419 ◽  
Author(s):  
Steven D. Whitehead ◽  
Dana H. Ballard

This paper considers adaptive control architectures that integrate active sensorimotor systems with decision systems based on reinforcement learning. One unavoidable consequence of active perception is that the agent's internal representation often confounds external world states. We call this phenomenon perceptual aliasing and show that it destabilizes existing reinforcement learning algorithms with respect to the optimal decision policy. A new decision system that overcomes these difficulties is described. The system incorporates a perceptual subcycle within the overall decision cycle and uses a modified learning algorithm to suppress the effects of perceptual aliasing. The result is a control architecture that learns not only how to solve a task but also where to focus its attention in order to collect necessary sensory information.


1996 ◽  
Vol 3 (4) ◽  
pp. 126-134 ◽  
Author(s):  
Katsunari Shibata ◽  
Tetsuo Nishino ◽  
Yoichi Okabe

Decision ◽  
2016 ◽  
Vol 3 (2) ◽  
pp. 115-131 ◽  
Author(s):  
Helen Steingroever ◽  
Ruud Wetzels ◽  
Eric-Jan Wagenmakers

Sign in / Sign up

Export Citation Format

Share Document