Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action Primitives

Author(s):  
Dongwon Son ◽  
Myungsin Kim ◽  
Jaecheol Sim ◽  
Wonsik Shin
Author(s):  
Edwin Valarezo Añazco ◽  
Patricio Rivera Lopez ◽  
Nahyeon Park ◽  
Jiheon Oh ◽  
Gahyeon Ryu ◽  
...  

Author(s):  
Pietro Pierpaoli ◽  
Thinh T. Doan ◽  
Justin Romberg ◽  
Magnus Egerstedt

AbstractGiven a collection of parameterized multi-robot controllers associated with individual behaviors designed for particular tasks, this paper considers the problem of how to sequence and instantiate the behaviors for the purpose of completing a more complex, overarching mission. In addition, uncertainties about the environment or even the mission specifications may require the robots to learn, in a cooperative manner, how best to sequence the behaviors. In this paper, we approach this problem by using reinforcement learning to approximate the solution to the computationally intractable sequencing problem, combined with an online gradient descent approach to selecting the individual behavior parameters, while the transitions among behaviors are triggered automatically when the behaviors have reached a desired performance level relative to a task performance cost. To illustrate the effectiveness of the proposed method, it is implemented on a team of differential-drive robots for solving two different missions, namely, convoy protection and object manipulation.


Author(s):  
John Benjamin Cassel

This chapter provides a stakeholder discovery model for distributed risk governance suitable to machine learning and decision-theoretic planning. Distributed risk governance concerns when the underlying risk is not localized or has unknown locality so that any initial interaction with stakeholders is limited and educational and participatory initiatives are costly. Therefore, expecting the initial reaction to communications is critical. To capture this initial reaction, the authors sample the population of potential stakeholders to discover both their concerns and knowledge while handling inaccuracies and contradictions. This chapter provides a stakeholder discovery model that can accommodate these inconsistencies. Stakeholder discovery provides a timely strategic assessment of the risk situation. This assessment forecasts projected stakeholder actions to find if those actions are in line with their strategic interests or if there are better choices using reinforcement learning. Unlike other reinforcement learning formulations, it does not take the state space, criteria, potential observations, other agents, actions, or rewards for granted, but discovers these factors non-parametrically. Overall, this chapter introduces machine learning researchers and risk governance professionals to the compatibility between non-parametric models and early-stage stakeholder discovery problems and addresses widely known biases and deficits within risk governance and intelligence practices.


Sign in / Sign up

Export Citation Format

Share Document