Incremental Learning Framework for Autonomous Robots based on Q-learning and the Adaptive Kernel Linear Model

Author(s):  
Yanming Hu ◽  
Decai Li ◽  
Yuqing He ◽  
Jianda Han
2022 ◽  
Vol 71 (2) ◽  
pp. 2901-2921
Author(s):  
Alaa Eisa ◽  
Nora EL-Rashidy ◽  
Mohammad Dahman Alshehri ◽  
Hazem M. El-bakry ◽  
Samir Abdelrazek

Mathematics ◽  
2020 ◽  
Vol 8 (9) ◽  
pp. 1479
Author(s):  
Francisco Martinez-Gil ◽  
Miguel Lozano ◽  
Ignacio García-Fernández ◽  
Pau Romero ◽  
Dolors Serra ◽  
...  

Reinforcement learning is one of the most promising machine learning techniques to get intelligent behaviors for embodied agents in simulations. The output of the classic Temporal Difference family of Reinforcement Learning algorithms adopts the form of a value function expressed as a numeric table or a function approximator. The learned behavior is then derived using a greedy policy with respect to this value function. Nevertheless, sometimes the learned policy does not meet expectations, and the task of authoring is difficult and unsafe because the modification of one value or parameter in the learned value function has unpredictable consequences in the space of the policies it represents. This invalidates direct manipulation of the learned value function as a method to modify the derived behaviors. In this paper, we propose the use of Inverse Reinforcement Learning to incorporate real behavior traces in the learning process to shape the learned behaviors, thus increasing their trustworthiness (in terms of conformance to reality). To do so, we adapt the Inverse Reinforcement Learning framework to the navigation problem domain. Specifically, we use Soft Q-learning, an algorithm based on the maximum causal entropy principle, with MARL-Ped (a Reinforcement Learning-based pedestrian simulator) to include information from trajectories of real pedestrians in the process of learning how to navigate inside a virtual 3D space that represents the real environment. A comparison with the behaviors learned using a Reinforcement Learning classic algorithm (Sarsa(λ)) shows that the Inverse Reinforcement Learning behaviors adjust significantly better to the real trajectories.


Sensors ◽  
2020 ◽  
Vol 20 (22) ◽  
pp. 6550 ◽  
Author(s):  
Chen Cheng ◽  
Ji Chang ◽  
Wenjun Lv ◽  
Yuping Wu ◽  
Kun Li ◽  
...  

The accurate terrain classification in real time is of great importance to an autonomous robot working in field, because the robot could avoid non-geometric hazards, adjust control scheme, or improve localization accuracy, with the aid of terrain classification. In this paper, we investigate the vibration-based terrain classification (VTC) in a dynamic environment, and propose a novel learning framework, named DyVTC, which tackles online-collected unlabeled data with concept drift. In the DyVTC framework, the exterior disagreement (ex-disagreement) and interior disagreement (in-disagreement) are proposed novely based on the feature diversity and intrinsic temporal correlation, respectively. Such a disagreement mechanism is utilized to design a pseudo-labeling algorithm, which shows its compelling advantages in extracting key samples and labeling; and consequently, the classification accuracy could be retrieved by incremental learning in a changing environment. Since two sets of features are extracted from frequency and time domain to generate disagreements, we also name the proposed method feature-temporal disagreement adaptation (FTDA). The real-world experiment shows that the proposed DyVTC could reach an accuracy of 89.5%, but the traditional time- and frequency-domain terrain classification methods could only reach 48.8% and 71.5%, respectively, in a dynamic environment.


Author(s):  
Muhannad Alomari ◽  
Paul Duckworth ◽  
Nils Bore ◽  
Majd Hawasly ◽  
David C. Hogg ◽  
...  

With the recent proliferation of human-oriented robotic applications in domestic and industrial scenarios, it is vital for robots to continually learn about their environments and about the humans they share their environments with. In this paper, we present a novel, online, incremental framework for unsupervised symbol grounding in real-world, human environments for autonomous robots. We demonstrate the flexibility of the framework by learning about colours, people names, usable objects and simple human activities, integrating state-of-the-art object segmentation, pose estimation, activity analysis along with a number of sensory input encodings into a continual learning framework. Natural language is grounded to the learned concepts, enabling the robot to communicate in a human-understandable way. We show, using a challenging real-world dataset of human activities as perceived by a mobile robot, that our framework is able to extract useful concepts, ground natural language descriptions to them, and, as a proof-of-concept, generate simple sentences from templates to describe people and the activities they are engaged in.


Author(s):  
Pallege Gamini Dilupa Siriwardana ◽  
Clarence de Silva

In cooperative multi-robot object transportation, several autonomous robots navigate cooperatively in either a static or a dynamic environment to transport an object to a goal location and orientation. The environment may consist of both fixed and removable obstacles and it will be subject to uncertainty and unforeseen changes within the environment. More than one robot may be required for handling heavy and large objects. This paper presents a modified Q-learning approach for object transportation utilizing cooperative and autonomous multiple mobile robots. A modified version of Q-learning is presented, which employs for effective robot coordination. A solution to the action selection conflicts of the robots is presented, which helps to improve the real time performance and robustness of the system. As required in the task, the paper presents an algorithm for object pose estimation, by utilizing the laser range finder and color blob tracking. The developed techniques are implemented in a multi-robot system in laboratory. Experimental results are presented to demonstrate the effectiveness of the developed multi-robot system and its underlying methodologies.


2021 ◽  
Author(s):  
Wenju Sun ◽  
Jing Zhang ◽  
Danyu Wang ◽  
Yangli-Ao Geng ◽  
Qingyong Li

Sign in / Sign up

Export Citation Format

Share Document