A Case Study in Hybrid Multi-threading and Hierarchical Reinforcement Learning Approach for Cooperative Multi-agent Systems

Author(s):  
Hiram Ponce ◽  
Ricardo Padilla ◽  
Alan Davalos ◽  
Alvaro Herrasti ◽  
Cynthia Pichardo ◽  
...  
Author(s):  
V. S. Simankov ◽  
Yu. V. Dubenko

The system analysis of the hierarchical intelligent multi-agent system in general, as well as its main structural unit, the intelligent agent, its major subsystems identified. As part of the analysis of the computer vision subsystem, it was concluded that the considered sources have insufficiently worked out issues related to the processing of occlusions, with the automation of the process of reconstruction of three-dimensional scenes, with the implementation of the processing of an unstructured set of images. The structure of the block for the reconstruction of three-dimensional scenes is proposed, the implementation of which is aimed at eliminating the indicated problems characteristic of the machine vision subsystem. The analysis of the main methods of implementing unsupervised learning is carried out, based on the results of which it is concluded that it is advisable to use reinforcement learning when implementing systems of this type. Such types of reinforcement learning as hierarchical reinforcement learning and multi-agent reinforcement learning are considered. A method for segmentation of macro actions is proposed, based on the implementation of clustering by the method of label propagation, in which the number of transitions is formalized in the form of weight coefficients of edges.


2021 ◽  
Vol 10 (2) ◽  
pp. 27
Author(s):  
Roberto Casadei ◽  
Gianluca Aguzzi ◽  
Mirko Viroli

Research and technology developments on autonomous agents and autonomic computing promote a vision of artificial systems that are able to resiliently manage themselves and autonomously deal with issues at runtime in dynamic environments. Indeed, autonomy can be leveraged to unburden humans from mundane tasks (cf. driving and autonomous vehicles), from the risk of operating in unknown or perilous environments (cf. rescue scenarios), or to support timely decision-making in complex settings (cf. data-centre operations). Beyond the results that individual autonomous agents can carry out, a further opportunity lies in the collaboration of multiple agents or robots. Emerging macro-paradigms provide an approach to programming whole collectives towards global goals. Aggregate computing is one such paradigm, formally grounded in a calculus of computational fields enabling functional composition of collective behaviours that could be proved, under certain technical conditions, to be self-stabilising. In this work, we address the concept of collective autonomy, i.e., the form of autonomy that applies at the level of a group of individuals. As a contribution, we define an agent control architecture for aggregate multi-agent systems, discuss how the aggregate computing framework relates to both individual and collective autonomy, and show how it can be used to program collective autonomous behaviour. We exemplify the concepts through a simulated case study, and outline a research roadmap towards reliable aggregate autonomy.


Sign in / Sign up

Export Citation Format

Share Document