Investigation of the Dynamics of the Formation of Collective Immunity in a Pandemic Using Multi-agent Modeling and Selective Neural Network

Author(s):  
Mikhail Mazurov
Author(s):  
Yanlin Han ◽  
Piotr Gmytrasiewicz

This paper introduces the IPOMDP-net, a neural network architecture for multi-agent planning under partial observability. It embeds an interactive partially observable Markov decision process (I-POMDP) model and a QMDP planning algorithm that solves the model in a neural network architecture. The IPOMDP-net is fully differentiable and allows for end-to-end training. In the learning phase, we train an IPOMDP-net on various fixed and randomly generated environments in a reinforcement learning setting, assuming observable reinforcements and unknown (randomly initialized) model functions. In the planning phase, we test the trained network on new, unseen variants of the environments under the planning setting, using the trained model to plan without reinforcements. Empirical results show that our model-based IPOMDP-net outperforms the other state-of-the-art modelfree network and generalizes better to larger, unseen environments. Our approach provides a general neural computing architecture for multi-agent planning using I-POMDPs. It suggests that, in a multi-agent setting, having a model of other agents benefits our decision-making, resulting in a policy of higher quality and better generalizability.


2020 ◽  
Author(s):  
Douglas Meneghetti ◽  
Reinaldo Bianchi

This work proposes a neural network architecture that learns policies for multiple agent classes in a heterogeneous multi-agent reinforcement setting. The proposed network uses directed labeled graph representations for states, encodes feature vectors of different sizes for different entity classes, uses relational graph convolution layers to model different communication channels between entity types and learns distinct policies for different agent classes, sharing parameters wherever possible. Results have shown that specializing the communication channels between entity classes is a promising step to achieve higher performance in environments composed of heterogeneous entities.


2016 ◽  
Vol 28 (04) ◽  
pp. 1650028
Author(s):  
Julien Henriet ◽  
Christophe Lang ◽  
Ronnie Muthada Pottayya ◽  
Karla Breschi

Three dimensional (3D) voxel phantoms are numerical representations of human bodies, used by physicians in very different contexts. In the controlled context of hospitals, where from 2 to 10 subjects may arrive per day, phantoms are used to verify computations before therapeutic exposure to radiation of cancerous tumors. In addition, 3D phantoms are used to diagnose the gravity of accidental exposure to radiation. In such cases, there may be from 10 to more than 1000 subjects to be diagnosed simultaneously. In all of these cases, computation accuracy depends on a single such representation. In this paper, we present EquiVox which is a tool composed of several distributed functions and enables to create, as quickly and as accurately as possible, 3D numerical phantoms that fit anyone, whatever the context. It is based on a multi-agent system. Agents are convenient for this kind of structure, they can interact together and they may have individual capacities. In EquiVox, the phantoms adaptation is a key phase based on artificial neural network (ANN) interpolations. Thus, ANNs must be trained regularly in order to take into account newly capitalized subjects and to increase interpolation accuracy. However, ANN training is a time-consuming process. Consequently, we have built Equivox to optimize this process. Thus, in this paper, we present our architecture, based on agents and ANN, and we put the stress on the adaptation module. We propose, next, some experimentations in order to show the efficiency of the EquiVox architecture.


Sign in / Sign up

Export Citation Format

Share Document