Simulation Study on Inventory Decision-Making of Global Mobile Logistics System Based on Multi-agent

Author(s):  
Ye Xuan
2021 ◽  
Author(s):  
Arthur Campbell

Abstract An important task for organizations is establishing truthful communication between parties with differing interests. This task is made particularly challenging when the accuracy of the information is poorly observed or not at all. In these settings, incentive contracts based on the accuracy of information will not be very effective. This paper considers an alternative mechanism that does not require any signal of the accuracy of any information communicated to provide incentives for truthful communication. Rather, an expert sacrifices future participation in decision-making to influence the current period’s decision in favour of their preferred project. This mechanism captures a notion often described as ‘political capital’ whereby an individual is able to achieve their own preferred decision in the current period at the expense of being able to exert influence in future decisions (‘spending political capital’). When the first-best is not possible in this setting, I show that experts hold more influence than under the first-best and that, in a multi-agent extension, a finite team size is optimal. Together these results suggest that a small number of individuals hold excessive influence in organizations.


Symmetry ◽  
2020 ◽  
Vol 12 (4) ◽  
pp. 631
Author(s):  
Chunyang Hu

In this paper, deep reinforcement learning (DRL) and knowledge transfer are used to achieve the effective control of the learning agent for the confrontation in the multi-agent systems. Firstly, a multi-agent Deep Deterministic Policy Gradient (DDPG) algorithm with parameter sharing is proposed to achieve confrontation decision-making of multi-agent. In the process of training, the information of other agents is introduced to the critic network to improve the strategy of confrontation. The parameter sharing mechanism can reduce the loss of experience storage. In the DDPG algorithm, we use four neural networks to generate real-time action and Q-value function respectively and use a momentum mechanism to optimize the training process to accelerate the convergence rate for the neural network. Secondly, this paper introduces an auxiliary controller using a policy-based reinforcement learning (RL) method to achieve the assistant decision-making for the game agent. In addition, an effective reward function is used to help agents balance losses of enemies and our side. Furthermore, this paper also uses the knowledge transfer method to extend the learning model to more complex scenes and improve the generalization of the proposed confrontation model. Two confrontation decision-making experiments are designed to verify the effectiveness of the proposed method. In a small-scale task scenario, the trained agent can successfully learn to fight with the competitors and achieve a good winning rate. For large-scale confrontation scenarios, the knowledge transfer method can gradually improve the decision-making level of the learning agent.


Symmetry ◽  
2020 ◽  
Vol 12 (7) ◽  
pp. 1115 ◽  
Author(s):  
Peng Jiang ◽  
Yixin Wang ◽  
Chao Liu ◽  
Yi-Chung Hu ◽  
Jingci Xie

The infectious disease COVID-19 has swept across the world in 2020, and it continues to cause massive losses of life and severe economic problems in all countries. Providing emergency supplies such as protective medical equipment and materials required to secure people’s livelihood is thus currently prioritized by governments. Establishing a reliable emergency logistics system is critical in this regard. This paper used the Delphi method to design a formal decision structure to assess emergency logistics system reliability (ELSR) by obtaining a consensus from a panel of experts. Assessing ELSR is a typical multiple-attribute decision making (MADM) problem, and the related MADM methods are usually on the basis of symmetry principles. A hybrid MADM model, called the Decision Making Trial and Evaluation Laboratory (DEMATEL)-based Analytical Network Process (D-ANP), was developed to identify the critical factors influencing ELSR. An analysis of empirical evidence showed that the emergency logistics command and coordination system and the emergency material supply system play important roles in ELSR, while the emergency logistics transportation and distribution system and the emergency information system are not so important. This conclusion is different from previous research about traditional disaster emergency logistics. Moreover, the cause–effect relationships among the key factors indicated that the system of command and coordination for emergency logistics and the supply system for emergency materials should be improved. Accordingly, effective suggestions for emergency logistics services for epidemic prevention are provided in this paper. The main contributions of this paper are (1) establishing a comprehensive and systematic evaluating index of ELSR for epidemic prevention; (2) employing a kind of structured, namely D-ANP, to identify the critical factors with non-commensurable and conflicting (competing) characteristics; and (3) comparing the differences of reliable criteria between the emergency logistics of epidemic prevention and the traditional disaster emergency logistics.


Sign in / Sign up

Export Citation Format

Share Document