scholarly journals A Markovian Mechanism of Proportional Resource Allocation in the Incentive Model as a Dynamic Stochastic Inverse Stackelberg Game

Mathematics ◽  
2018 ◽  
Vol 6 (8) ◽  
pp. 131 ◽  
Author(s):  
Grigory Belyavsky ◽  
Natalya Danilova ◽  
Guennady Ougolnitsky

This paper considers resource allocation among producers (agents) in the case where the Principal knows nothing about their cost functions while the agents have Markovian awareness about his/her strategies. We use a dynamic setup of the stochastic inverse Stackelberg game as the model. We suggest an algorithm for solving this game based on Q-learning. The associated Bellman equations contain functions of one variable for the Principal and also for the agents. The new results are illustrated by numerical examples.

2020 ◽  
Vol 10 (5) ◽  
pp. 1557
Author(s):  
Weijia Feng ◽  
Xiaohui Li

Ultra-dense and highly heterogeneous network (HetNet) deployments make the allocation of limited wireless resources among ubiquitous Internet of Things (IoT) devices an unprecedented challenge in 5G and beyond (B5G) networks. The interactions among mobile users and HetNets remain to be analyzed, where mobile users choose optimal networks to access and the HetNets adopt proper methods for allocating their own network resource. Existing works always need complete information among mobile users and HetNets. However, it is not practical in a realistic situation where important individual information is protected and will not be public to others. This paper proposes a distributed pricing and resource allocation scheme based on a Stackelberg game with incomplete information. The proposed model proves to be more practical by solving the problem that important information of either mobile users or HetNets is difficult to acquire during the resource allocation process. Considering the unknowability of channel gain information, the follower game among users is modeled as an incomplete information game, and channel gain is regarded as the type of each player. Given the pricing strategies of networks, users will adjust their bandwidth requesting strategies to maximize their expected utility. While based on the sub-equilibrium obtained in the follower game, networks will correspondingly update their pricing strategies to be optimal. The existence and uniqueness of Bayesian Nash equilibrium is proved. A probabilistic prediction method realizes the feasibility of the incomplete information game, and a reverse deduction method is utilized to obtain the game equilibrium. Simulation results show the superior performance of the proposed method.


IEEE Access ◽  
2021 ◽  
pp. 1-1
Author(s):  
Qi Zhai ◽  
Miodrag Bolic ◽  
Yong Li ◽  
Wei Cheng ◽  
Chenxi Liu

Author(s):  
Huashuai Zhang ◽  
Tingmei Wang ◽  
Haiwei Shen

The resource optimization of ultra-dense networks (UDNs) is critical to meet the huge demand of users for wireless data traffic. But the mainstream optimization algorithms have many problems, such as the poor optimization effect, and high computing load. This paper puts forward a wireless resource allocation algorithm based on deep reinforcement learning (DRL), which aims to maximize the total throughput of the entire network and transform the resource allocation problem into a deep Q-learning process. To effectively allocate resources in UDNs, the DRL algorithm was introduced to improve the allocation efficiency of wireless resources; the authors adopted the resource allocation strategy of the deep Q-network (DQN), and employed empirical repetition and target network to overcome the instability and divergence of the results caused by the previous network state, and to solve the overestimation of the Q value. Simulation results show that the proposed algorithm can maximize the total throughput of the network, while making the network more energy-efficient and stable. Thus, it is very meaningful to introduce the DRL to the research of UDN resource allocation.


2020 ◽  
Vol 13 ◽  
pp. 8-23
Author(s):  
Movlatkhan T. Agieva ◽  
◽  
Olga I. Gorbaneva ◽  

We consider a dynamic Stackelberg game theoretic model of the coordination of social and private interests (SPICE-model) of resource allocation in marketing networks. The dynamics of controlled system describes an interaction of the members of a target audience (basic agents) that leads to a change of their opinions (cost of buying the goods and services of firms competing on a market). An interaction of the firms (influence agents) is formalized as their differential game in strategic form. The payoff functional of each firm includes two terms: the summary opinion of the basic agents with consideration of their marketing costs (a common interest of all firms), and the income from investments in a private activity. The latter income is described by a linear function. The firms exert their influence not to all basic agents but only to the members of strong subgroups of the influence digraph (opinion leaders). The opinion leaders determine the stable final opinions of all members of the target audience. A coordinating principal determines the firms' marketing budgets and maximizes the summary opinion of the basic agents with consideration of the allocated resources. The Nash equilibrium in the game of influence agents and the Stackelberg equilibrium in a general hierarchical game of the principal with them are found. It is proved that the value of opinion of a basic agent is the same for all influence agents and the principal. It is also proved that the influence agents assign less resources for the marketing efforts than the principal would like.


Sign in / Sign up

Export Citation Format

Share Document