scholarly journals Distributed Multi-agent Meta Learning for Trajectory Design in Wireless Drone Networks

Author(s):  
Ye Hu ◽  
Mingzhe Chen ◽  
Walid Saad ◽  
H. Vincent Poor ◽  
Shuguang Cui
Author(s):  
M Vasile ◽  
F Zuiani

This article presents an algorithm for multi-objective optimization that blends together a number of heuristics. A population of agents combines heuristics that aim at exploring the search space both globally and in a neighbourhood of each agent. These heuristics are complemented with a combination of a local and global archive. The novel agent-based algorithm is tested at first on a set of standard problems and then on three specific problems in space trajectory design. Its performance is compared against a number of state-of-the-art multi-objective optimization algorithms that use the Pareto dominance as selection criterion: non-dominated sorting genetic algorithm (NSGA-II), Pareto archived evolution strategy (PAES), multiple objective particle swarm optimization (MOPSO), and multiple trajectory search (MTS). The results demonstrate that the agent-based search can identify parts of the Pareto set that the other algorithms were not able to capture. Furthermore, convergence is statistically better although the variance of the results is in some cases higher.


Sensors ◽  
2018 ◽  
Vol 18 (12) ◽  
pp. 4202 ◽  
Author(s):  
Dan Popescu ◽  
Cristian Dragana ◽  
Florin Stoican ◽  
Loretta Ichim ◽  
Grigore Stamatescu

Large-scale monitoring systems have seen rapid development in recent years. Wireless sensor networks (WSN), composed of thousands of sensing, computing and communication nodes, form the backbone of such systems. Integration with unmanned aerial vehicles (UAVs) leads to increased monitoring area and to better overall performance. This paper presents a hybrid UAV-WSN network which is self-configured to improve the acquisition of environmental data across large areas. A prime objective and novelty of the heterogeneous multi-agent scheme proposed here is the optimal generation of reference trajectories, parameterized after inter- and intra-line distances. The main contribution is the trajectory design, optimized to avoid interdicted regions, to pass near predefined way-points, with guaranteed communication time, and to minimize total path length. Mixed-integer description is employed into the associated constrained optimization problem. The second novelty is the sensor localization and clustering method for optimal ground coverage taking into account the communication information between UAV and a subset of ground sensors (i.e., the cluster heads). Results show improvements in both network and data collection efficiency metrics by implementing the proposed algorithms. These are initially evaluated by means of simulation and then validated on a realistic WSN-UAV test-bed, thus bringing significant practical value.


Author(s):  
Wenqian Liang ◽  
Ji Wang ◽  
Weidong Bao ◽  
Xiaomin Zhu ◽  
Qingyong Wang ◽  
...  

AbstractMulti-agent reinforcement learning (MARL) methods have shown superior performance to solve a variety of real-world problems focusing on learning distinct policies for individual tasks. These approaches face problems when applied to the non-stationary real-world: agents trained in specialized tasks cannot achieve satisfied generalization performance across multiple tasks; agents have to learn and store specialized policies for individual task and reliable identities of tasks are hardly observable in practice. To address the challenge continuously adapting to multiple tasks in MARL, we formalize the problem into a two-stage curriculum. Single-task policies are learned with MARL approaches, after that we develop a gradient-based Self-Adaptive Meta-Learning algorithm, SAML, that cannot only distill single-task policies into a unified policy but also can facilitate the unified policy to continuously adapt to new incoming tasks. In addition, to validate the continuous adaptation performance on complex task, we extend the widely adopted StarCraft benchmark SMAC and develop a new multi-task multi-agent StarCraft environment, Meta-SMAC, for testing various aspects of continuous adaptation method. Our experiments with a population of agents show that our method enables significantly more efficient adaptation than reactive baselines across different scenarios.


Sign in / Sign up

Export Citation Format

Share Document