Training Multiagent Systems by Q-Learning: Approaches and Empirical Results

Jose Manuel Lopez-Guede; Borja Fernandez-Gauna; Manuel Graña; Ekaitz Zulueta

doi:10.1111/coin.12035

Computational Intelligence ◽

10.1111/coin.12035 ◽

2014 ◽

Vol 31 (3) ◽

pp. 498-512 ◽

Cited By ~ 3

Author(s):

Jose Manuel Lopez-Guede ◽

Borja Fernandez-Gauna ◽

Manuel Graña ◽

Ekaitz Zulueta

Keyword(s):

Multiagent Systems ◽

Learning Approaches ◽

Q Learning ◽

Empirical Results

General Second-Order Consensus of Discrete-Time Multiagent Systems via Q-Learning Method

IEEE Transactions on Systems, Man, and Cybernetics: Systems ◽

10.1109/tsmc.2020.3019519 ◽

2020 ◽

pp. 1-9

Author(s):

Yifan Liu ◽

Housheng Su

Keyword(s):

Multiagent Systems ◽

Discrete Time ◽

Second Order ◽

Learning Method ◽

Q Learning

Output-Feedback Global Consensus of Discrete-Time Multiagent Systems Subject to Input Saturation via Q-Learning Method

IEEE Transactions on Cybernetics ◽

10.1109/tcyb.2020.2987385 ◽

2020 ◽

pp. 1-10

Author(s):

Mingkang Long ◽

Housheng Su ◽

Zhigang Zeng

Keyword(s):

Multiagent Systems ◽

Discrete Time ◽

Output Feedback ◽

Input Saturation ◽

Learning Method ◽

Q Learning ◽

Global Consensus

Model-Free Algorithms for Containment Control of Saturated Discrete-Time Multiagent Systems via Q-Learning Method

IEEE Transactions on Systems, Man, and Cybernetics: Systems ◽

10.1109/tsmc.2020.3019504 ◽

2020 ◽

pp. 1-9

Author(s):

Mingkang Long ◽

Housheng Su ◽

Zhigang Zeng

Keyword(s):

Multiagent Systems ◽

Discrete Time ◽

Learning Method ◽

Containment Control ◽

Q Learning ◽

Model Free

Reinforcement Learning vs Genetic Algorithms in Game-Theoretic Cyber-Security

10.31237/osf.io/nxzep ◽

2018 ◽

Cited By ~ 1

Author(s):

Stefan Niculae

Keyword(s):

Reinforcement Learning ◽

Cyber Security ◽

Large Scale ◽

Human Performance ◽

Learning Approaches ◽

Classifier Systems ◽

Penetration Testing ◽

Q Learning ◽

Game Theoretic ◽

Security Game

Penetration testing is the practice of performing a simulated attack on a computer system in order to reveal its vulnerabilities. The most common approach is to gain information and then plan and execute the attack manually, by a security expert. This manual method cannot meet the speed and frequency required for efficient, large-scale security solutions development. To address this, we formalize penetration testing as a security game between an attacker who tries to compromise a network and a defending adversary actively protecting it. We compare multiple algorithms for finding the attacker’s strategy, from fixed-strategy to Reinforcement Learning, namely Q-Learning (QL), Extended Classifier Systems (XCS) and Deep Q-Networks (DQN). The attacker’s strength is measured in terms of speed and stealthiness, in the specific environment used in our simulations. The results show that QL surpasses human performance, XCS yields worse than human performance but is more stable, and the slow convergence of DQN keeps it from achieving exceptional performance. In addition, we find that all of these Machine Learning approaches outperform fixed-strategy attackers.

The equilibrium of venture capital incentive contract: Optimization and Q-learning approaches

Scientia Iranica ◽

10.24200/sci.2020.55059.4050 ◽

2020

Author(s):

Seyed Hossein Jafarpour Rezaei ◽

Mohammad Ali Rastegar

Keyword(s):

Venture Capital ◽

Learning Approaches ◽

Incentive Contract ◽

Q Learning

Selective auditory attention detection using dynamic learning systems: The study of RNN and reinforcement learning

10.1101/2021.02.18.431748 ◽

2021 ◽

Author(s):

Masoud Geravanchizadeh ◽

Hossein Roushan

Keyword(s):

Reinforcement Learning ◽

Detection System ◽

Auditory Attention ◽

Final Decision ◽

Learning Approaches ◽

Cocktail Party ◽

Dynamic Learning ◽

Learning Stage ◽

Q Learning ◽

Markov Decision

The cocktail party phenomenon describes the ability of the human brain to focus auditory attention on a particular stimulus while ignoring other acoustic events. Selective auditory attention detection (SAAD) is an important issue in the development of brain-computer interface systems and cocktail party processors. This paper proposes a new dynamic attention detection system to process the temporal evolution of the input signal. In the proposed dynamic system, after preprocessing of the input signals, the probabilistic state space of the system is formed. Then, in the learning stage, different dynamic learning methods, including recurrent neural network (RNN) and reinforcement learning (Markov decision process (MDP) and deep Q-learning), are applied to make the final decision as to the attended speech. Among the different dynamic learning approaches, the evaluation results show that the deep Q-learning approach (MDP+RNN) provides the highest classification accuracy (94.2%) with the least detection delay. The proposed SAAD system is advantageous in the sense that the detection of attention is performed dynamically for the sequential inputs. The system also has the potential to be used in scenarios where the attention of the listener might switch over time in the presence of various acoustic events.

Q-learning solution for optimal consensus control of discrete-time multiagent systems using reinforcement learning

Journal of the Franklin Institute ◽

10.1016/j.jfranklin.2019.06.007 ◽

2019 ◽

Vol 356 (13) ◽

pp. 6946-6967 ◽

Cited By ~ 7

Author(s):

Chaoxu Mu ◽

Qian Zhao ◽

Zhongke Gao ◽

Changyin Sun

Keyword(s):

Reinforcement Learning ◽

Multiagent Systems ◽

Discrete Time ◽

Consensus Control ◽

Q Learning

Strategic air traffic flow management under uncertainties using scalable sampling-based dynamic programming and Q-learning approaches

2017 11th Asian Control Conference (ASCC) ◽

10.1109/ascc.2017.8287327 ◽

2017 ◽

Cited By ~ 5

Author(s):

Junfei Xie ◽

Yan Wan ◽

F. L. Lewis

Keyword(s):

Dynamic Programming ◽

Traffic Flow ◽

Air Traffic ◽

Learning Approaches ◽

Flow Management ◽

Air Traffic Flow Management ◽

Q Learning ◽

Air Traffic Flow ◽

Traffic Flow Management

Optimal Tracking Control of Nonlinear Multiagent Systems Using Internal Reinforce Q-Learning

IEEE Transactions on Neural Networks and Learning Systems ◽

10.1109/tnnls.2021.3055761 ◽

2021 ◽

pp. 1-13

Author(s):

Zhinan Peng ◽

Rui Luo ◽

Jiangping Hu ◽

Kaibo Shi ◽

Sing Kiong Nguang ◽

...

Keyword(s):

Multiagent Systems ◽

Tracking Control ◽

Q Learning ◽

Optimal Tracking ◽

Optimal Tracking Control

Use of Q-learning approaches for practical medium access control in wireless sensor networks

Engineering Applications of Artificial Intelligence ◽

10.1016/j.engappai.2016.06.012 ◽

2016 ◽

Vol 55 ◽

pp. 146-154 ◽

Cited By ~ 14

Author(s):

Selahattin Kosunalp ◽

Yi Chu ◽

Paul D. Mitchell ◽

David Grace ◽

Tim Clarke

Keyword(s):

Wireless Sensor Networks ◽

Sensor Networks ◽

Access Control ◽

Medium Access Control ◽

Wireless Sensor ◽

Learning Approaches ◽

Medium Access ◽

Q Learning
