A Novel Implementation of Q-Learning for the Whittle Index

Mapping Intimacies ◽

10.1007/978-3-030-92511-6_10 ◽

2021 ◽

pp. 154-170

Author(s):

Lachlan J. Gibson ◽

Peter Jacko ◽

Yoni Nazarathy

Keyword(s):

Download Full-text

Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/556 ◽

2021 ◽

Author(s):

Arpita Biswas ◽

Gaurav Aggarwal ◽

Pradeep Varakantham ◽

Milind Tambe

Keyword(s):

Adaptive Learning ◽

Transition Probabilities ◽

A Priori ◽

Optimal Solution ◽

Health Programs ◽

Preventive Healthcare ◽

Health Checks ◽

Whittle Index ◽

Care Information

In many public health settings, it is important for patients to adhere to health programs, such as taking medications and periodic health checks. Unfortunately, beneficiaries may gradually disengage from such programs, which is detrimental to their health. A concrete example of gradual disengagement has been observed by an organization that carries out a free automated call-based program for spreading preventive care information among pregnant women. Many women stop picking up calls after being enrolled for a few months. To avoid such disengagements, it is important to provide timely interventions. Such interventions are often expensive and can be provided to only a small fraction of the beneficiaries. We model this scenario as a restless multi-armed bandit (RMAB) problem, where each beneficiary is assumed to transition from one state to another depending on the intervention. Moreover, since the transition probabilities are unknown a priori, we propose a Whittle index based Q-Learning mechanism and show that it converges to the optimal solution. Our method improves over existing learning-based methods for RMABs on multiple benchmarks from literature and also on the maternal healthcare dataset.

Download Full-text

Towards Q-learning the Whittle Index for Restless Bandits

2019 Australian & New Zealand Control Conference (ANZCC) ◽

10.1109/anzcc47194.2019.8945748 ◽

2019 ◽

Author(s):

Jing Fu ◽

Yoni Nazarathy ◽

Sarat Moka ◽

Peter G. Taylor

Keyword(s):

Restless Bandits ◽

Download Full-text

Q-learning based Service Function Chaining using VNF Resource-aware Reward Model

2020 21st Asia-Pacific Network Operations and Management Symposium (APNOMS) ◽

10.23919/apnoms50412.2020.9236975 ◽

2020 ◽

Author(s):

Doyoung Lee ◽

Jae-Hyoung Yoo ◽

James Won-Ki Hong

Keyword(s):

Service Function ◽

Download Full-text

Q-learning Based Radio Channels Utility Evaluation Algorithm for the Local Dynamic Spectrum Management in Mobile Ad-hoc Networks

2020 Baltic URSI Symposium (URSI) ◽

10.23919/ursi48707.2020.9254037 ◽

2020 ◽

Author(s):

Krzysztof Malon ◽

Jerzy Lopatka ◽

Pawel Skokowski

Keyword(s):

Ad Hoc ◽

Spectrum Management ◽

Dynamic Spectrum ◽

Local Dynamic ◽

Dynamic Spectrum Management ◽

Evaluation Algorithm ◽

Utility Evaluation ◽

Mobile Ad Hoc ◽

Download Full-text

Prioritized epoch-incremental Q-learning algorithm

Theoretical and Applied Informatics ◽

10.2478/v10179-012-0008-1 ◽

2012 ◽

Vol 24 (2) ◽

Author(s):

Roman Zajdel

Keyword(s):

Learning Algorithm ◽

Download Full-text

Applying Q-Learning in intrusion policy prediction based on preferences on advanced persistent threats

Advances in Management Engineering and Information Technology ◽

10.2495/ameit140841 ◽

2015 ◽

Author(s):

S.H. Chien ◽

C.S. Ho ◽

C.H. Chen

Keyword(s):

Advanced Persistent Threats

Download Full-text

Q-learning System Based on Cooperative Least Squares Support Vector Machine

ACTA AUTOMATICA SINICA ◽

10.3724/sp.j.1004.2009.00214 ◽

2009 ◽

Vol 35 (2) ◽

pp. 214-219 ◽

Author(s):

Xue-Song WANG ◽

Xi-Lan TIAN ◽

Yu-Hu CHENG ◽

Jian-Qiang YI

Keyword(s):

Support Vector Machine ◽

Least Squares ◽

Learning System ◽

Support Vector ◽

Download Full-text

Application of improved Q learning algorithm to job shop problem

Journal of Computer Applications ◽

10.3724/sp.j.1087.2008.03268 ◽

2009 ◽

Vol 28 (12) ◽

pp. 3268-3270

Author(s):

Chao WANG ◽

Jing GUO ◽

Zhen-qiang BAO

Keyword(s):

Job Shop ◽

Learning Algorithm ◽

Download Full-text

New Q-learning based heterogeneous network selection algorithm

Journal of Computer Applications ◽

10.3724/sp.j.1087.2011.01461 ◽

2012 ◽

Vol 31 (6) ◽

pp. 1461-1464

Author(s):

Yan-qing ZHAO ◽

Qi ZHU

Keyword(s):

Heterogeneous Network ◽

Network Selection ◽

Selection Algorithm ◽

Download Full-text

Q-Learning Based Sensing Task Management Algorithm for Cognitive Radio Systems

JOURNAL OF ELECTRONICS INFORMATION TECHNOLOGY ◽

10.3724/sp.j.1146.2009.00296 ◽

2010 ◽

Vol 32 (3) ◽

pp. 623-628 ◽

Author(s):

Mo Li ◽

You-yun Xu ◽

Yue-ming Cai

Keyword(s):

Cognitive Radio ◽

Task Management ◽

Management Algorithm ◽

Cognitive Radio Systems ◽

Download Full-text