Model-Free Learning of Optimal Deterministic Resource Allocations in Wireless Systems via Action-Space Exploration

2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) ◽

10.1109/mlsp52302.2021.9596327 ◽

2021 ◽

Author(s):

Hassaan Hashmi ◽

Dionysios S. Kalogerias

Keyword(s):

Space Exploration ◽

Wireless Systems ◽

Action Space ◽

Resource Allocations ◽

Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning

Sensors ◽

10.3390/s20164468 ◽

2020 ◽

Vol 20 (16) ◽

pp. 4468

Author(s):

Ao Xi ◽

Chao Chen

Keyword(s):

Reinforcement Learning ◽

Center Of Pressure ◽

Stable State ◽

Biped Robot ◽

Action Space ◽

Training Procedure ◽

Joint Angles ◽

Initial Control ◽

Hybrid Reinforcement

In this work, we introduced a novel hybrid reinforcement learning scheme to balance a biped robot (NAO) on an oscillating platform, where the rotation of the platform was treated as an external disturbance to the robot. The platform had two rotational degrees of freedom, pitch and roll. The state space comprised the position of the center of pressure and the joint angles and joint velocities of the two legs. The action space consisted of the joint angles of the ankles, knees, and hips. Applying inverse kinematics significantly reduced the dimension of the action space. A model-based system estimator was then employed during the offline training procedure to estimate the dynamics model of the system using novel hierarchical Gaussian processes and to provide initial control inputs, after which the reduced action space of each joint was obtained by minimizing the cost of reaching the desired stable state. Finally, a model-free optimizer based on DQN(λ) was introduced to fine-tune the initial control inputs, yielding the optimal control inputs for each joint at any state. The proposed scheme not only avoided the distribution mismatch problem but also improved sample efficiency. Simulation results showed that the proposed hybrid reinforcement learning mechanism enabled the NAO robot to balance on a platform oscillating at different frequencies and magnitudes. Both control performance and robustness were guaranteed during the experiments.

Model-Free Learning of Optimal Ergodic Policies in Wireless Systems

IEEE Transactions on Signal Processing ◽

10.1109/tsp.2020.3030073 ◽

2020 ◽

Vol 68 ◽

pp. 6272-6286

Author(s):

Dionysios S. Kalogerias ◽

Mark Eisen ◽

George J. Pappas ◽

Alejandro Ribeiro

Keyword(s):

Wireless Systems ◽

Learning Optimal Resource Allocations in Wireless Systems

IEEE Transactions on Signal Processing ◽

10.1109/tsp.2019.2908906 ◽

2019 ◽

Vol 67 (10) ◽

pp. 2775-2790

Author(s):

Mark Eisen ◽

Clark Zhang ◽

Luiz F. O. Chamon ◽

Daniel D. Lee ◽

Alejandro Ribeiro

Keyword(s):

Wireless Systems ◽

Resource Allocations ◽

Optimal Resource

Delay Models for Static and Adaptive Persistent Resource Allocations in Wireless Systems

IEEE Transactions on Mobile Computing ◽

10.1109/tmc.2015.2492546 ◽

2016 ◽

Vol 15 (9) ◽

pp. 2193-2205

Author(s):

Jason Brown ◽

Nusrat Afrin ◽

Jamil Y. Khan

Keyword(s):

Wireless Systems ◽

Resource Allocations ◽

Almost-Zero Duality Gaps in Model-Free Resource Allocation for Wireless Systems

2020 28th European Signal Processing Conference (EUSIPCO) ◽

10.23919/eusipco47968.2020.9287660 ◽

2021 ◽

Author(s):

Dionysios S. Kalogerias ◽

Mark Eisen ◽

George J. Pappas ◽

Alejandro Ribeiro

Keyword(s):

Resource Allocation ◽

Wireless Systems ◽

Free Resource

Autonomous blimp control using model-free reinforcement learning in a continuous state and action space

2007 IEEE/RSJ International Conference on Intelligent Robots and Systems ◽

10.1109/iros.2007.4399531 ◽

2007 ◽

Author(s):

Axel Rottmann ◽

Christian Plagemann ◽

Peter Hilgers ◽

Wolfram Burgard

Keyword(s):

Reinforcement Learning ◽

Action Space ◽

Continuous State

A SystemC-based Simulator for design space exploration of smart wireless systems

2018 Design, Automation & Test in Europe Conference & Exhibition (DATE) ◽

10.23919/date.2018.8342093 ◽

2018 ◽

Author(s):

Gabriele Miorandi ◽

Francesco Stefanni ◽

Federico Fraccaroli ◽

Davide Quaglia

Keyword(s):

Design Space Exploration ◽

Design Space ◽

Space Exploration ◽

Wireless Systems

Dual Domain Learning of Optimal Resource Allocations in Wireless Systems

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2019.8683150 ◽

2019 ◽

Author(s):

Mark Eisen ◽

Clark Zhang ◽

Luiz F. O. Chamon ◽

Daniel D. Lee ◽

Alejandro Ribeiro

Keyword(s):

Wireless Systems ◽

Resource Allocations ◽

Domain Learning ◽

Optimal Resource

Blocking analysis of persistent resource allocations for M2M applications in wireless systems

Transactions on Emerging Telecommunications Technologies ◽

10.1002/ett.3091 ◽

2016 ◽

Vol 27 (11) ◽

pp. 1513-1529

Author(s):

Jason Brown ◽

Nusrat Afrin ◽

Jamil Y. Khan

Keyword(s):

Wireless Systems ◽

Resource Allocations ◽

Blocking Analysis

Learning Statistically Accurate Resource Allocations in Non-Stationary Wireless Systems

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2018.8461444 ◽

2018 ◽

Author(s):

Mark Eisen ◽

Konstantinos Gatsis ◽

George J. Pappas ◽

Alejandro Ribeiro

Keyword(s):

Wireless Systems ◽

Resource Allocations