Generalized reinforcement learning for building control using Behavioral Cloning

Applied Energy ◽

10.1016/j.apenergy.2021.117602 ◽

2021 ◽

Vol 304 ◽

pp. 117602

Author(s):

Zachary E. Lee ◽

K. Max Zhang

Keyword(s):

Reinforcement Learning ◽

Building Control ◽

Behavioral Cloning

Download Full-text

Grid-Interactive Multi-Zone Building Control Using Reinforcement Learning with Global-Local Policy Search

2021 American Control Conference (ACC) ◽

10.23919/acc50511.2021.9482917 ◽

2021 ◽

Author(s):

Xiangyu Zhang ◽

Rohit Chintala ◽

Andrey Bernstein ◽

Peter Graf ◽

Xin Jin

Keyword(s):

Reinforcement Learning ◽

Local Policy ◽

Building Control ◽

Download Full-text

Ground Delay Program Analytics with Behavioral Cloning and Inverse Reinforcement Learning

14th AIAA Aviation Technology, Integration, and Operations Conference ◽

10.2514/6.2014-2026 ◽

2014 ◽

Author(s):

Michael J. Bloem ◽

Nicholas Bambos

Keyword(s):

Reinforcement Learning ◽

Inverse Reinforcement Learning ◽

Ground Delay Program ◽

Ground Delay ◽

Behavioral Cloning

Download Full-text

SWIRL: A sequential windowed inverse reinforcement learning algorithm for robot tasks with delayed rewards

The International Journal of Robotics Research ◽

10.1177/0278364918784350 ◽

2018 ◽

Vol 38 (2-3) ◽

pp. 126-145 ◽

Author(s):

Sanjay Krishnan ◽

Animesh Garg ◽

Richard Liaw ◽

Brijen Thananjeyan ◽

Lauren Miller ◽

...

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Search Algorithm ◽

Inverse Reinforcement Learning ◽

Parallel Parking ◽

Physical Experiments ◽

Long Time ◽

Reward Functions ◽

Behavioral Cloning

We present sequential windowed inverse reinforcement learning (SWIRL), a policy search algorithm that is a hybrid of exploration and demonstration paradigms for robot learning. We apply unsupervised learning to a small number of initial expert demonstrations to structure future autonomous exploration. SWIRL approximates a long time horizon task as a sequence of local reward functions and subtask transition conditions. Over this approximation, SWIRL applies Q-learning to compute a policy that maximizes rewards. Experiments suggest that SWIRL requires significantly fewer rollouts than pure reinforcement learning and fewer expert demonstrations than behavioral cloning to learn a policy. We evaluate SWIRL in two simulated control tasks, parallel parking and a two-link pendulum. On the parallel parking task, SWIRL achieves the maximum reward on the task with 85% fewer rollouts than Q-learning, and one-eight of demonstrations needed by behavioral cloning. We also consider physical experiments on surgical tensioning and cutting deformable sheets using a da Vinci surgical robot. On the deformable tensioning task, SWIRL achieves a 36% relative improvement in reward compared with a baseline of behavioral cloning with segmentation.

Download Full-text

Advanced Building Control via Deep Reinforcement Learning

Energy Procedia ◽

10.1016/j.egypro.2019.01.494 ◽

2019 ◽

Vol 158 ◽

pp. 6158-6163 ◽

Author(s):

Ruoxi Jia ◽

Ming Jin ◽

Kaiyu Sun ◽

Tianzhen Hong ◽

Costas Spanos

Keyword(s):

Reinforcement Learning ◽

Building Control

Download Full-text

The reinforcement learning method for occupant behavior in building control: A review

Energy and Built Environment ◽

10.1016/j.enbenv.2020.08.005 ◽

2020 ◽

Author(s):

Mengjie Han ◽

Jing Zhao ◽

Xingxing Zhang ◽

Jingchun Shen ◽

Yu Li

Keyword(s):

Reinforcement Learning ◽

Occupant Behavior ◽

Learning Method ◽

Building Control

Download Full-text

Flexible Reinforcement Learning Framework for Building Control using EnergyPlus-Modelica Energy Models

Proceedings of the 1st International Workshop on Reinforcement Learning for Energy Management in Buildings & Cities ◽

10.1145/3427773.3427873 ◽

2020 ◽

Author(s):

Joon-Yong Lee ◽

Sen Huang ◽

Aowabin Rahman ◽

Amanda D. Smith ◽

Srinivas Katipamula

Keyword(s):

Reinforcement Learning ◽

Building Control ◽

Learning Framework ◽

Download Full-text

Joint Behavioral Cloning and Reinforcement Learning Method for Propofol and Remifentanil Infusion in Anesthesia

2021 International Conference on Information Networking (ICOIN) ◽

10.1109/icoin50884.2021.9333933 ◽

2021 ◽

Author(s):

MyungJae Shin ◽

Joongheon Kim

Keyword(s):

Reinforcement Learning ◽

Learning Method ◽

Remifentanil Infusion ◽

Behavioral Cloning

Download Full-text

Autonomous Building Control Using Offline Reinforcement Learning

10.1007/978-3-030-89899-1_25 ◽

2021 ◽

pp. 246-255

Author(s):

Jorren Schepers ◽

Reinout Eyckerman ◽

Furkan Elmaz ◽

Wim Casteels ◽

Steven Latré ◽

...

Keyword(s):

Reinforcement Learning ◽

Building Control

Download Full-text

Two-Stage Reinforcement Learning Policy Search for Grid-Interactive Building Control

IEEE Transactions on Smart Grid ◽

10.1109/tsg.2022.3141625 ◽

2022 ◽

pp. 1-1

Author(s):

Xiangyu Zhang ◽

Yue Chen ◽

Andrey Bernstein ◽

Rohit Chintala ◽

Peter Graf ◽

...

Keyword(s):

Reinforcement Learning ◽

Two Stage ◽

Building Control ◽

Download Full-text

Ground Delay Program Analytics with Behavioral Cloning and Inverse Reinforcement Learning

Journal of Aerospace Information Systems ◽

10.2514/1.i010304 ◽

2015 ◽

Vol 12 (3) ◽

pp. 299-313 ◽

Author(s):

Michael Bloem ◽

Nicholas Bambos

Keyword(s):

Reinforcement Learning ◽

Inverse Reinforcement Learning ◽

Ground Delay Program ◽

Ground Delay ◽

Behavioral Cloning

Download Full-text