Regularized Cost-Model Oblivious Database Tuning with Reinforcement Learning

Author(s):  
Debabrota Basu ◽  
Qian Lin ◽  
Weidong Chen ◽  
Hoang Tam Vo ◽  
Zihong Yuan ◽  
...  

The International Journal of Robotics Research ◽  
2017 ◽  
Vol 36 (10) ◽  
pp. 1073-1087 ◽  
Author(s):  
Markus Wulfmeier ◽  
Dushyant Rao ◽  
Dominic Zeng Wang ◽  
Peter Ondruska ◽  
Ingmar Posner

We present an approach for learning spatial traversability maps for driving in complex urban environments, based on an extensive dataset demonstrating the driving behaviour of human experts. The direct end-to-end mapping from raw input data to cost bypasses the effort of manually designing parts of the pipeline, exploits a large number of data samples, and can additionally be framed to refine handcrafted cost maps built from hand-engineered features. To achieve this, we introduce a maximum-entropy-based, non-linear inverse reinforcement learning (IRL) framework which exploits the capacity of fully convolutional neural networks (FCNs) to represent the cost model underlying driving behaviours. This high-capacity, deep, parametric approach scales to more complex environments and driving behaviours, while its run-time at deployment is independent of the training dataset size. After benchmarking against state-of-the-art IRL approaches, we focus on demonstrating scalability and performance on an ambitious dataset collected over the course of one year, comprising more than 25,000 demonstration trajectories extracted from over 120 km of urban driving. We evaluate the resulting cost representations by showing their advantages over a carefully hand-designed cost map, and further demonstrate their robustness to systematic errors by learning accurate representations even in the presence of calibration perturbations. Importantly, we show that a manually designed cost map can be refined to more accurately handle corner cases that are rarely seen in the environment, such as stairs, slopes and underpasses, by further incorporating human priors into the training framework.
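The abstract does not spell out the training step, but the maximum-entropy deep IRL update it refers to can be sketched compactly. The PyTorch snippet below is a minimal illustration under stated assumptions: a bird's-eye grid representation, an FCN (`CostFCN`, a name chosen here for illustration), and expert and learner state-visitation frequencies (`expert_svf`, `learner_svf`) precomputed externally, e.g. by soft value iteration over the current cost map; none of these details are taken from the paper itself.

```python
import torch
import torch.nn as nn

class CostFCN(nn.Module):
    """Fully convolutional network mapping per-cell input features
    (e.g. a bird's-eye sensor grid) to a per-cell traversal cost."""
    def __init__(self, in_channels: int = 3):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_channels, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 1, 1),  # one cost value per grid cell
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x).squeeze(1)  # (B, H, W) cost map

def medirl_step(model, optimizer, features, expert_svf, learner_svf):
    """One maximum-entropy deep IRL gradient step (sketch).

    For a cost (negative reward) parameterisation, the gradient of the
    demonstration negative log-likelihood w.r.t. the cost map is the
    difference between the experts' empirical state-visitation
    frequencies and the learner's expected ones under the current cost;
    backward() pushes that per-cell gradient into the FCN weights.
    """
    cost_map = model(features)                 # (B, H, W)
    grad_wrt_cost = expert_svf - learner_svf   # dNLL/dcost, per cell
    optimizer.zero_grad()
    cost_map.backward(gradient=grad_wrt_cost)  # vector-Jacobian product
    optimizer.step()
```

Note how the demonstrations enter training only through the visitation-frequency difference: at deployment, inference is a single forward pass of the FCN, consistent with the run-time independence from dataset size noted above.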


Energies ◽  
2021 ◽  
Vol 14 (22) ◽  
pp. 7749
Author(s):  
Wenying Li ◽  
Ming Tang ◽  
Xinzhen Zhang ◽  
Danhui Gao ◽  
Jian Wang

Battery energy storage systems (BESSs) can facilitate economical operation of the grid through demand response (DR) and are regarded as the most significant DR resource. Among them, distributed BESSs integrated with home photovoltaics (PV) have developed rapidly and account for nearly 40% of newly installed capacity. However, the usage scenarios and utilization efficiency of distributed BESSs are far from sufficient to exploit their load potential and to overcome the uncertainties caused by uncoordinated operation. In this paper, the low-voltage transformer-powered area (LVTPA) is first defined, and a DR grid-edge controller based on deep reinforcement learning is then implemented to maximize the total DR benefits and promote three-phase balance in the LVTPA. The proposed DR problem is formulated as a Markov decision process (MDP), and the deep deterministic policy gradient (DDPG) algorithm is applied to train the controller to learn the optimal DR strategy. Additionally, a life-cycle cost model of the BESS is established and incorporated into the DR scheme to measure income. Numerical results, compared against deep Q-learning and model-based methods, demonstrate the effectiveness and validity of the proposed method.
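As a rough illustration of the actor-critic training the abstract outlines, the following is a generic DDPG update step in PyTorch. The state and action definitions (per-phase loads, PV output, BESS state of charge as state; a continuous charge/discharge command as action), the network sizes, and the function names are assumptions made for this sketch, not the paper's exact design.

```python
import torch
import torch.nn as nn

class Actor(nn.Module):
    """Maps the LVTPA state (e.g. per-phase loads, PV output, BESS state
    of charge) to a continuous charge/discharge command in [-1, 1]."""
    def __init__(self, state_dim: int, action_dim: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 128), nn.ReLU(),
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, action_dim), nn.Tanh(),
        )

    def forward(self, s):
        return self.net(s)

class Critic(nn.Module):
    """Q(s, a): expected discounted DR benefit for acting with a in s."""
    def __init__(self, state_dim: int, action_dim: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim, 128), nn.ReLU(),
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, 1),
        )

    def forward(self, s, a):
        return self.net(torch.cat([s, a], dim=-1))

def ddpg_update(actor, critic, actor_t, critic_t, opt_a, opt_c, batch,
                gamma=0.99, tau=0.005):
    """One DDPG step on a replay batch (s, a, r, s2, done) of tensors."""
    s, a, r, s2, done = batch
    # Critic: minimise TD error against the frozen target networks.
    with torch.no_grad():
        q_target = r + gamma * (1 - done) * critic_t(s2, actor_t(s2))
    critic_loss = nn.functional.mse_loss(critic(s, a), q_target)
    opt_c.zero_grad(); critic_loss.backward(); opt_c.step()
    # Actor: ascend the critic's estimate of Q(s, pi(s)).
    actor_loss = -critic(s, actor(s)).mean()
    opt_a.zero_grad(); actor_loss.backward(); opt_a.step()
    # Polyak-average the target networks toward the online ones.
    for t, p in zip(actor_t.parameters(), actor.parameters()):
        t.data.mul_(1 - tau).add_(tau * p.data)
    for t, p in zip(critic_t.parameters(), critic.parameters()):
        t.data.mul_(1 - tau).add_(tau * p.data)
```

In the paper's scheme the per-step reward would net DR income against the battery's life-cycle cost; in this sketch, `r` is simply whatever the environment returns.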


2021 ◽  
pp. 101-110
Author(s):  
Yanfeng Chai ◽  
Jiake Ge ◽  
Yunpeng Chai ◽  
Xin Wang ◽  
BoXuan Zhao

1994 ◽  
Vol 11 (1) ◽  
pp. 47-56
Author(s):  
Virginia C. Day ◽  
Zachary F. Lansdowne ◽  
Richard A Moynihan ◽  
John A. Vitkevich

Decision ◽  
2016 ◽  
Vol 3 (2) ◽  
pp. 115-131 ◽  
Author(s):  
Helen Steingroever ◽  
Ruud Wetzels ◽  
Eric-Jan Wagenmakers
