Performance Evaluation of Tile Coding in Reinforcement Learning

Actor-Critic–Like Stochastic Adaptive Search for Continuous Simulation Optimization

Operations Research ◽

10.1287/opre.2021.2214 ◽

2021 ◽

Author(s):

Qi Zhang ◽

Jiaqiao Hu

Keyword(s):

Performance Evaluation ◽

Reinforcement Learning ◽

Simulation Optimization ◽

Search Algorithms ◽

Benchmark Problems ◽

Adaptive Search ◽

Time Analysis ◽

Simulation Experiments ◽

Simulation Data ◽

Continuous Simulation

Many systems arising in applications from engineering design, manufacturing, and healthcare require the use of simulation optimization (SO) techniques to improve their performance. In “Actor-Critic–Like Stochastic Adaptive Search for Continuous Simulation Optimization,” Q. Zhang and J. Hu propose a randomized approach that integrates ideas from actor-critic reinforcement learning within a class of adaptive search algorithms for solving SO problems. The approach fully retains the previous simulation data and incorporates them into an approximation architecture to exploit knowledge of the objective function in searching for improved solutions. The authors provide a finite-time analysis for the method when only a single simulation observation is collected at each iteration. The method works well on a diverse set of benchmark problems and has the potential to yield good performance for complex problems using expensive simulation experiments for performance evaluation.

Download Full-text

Upper Bounds on the Performance of Discretisation in Reinforcement Learning

South African Computer Journal ◽

10.18489/sacj.v0i57.284 ◽

2015 ◽

Author(s):

Michael Robin Mitchley

Keyword(s):

Reinforcement Learning ◽

Value Function ◽

Value Function Approximation ◽

Learning Framework ◽

A Value ◽

Continuous State Space ◽

Policy Representation ◽

Continuous State ◽

Tile Coding ◽

Policy Mapping

Reinforcement learning is a machine learning framework whereby an agent learns to perform a task by maximising its total reward received for selecting actions in each state. The policy mapping states to actions that the agent learns is either represented explicitly, or implicitly through a value function. It is common in reinforcement learning to discretise a continuous state space using tile coding or binary features. We prove an upper bound on the performance of discretisation for direct policy representation or value function approximation.

Download Full-text

Research on Performance Evaluation of Wargame System Based on Deep Reinforcement Learning

Journal of Physics Conference Series ◽

10.1088/1742-6596/1302/3/032028 ◽

2019 ◽

Vol 1302 ◽

pp. 032028

Author(s):

Tongfei Shang ◽

Jinyun Wu ◽

Jianfeng Ma

Keyword(s):

Performance Evaluation ◽

Reinforcement Learning

Download Full-text

Performance Evaluation of Reinforcement Learning Algorithm for Control of Smart TMD

Journal of the Korean Association for Spatial Structures ◽

10.9712/kass.2021.21.2.41 ◽

2021 ◽

Vol 21 (2) ◽

pp. 41-48

Author(s):

Joo-Won Kang ◽

◽

Hyun-Su Kim

Keyword(s):

Performance Evaluation ◽

Reinforcement Learning ◽

Learning Algorithm ◽

Reinforcement Learning Algorithm

Download Full-text

Modeling Performance Evaluation of Reinforcement Learning Based Routing Algorithm for Scalable Non-cooperative Ad-hoc Environment

Communications in Computer and Information Science - Advances in Computing, Communication and Control ◽

10.1007/978-3-642-18440-6_34 ◽

2011 ◽

pp. 269-274

Author(s):

Shrirang Ambaji Kulkarni ◽

G. Raghavendra Rao

Keyword(s):

Performance Evaluation ◽

Reinforcement Learning ◽

Ad Hoc ◽

Routing Algorithm

Download Full-text

Vehicle-Mounted Self-Organizing Network Routing Algorithm Based on Deep Reinforcement Learning

Wireless Communications and Mobile Computing ◽

10.1155/2021/9934585 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Shitong Ye ◽

Lijuan Xu ◽

Xiaomin Li

Keyword(s):

Performance Evaluation ◽

Reinforcement Learning ◽

Ad Hoc ◽

Routing Algorithm ◽

Network Routing ◽

Technical Problems ◽

Evaluation Indexes ◽

Vehicle Communication ◽

Vehicle Network ◽

Self Organizing

Through the research on the vehicle-mounted self-organizing network, in view of the current routing technical problems of the vehicle-mounted self-organizing network under the condition of no roadside auxiliary communication unit cooperation, this paper proposes a vehicle network routing algorithm based on deep reinforcement learning. For the problems of massive vehicle nodes and multiple performance evaluation indexes in vehicular ad hoc network, this paper proposes a time prediction model of vehicle communication to reduce the probability of communication interruption and proposes the routing technology of vehicle network by studying the deep reinforcement learning method. This technology can quickly select routing nodes and plan the optimal route according to the required performance evaluation indicators.

Download Full-text

A Survey and Performance Evaluation of Reinforcement Learning Based Spectrum Aware Routing in Cognitive Radio Ad Hoc Networks

International Journal of Wireless Information Networks ◽

10.1007/s10776-019-00463-6 ◽

2019 ◽

Vol 27 (1) ◽

pp. 144-163 ◽

Cited By ~ 2

Author(s):

Rashmi Naveen Raj ◽

Ashalatha Nayak ◽

M. Sathish Kumar

Keyword(s):

Performance Evaluation ◽

Reinforcement Learning ◽

Cognitive Radio ◽

Ad Hoc Networks ◽

Ad Hoc ◽

And Performance ◽

Hoc Networks

Download Full-text

Using fuzzy logic for performance evaluation in reinforcement learning

International Journal of Approximate Reasoning ◽

10.1016/s0888-613x(97)10007-x ◽

1998 ◽

Vol 18 (1-2) ◽

pp. 131-144 ◽

Cited By ~ 9

Author(s):

Hamid R. Berenji ◽

Pratap S. Khedkar

Keyword(s):

Fuzzy Logic ◽

Performance Evaluation ◽

Reinforcement Learning

Download Full-text

Learning Sparse Representations in Reinforcement Learning with Sparse Coding

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/287 ◽

2017 ◽

Author(s):

Lei Le ◽

Raksha Kumaraswamy ◽

Martha White

Keyword(s):

Reinforcement Learning ◽

Sparse Representation ◽

Policy Evaluation ◽

Sparse Coding ◽

Representation Learning ◽

Sparse Representations ◽

Learning Approaches ◽

Local Minima ◽

Global Minima ◽

Tile Coding

A variety of representation learning approaches have been investigated for reinforcement learning; much less attention, however, has been given to investigating the utility of sparse coding. Outside of reinforcement learning, sparse coding representations have been widely used, with non-convex objectives that result in discriminative representations. In this work, we develop a supervised sparse coding objective for policy evaluation. Despite the non-convexity of this objective, we prove that all local minima are global minima, making the approach amenable to simple optimization strategies. We empirically show that it is key to use a supervised objective, rather than the more straightforward unsupervised sparse coding approach. We then compare the learned representations to a canonical fixed sparse representation, called tile-coding, demonstrating that the sparse coding representation outperforms a wide variety of tile-coding representations.

Download Full-text

Performance Evaluation of AI Based Load Balancing Algorithm (Reinforcement Learning) with other load balancing algorithms in a JPPF Grid: E.coli Genome Sequence Alignment Problem

2018 International Conference on Bioinformatics and Systems Biology (BSB) ◽

10.1109/bsb.2018.8770657 ◽

2018 ◽

Author(s):

Subrata Sinha ◽

Abinash Hazarika ◽

Surabhi Johari

Keyword(s):

Performance Evaluation ◽

Reinforcement Learning ◽

Load Balancing ◽

Genome Sequence ◽

Sequence Alignment ◽

Alignment Problem ◽

Load Balancing Algorithm

Download Full-text