scholarly journals Efficient Algorithm for Reinforcement Learning with Partial Variation

2006 ◽  
Vol 42 (8) ◽  
pp. 959-965
Author(s):  
Kei SENDA ◽  
Shinji FUJII
2012 ◽  
Vol 220-223 ◽  
pp. 2772-2776
Author(s):  
Zhao Hui Hu

This paper presents reinforcement learning (RL) algorithm for routing strategy considering the state of network link, which can be deemed as a dynamic programming problem with stochastic needs. Through modeling those four elements and experiments, we draw the conclusion that upon the state of network link, RL is an efficient algorithm for routing strategy; the data can be efficient forwarded to the destination.


2004 ◽  
Vol 15 (09) ◽  
pp. 1235-1247 ◽  
Author(s):  
M. ANDRECUT ◽  
M. K. ALI

The eligibility trace is the most important mechanism used so far in reinforcement learning to handle delayed reward. Here, we introduce a new kind of eligibility trace, the goal-directed trace, and show that it results in more reliable learning than the conventional trace. In addition, we also propose a new efficient algorithm for solving the goal-directed reinforcement learning problem.


Author(s):  
P.J. Phillips ◽  
J. Huang ◽  
S. M. Dunn

In this paper we present an efficient algorithm for automatically finding the correspondence between pairs of stereo micrographs, the key step in forming a stereo image. The computation burden in this problem is solving for the optimal mapping and transformation between the two micrographs. In this paper, we present a sieve algorithm for efficiently estimating the transformation and correspondence.In a sieve algorithm, a sequence of stages gradually reduce the number of transformations and correspondences that need to be examined, i.e., the analogy of sieving through the set of mappings with gradually finer meshes until the answer is found. The set of sieves is derived from an image model, here a planar graph that encodes the spatial organization of the features. In the sieve algorithm, the graph represents the spatial arrangement of objects in the image. The algorithm for finding the correspondence restricts its attention to the graph, with the correspondence being found by a combination of graph matchings, point set matching and geometric invariants.


Decision ◽  
2016 ◽  
Vol 3 (2) ◽  
pp. 115-131 ◽  
Author(s):  
Helen Steingroever ◽  
Ruud Wetzels ◽  
Eric-Jan Wagenmakers

Sign in / Sign up

Export Citation Format

Share Document