prioritized sweeping
Recently Published Documents


TOTAL DOCUMENTS: 9 (FIVE YEARS: 3)

H-INDEX: 4 (FIVE YEARS: 1)

2019 ◽ Vol 24 (2) ◽ pp. 621-632
Author(s): Zhi Wang, Chunlin Chen, Han-Xiong Li, Daoyi Dong, Tzyh-Jong Tarn

Author(s): Rahul M Desai, B P Patil

<p class="Default">In this paper, prioritized sweeping confidence based dual reinforcement learning based adaptive network routing is investigated. Shortest Path routing is always not suitable for any wireless mobile network as in high traffic conditions, shortest path will always select the shortest path which is in terms of number of hops, between source and destination thus generating more congestion. In prioritized sweeping reinforcement learning method, optimization is carried out over confidence based dual reinforcement routing on mobile ad hoc network and path is selected based on the actual traffic present on the network at real time. Thus they guarantee the least delivery time to reach the packets to the destination. Analysis is done on 50 Nodes Mobile ad hoc networks with random mobility. Various performance parameters such as Interval and number of nodes are used for judging the network. Packet delivery ratio, dropping ratio and delay shows optimum results using the prioritized sweeping reinforcement learning method.</p>


2003 ◽ Vol 19 ◽ pp. 569-629
Author(s): B. Price, C. Boutilier

Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an agent's ability to learn useful behaviors by making intelligent use of the knowledge implicit in behaviors demonstrated by cooperative teachers or other more experienced agents. We propose and study a formal model of implicit imitation that can accelerate reinforcement learning dramatically in certain cases. Roughly, by observing a mentor, a reinforcement-learning agent can extract information about its own capabilities in, and the relative value of, unvisited parts of the state space. We study two specific instantiations of this model, one in which the learning agent and the mentor have identical abilities, and one designed to deal with agents and mentors with different action sets. We illustrate the benefits of implicit imitation by integrating it with prioritized sweeping, and demonstrating improved performance and convergence through observation of single and multiple mentors. Though we make some stringent assumptions regarding observability and possible interactions, we briefly comment on extensions of the model that relax these restrictions.
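As a loose illustration of how observed mentor behavior can feed a prioritized-sweeping learner, the sketch below folds an observed transition into the learner's model and queues it for a high-priority backup, so value information can propagate from states the learner has never visited. It assumes the demonstrated action can be identified; the names (observe_mentor, a_inferred, and the data structures mirroring the sketch above) are hypothetical, and the paper's actual mechanism, which works from mentor state transitions alone and handles mismatched action sets, is not reproduced.

```python
import heapq

def observe_mentor(s, a_inferred, r, s_next, actions,
                   q, model, predecessors, pqueue,
                   gamma=0.95, theta=1e-3):
    """Fold one (inferred) mentor transition into the learner's own model."""
    # Treat the mentor's transition as if the learner had taken it itself,
    # as in the case where learner and mentor have identical abilities.
    model[(s, a_inferred)] = (r, s_next)
    predecessors[s_next].add((s, a_inferred))

    # Queue the demonstrated pair with its Bellman-error priority so the
    # regular prioritized-sweeping loop backs it up soon.
    best_next = max(q[(s_next, a)] for a in actions)
    priority = abs(r + gamma * best_next - q[(s, a_inferred)])
    if priority > theta:
        heapq.heappush(pqueue, (-priority, (s, a_inferred)))
```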


1993 ◽ Vol 13 (1) ◽ pp. 103-130
Author(s): Andrew W. Moore, Christopher G. Atkeson
