Sciweavers

2108 search results - page 36 / 422
» Tracking in Reinforcement Learning
Sort
View
ATAL
2009
Springer
16 years 1 months ago
Learning with whom to communicate using relational reinforcement learning
Marc J. V. Ponsen, Tom Croonenborghs, Karl Tuyls, ...
NIPS
1998
15 years 7 months ago
Gradient Descent for General Reinforcement Learning
A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...
Leemon C. Baird III, Andrew W. Moore
AUSAI
2005
Springer
16 years 22 hour ago
Global Versus Local Constructive Function Approximation for On-Line Reinforcement Learning
: In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to be combined with function approximation techniques. The majority of...
Peter Vamplew, Robert Ollington