Sciweavers

1233 search results - page 110 / 247
» Reinforcement Learning in MirrorBot
ICML
2010
IEEE
15 years 4 months ago
Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda
Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...
Carlton Downey, Scott Sanner
ROBOCUP
2005
Springer
16 years 6 days ago
Simultaneous Learning to Acquire Competitive Behaviors in Multi-agent System Based on Modular Learning System
The existing reinforcement learning approaches have been suffering from the policy alternation of others in multiagent dynamic environments. A typical example is a case of RoboCup...
Yasutake Takahashi, Kazuhiro Edazawa, Kentarou Nom...
ATAL
2009
Springer
16 years 1 month ago
Generalized model learning for reinforcement learning in factored domains
Improving the sample efficiency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-ba...
Todd Hester, Peter Stone
ICML
1998
IEEE
16 years 7 months ago
RL-TOPs: An Architecture for Modularity and Re-Use in Reinforcement Learning
This paper introduces the RL-TOPs architecture for robot learning, a hybrid system combining teleo-reactive planning and reinforcement learning techniques. The aim of this system ...
Malcolm R. K. Ryan, Mark D. Pendrith
ICML
2005
IEEE
16 years 7 months ago
Learning strategies for story comprehension: a reinforcement learning approach
This paper describes the use of machine learning to improve the performance of natural language question answering systems. We present a model for improving story comprehension th...
Eugene Grois, David C. Wilkins