Sciweavers

288 search results - page 18 / 58
» Learning to Play Chess Using Temporal Differences
ICANN
2010
Springer
15 years 4 months ago
Learning in a Unitary Coherent Hippocampus
Abstract. A previous paper [2] presented a model (UCPF-HC) of the hippocampus as a unitary coherent particle filter, which combines the classical hippocampal roles of associative m...
Charles W. Fox, Tony J. Prescott
CG
2002
Springer
15 years 6 months ago
Learning a Game Strategy Using Pattern-Weights and Self-play
Abstract. This paper demonstrates the use of pattern-weights in order to develop a strategy for an automated player of a non-cooperative version of the game of Diplomacy. Diplomacy...
Ari Shapiro, Gil Fuchs, Robert Levinson
NIPS
2001
15 years 7 months ago
Model-Free Least-Squares Policy Iteration
We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...
Michail G. Lagoudakis, Ronald Parr
ICML
2009
IEEE
16 years 7 months ago
Proto-predictive representation of states with simple recurrent temporal-difference networks
We propose a new neural network architecture, called Simple Recurrent Temporal-Difference Networks (SR-TDNs), that learns to predict future observations in partially observable en...
Takaki Makino
ROMAN
2007
IEEE
16 years 19 days ago
Learning and Recognition of Object Manipulation Actions Using Linear and Nonlinear Dimensionality Reduction
In this work, we perform an extensive statistical evaluation for learning and recognition of object manipulation actions. We concentrate on single arm/hand actions but study th...
Isabel Serrano Vicente, Danica Kragic, Jan-Olof Ek...