Sciweavers

JMLR
2010
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
IJCAI
2003
Generalizing Plans to New Environments in Relational MDPs
A longstanding goal in planning research is the ability to generalize plans developed for some set of environments to a new but similar environment, with minimal or no replanning....
Carlos Guestrin, Daphne Koller, Chris Gearhart, Ne...
CGF
2004
Prototype Modeling from Sketched Silhouettes based on Convolution Surfaces
This paper presents a hybrid method for creating three-dimensional shapes by sketching silhouette curves. Given a silhouette curve, we approximate its medial axis as a set of line...
Chiew-Lan Tai, Hongxin Zhang, Jacky Chun-Kin Fong
JMLR
2006
Collaborative Multiagent Reinforcement Learning by Payoff Propagation
In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...
Jelle R. Kok, Nikos A. Vlassis
JMLR
2010
Expectation Truncation and the Benefits of Preselection in Training Generative Models
We show how a preselection of hidden variables can be used to efficiently train generative models with binary hidden variables. The approach is based on Expectation Maximization (...
Jörg Lücke, Julian Eggert