Search Sciweavers | Sciweavers

162 search results - page 21 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...

160

Voted

ICASSP
2009
IEEE

125views Signal Processing» more ICASSP 2009»

Fast belief propagation process element for high-quality stereo estimation

15 years 4 months ago

Download mpac.ee.ntu.edu.tw

Belief propagation is a popular global optimization technique for many computer vision problems. However, it requires extensive computation due to the iterative message passing op...

Chao-Chung Cheng, Chia-Kai Liang, Yen-Chieh Lai, H...

claim paper

Read More »

151

Voted

CDC
2008
IEEE

118views Control Systems» more CDC 2008»

A density projection approach to dimension reduction for continuous-state POMDPs

16 years 26 days ago

Download netfiles.uiuc.edu

Abstract— Research on numerical solution methods for partially observable Markov decision processes (POMDPs) has primarily focused on discrete-state models, and these algorithms ...

Enlu Zhou, Michael C. Fu, Steven I. Marcus

claim paper

Read More »

178

Voted

CONCUR
2006
Springer

159views Distributed And Parallel Com...» more CONCUR 2006»

Strategy Improvement for Stochastic Rabin and Streett Games

15 years 10 months ago

Download mtc.epfl.ch

A stochastic graph game is played by two players on a game graph with probabilistic transitions. We consider stochastic graph games with -regular winning conditions specified as Ra...

Krishnendu Chatterjee, Thomas A. Henzinger

claim paper

Read More »

166

click to vote

ATAL
2006
Springer

157views Intelligent Agents» more ATAL 2006»

Decentralized planning under uncertainty for teams of communicating agents

15 years 10 months ago

Download www.cs.cmu.edu

Decentralized partially observable Markov decision processes (DEC-POMDPs) form a general framework for planning for groups of cooperating agents that inhabit a stochastic and part...

Matthijs T. J. Spaan, Geoffrey J. Gordon, Nikos A....

claim paper

Read More »

164

Voted

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 7 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

« Prev « First page 21 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers