Search Sciweavers | Sciweavers

515 search results - page 68 / 103

» Approximating Markov Processes by Averaging

JMLR
2010

125 views

Variational methods for Reinforcement Learning

15 years 1 month ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

ICML
2003
IEEE

124 views, Machine Learning

Exploration in Metric State Spaces

16 years 7 months ago

Download www.cis.upenn.edu

We present metric-E³, a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

ICPR
2008
IEEE

184 views, Computer Vision

Direct 3-D shape recovery from image sequence based on multi-scale Bayesian network

16 years 1 months ago

Download figment.cse.usf.edu

We propose a new method for recovering a 3-D object shape from an image sequence. In order to recover high-resolution relative depth without using the complex Markov random field...

Norio Tagawa, Junya Kawaguchi, Shoichi Naganuma, K...

3DPVT
2006
IEEE

176 views, Visualization

Belief Propagation for Panorama Generation

16 years 22 days ago

Download people.scs.carleton.ca

We present an algorithm for generating panoramic images of complex scenes from a multi-sensor camera. We further present a programmable graphics hardware implementation to process...

Alan Brunton, Chang Shu

IJCAI
2007

147 views, Artificial Intelligence

The Value of Observation for Monitoring Dynamic Systems

15 years 8 months ago

Download ijcai.org

We consider the fundamental problem of monitoring (i.e. tracking) the belief state in a dynamic system, when the model is only approximately correct and when the initial belief st...

Eyal Even-Dar, Sham M. Kakade, Yishay Mansour
