Sciweavers

397 search results - page 60 / 80
» Reinforcement Learning with Hierarchies of Machines
Sort
View
COLT
2010
Springer
15 years 4 months ago
An Asymptotically Optimal Bandit Algorithm for Bounded Support Models
Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...
Junya Honda, Akimichi Takemura
ICML
2005
IEEE
16 years 7 months ago
Combining model-based and instance-based learning for first order regression
T ORDER REGRESSION (EXTENDED ABSTRACT) Kurt Driessensa Saso Dzeroskib a Department of Computer Science, University of Waikato, Hamilton, New Zealand (kurtd@waikato.ac.nz) b Departm...
Kurt Driessens, Saso Dzeroski
ECCV
2008
Springer
16 years 8 months ago
Training Hierarchical Feed-Forward Visual Recognition Models Using Transfer Learning from Pseudo-Tasks
Abstract. Building visual recognition models that adapt across different domains is a challenging task for computer vision. While feature-learning machines in the form of hierarchi...
Amr Ahmed, Kai Yu, Wei Xu, Yihong Gong, Eric P. Xi...
IROS
2006
IEEE
147views Robotics» more  IROS 2006»
16 years 21 days ago
A Hybrid Control Architecture for Autonomous Robotic Fish
— This paper presents a hybrid control architecture for autonomous robotic fishes which are able to swim and navigate in unknown or dynamically changing environments. It has a t...
Jindong Liu, Huosheng Hu, Dongbing Gu
153
Voted
ICRA
1994
IEEE
105views Robotics» more  ICRA 1994»
15 years 10 months ago
Harmonic Functions and Collision Probabilities
There is a close relationship between harmonic functions { which have recently been proposed for path planning { and hitting probabilities for random processes. The hitting probab...
Christopher I. Connolly