Sciweavers

1176 search results - page 12 / 236
» Sparse reward processes
Sort
View
COLING
2010
15 years 1 months ago
Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes
This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...
Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...
ICASSP
2011
IEEE
14 years 10 months ago
Logarithmic weak regret of non-Bayesian restless multi-armed bandit
Abstract—We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. At each time, a player chooses K out of N (N > K) arms to play. The state of each ar...
Haoyang Liu, Keqin Liu, Qing Zhao
ICA
2010
Springer
15 years 6 months ago
SMALLbox - An Evaluation Framework for Sparse Representations and Dictionary Learning Algorithms
SMALLbox is a new foundational framework for processing signals, using adaptive sparse structured representations. The main aim of SMALLbox is to become a test ground for explorati...
Ivan Damnjanovic, Matthew E. P. Davies, Mark D. Pl...
EUROCAST
2007
Springer
182views Hardware» more  EUROCAST 2007»
16 years 19 days ago
A k-NN Based Perception Scheme for Reinforcement Learning
Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...