Search Sciweavers | Sciweavers

» Online Optimization in X-Armed Bandits

NIPS
2004

Nearly Tight Bounds for the Continuum-Armed Bandit Problem

15 years 8 months ago

Download books.nips.cc

In the multi-armed bandit problem, an online algorithm must choose from a set of strategies in a sequence of n trials so as to minimize the total cost of the chosen strategies. Wh...

Robert D. Kleinberg

CORR
2010
Springer

Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards

15 years 6 months ago

Download wireless.cs.uh.edu

We consider the classical multi-armed bandit problem with Markovian rewards. When played an arm changes its state in a Markovian fashion while it remains frozen when not played. Th...

Cem Tekin, Mingyan Liu

CORR
2010
Springer

Online Learning in Opportunistic Spectrum Access: A Restless Bandit Approach

15 years 1 month ago

Download www.eecs.umich.edu

We consider an opportunistic spectrum access (OSA) problem where the time-varying condition of each channel (e.g., as a result of random fading or certain primary users' activ...

Cem Tekin, Mingyan Liu

COLT
2008
Springer

Regret Bounds for Sleeping Experts and Bandits

15 years 8 months ago

Download colt2008.cs.helsinki.fi

We study on-line decision problems where the set of actions that are available to the decision algorithm vary over time. With a few notable exceptions, such problems remained larg...

Robert D. Kleinberg, Alexandru Niculescu-Mizil, Yo...

LION
2010
Springer

Algorithm Selection as a Bandit Problem with Unbounded Losses

15 years 10 months ago

Download como.vub.ac.be

Abstract. Algorithm selection is typically based on models of algorithm performance learned during a separate offline training sequence, which can be prohibitively expensive. In r...

Matteo Gagliolo, Jürgen Schmidhuber
