Search Sciweavers | Sciweavers

1106 search results - page 125 / 222

» On regularization algorithms in learning theory

AIIA
2005
Springer

114 views, Artificial Intelligence

Experimental Evaluation of Hierarchical Hidden Markov Models

16 years 9 days ago

Download www.ugogalassi.net

Building profiles for processes and for interactive users is an important task in intrusion detection. This paper presents the results obtained with a Hierarchical Hidden Markov Mo...

Attilio Giordana, Ugo Galassi, Lorenza Saitta

ICML
2010
IEEE

219 views, Machine Learning

Convergence of Least Squares Temporal Difference Methods Under General Conditions

15 years 7 months ago

Download www.cs.helsinki.fi

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...

Huizhen Yu

ICC
2007
IEEE

120 views, Communications

Dynamic Network Selection using Kernels

16 years 1 month ago

Download www.prism.uvsq.fr

We present a new algorithm for vertical handover and dynamic network selection, based on a combination of multiattribute utility theory, kernel learning and stochastic gradient ...

Eric van den Berg, Praveen Gopalakrishnan, Byungsu...

ICML
2009
IEEE

98 views, Machine Learning

On primal and dual sparsity of Markov networks

16 years 7 months ago

Download www.cs.cmu.edu

Sparsity is a desirable property in high dimensional learning. The 1-norm regularization can lead to primal sparsity, while max-margin methods achieve dual sparsity. Combining the...

Jun Zhu, Eric P. Xing

ECML
2005
Springer

193 views, Machine Learning

Natural Actor-Critic

16 years 9 days ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal
