Peter Auer

121 - 130 out of 132Page size: 10

Sort by: Title

Research output

Published
UCB Revisited: Improved Regret Bounds for the Stochastic Multi-Armed Bandit Problem
Auer, P. & Ortner, R., 2010, In: Periodica Mathematica Hungarica. 61, p. 55-65
Research output: Contribution to journal › Article › Research › peer-review
Published
Understanding the Gaps in Satisficing Bandits
Rouyer, C., Ortner, R. & Auer, P., 2024.
Research output: Contribution to conference › Poster › Research › peer-review
Published
Unification in the Combination of Disjoint Theories
Auer, P., 1991, Unification in the Combination of Disjoint Theories. p. 177-186
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Upper-Confidence-Bound Algorithms for Active Learning in Mulit-armed Bandits
Auer, P., Carpentier, A., Lazaric, A., Ghavamzadeh, M. & Munos, R., 2011, The 22nd International Conference on Algorithmic Learning Theory. p. 189-203
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Upper-confidence-bound algorithms for active learning in multi-armed bandits
Carpentier, A., Lazaric, A., Ghavamzadeh, M., Munos, R. & Auer, P., 20 Oct 2011, Algorithmic Learning Theory - 22nd International Conference, ALT 2011, Proceedings. p. 189-203 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6925 LNAI).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Using a spatio-temporal reasoning system to improve object models on the fly
Antenreiter, M., Prankl, J., Vincze, M. & Auer, P., 2009, 33rd Workshop of the Austrian Association for Pattern Recognition - Visual Learning. p. 25-36
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Using confidence bounds for exploitation-exploration trade-offs
Auer, P., 2002, In: Journal of machine learning research (JMLR). p. 397-422
Research output: Contribution to journal › Article › Research › peer-review
Published
Using Upper Confidence Bounds for Online Learning
Auer, P., 2000, 41th Annual Symposium on Foundations of Computer Science. p. 270-293
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Variational Regret Bounds for Reinforcement Learning
Ortner, R., Gajane, P. & Auer, P., 2019.
Research output: Contribution to conference › Paper › peer-review
Published
Variational Regret Bounds for Reinforcement Learning
Ortner, R., Gajane, P. & Auer, P., 2019, Proceedings of The 35th Uncertainty in Artificial Intelligence Conference, UAI 2019. p. 81-90
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Previous 1...9 10 11 12 13 14 Next

Research Portal

Peter Auer

Research output

UCB Revisited: Improved Regret Bounds for the Stochastic Multi-Armed Bandit Problem

Understanding the Gaps in Satisficing Bandits

Unification in the Combination of Disjoint Theories

Upper-Confidence-Bound Algorithms for Active Learning in Mulit-armed Bandits

Upper-confidence-bound algorithms for active learning in multi-armed bandits

Using a spatio-temporal reasoning system to improve object models on the fly

Using confidence bounds for exploitation-exploration trade-offs

Using Upper Confidence Bounds for Online Learning

Variational Regret Bounds for Reinforcement Learning

Variational Regret Bounds for Reinforcement Learning

Contact

Research Portal

Peter Auer

Research output

Contact

Latest Research Output