Peter Auer
Research output
- Published
UCB Revisited: Improved Regret Bounds for the Stochastic Multi-Armed Bandit Problem
Auer, P. & Ortner, R., 2010, In: Periodica Mathematica Hungarica. 61, p. 55-65. Research output: Contribution to journal › Article › Research › peer-review
- Published
Understanding the Gaps in Satisficing Bandits
Rouyer, C., Ortner, R. & Auer, P., 2024. Research output: Contribution to conference › Poster › Research › peer-review
- Published
Unification in the Combination of Disjoint Theories
Auer, P., 1991, Unification in the Combination of Disjoint Theories. p. 177-186. Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits
Auer, P., Carpentier, A., Lazaric, A., Ghavamzadeh, M. & Munos, R., 2011, The 22nd International Conference on Algorithmic Learning Theory. p. 189-203. Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Upper-confidence-bound algorithms for active learning in multi-armed bandits
Carpentier, A., Lazaric, A., Ghavamzadeh, M., Munos, R. & Auer, P., 20 Oct 2011, Algorithmic Learning Theory - 22nd International Conference, ALT 2011, Proceedings. p. 189-203 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6925 LNAI). Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Using a spatio-temporal reasoning system to improve object models on the fly
Antenreiter, M., Prankl, J., Vincze, M. & Auer, P., 2009, 33rd Workshop of the Austrian Association for Pattern Recognition - Visual Learning. p. 25-36. Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Using confidence bounds for exploitation-exploration trade-offs
Auer, P., 2002, In: Journal of Machine Learning Research (JMLR). p. 397-422. Research output: Contribution to journal › Article › Research › peer-review
- Published
Using Upper Confidence Bounds for Online Learning
Auer, P., 2000, 41st Annual Symposium on Foundations of Computer Science. p. 270-293. Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Variational Regret Bounds for Reinforcement Learning
Ortner, R., Gajane, P. & Auer, P., 2019. Research output: Contribution to conference › Paper › peer-review
- Published
Variational Regret Bounds for Reinforcement Learning
Ortner, R., Gajane, P. & Auer, P., 2019, Proceedings of the 35th Uncertainty in Artificial Intelligence Conference, UAI 2019. p. 81-90. Research output: Chapter in Book/Report/Conference proceeding › Conference contribution