Peter Auer

121 - 130 von 132Seitengröße: 10

Sortieren nach: Titel

Publikationen

Veröffentlicht
UCB Revisited: Improved Regret Bounds for the Stochastic Multi-Armed Bandit Problem
Auer, P. & Ortner, R., 2010, in: Periodica Mathematica Hungarica. 61, S. 55-65
Publikationen: Beitrag in Fachzeitschrift › Artikel › Forschung › (peer-reviewed)
Veröffentlicht
Understanding the Gaps in Satisficing Bandits
Rouyer, C., Ortner, R. & Auer, P., 2024.
Publikationen: Konferenzbeitrag › Poster › Forschung › (peer-reviewed)
Veröffentlicht
Unification in the Combination of Disjoint Theories
Auer, P., 1991, Unification in the Combination of Disjoint Theories. S. 177-186
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
Upper-Confidence-Bound Algorithms for Active Learning in Mulit-armed Bandits
Auer, P., Carpentier, A., Lazaric, A., Ghavamzadeh, M. & Munos, R., 2011, The 22nd International Conference on Algorithmic Learning Theory. S. 189-203
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
Upper-confidence-bound algorithms for active learning in multi-armed bandits
Carpentier, A., Lazaric, A., Ghavamzadeh, M., Munos, R. & Auer, P., 20 Okt. 2011, Algorithmic Learning Theory - 22nd International Conference, ALT 2011, Proceedings. S. 189-203 15 S. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 6925 LNAI).
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
Using a spatio-temporal reasoning system to improve object models on the fly
Antenreiter, M., Prankl, J., Vincze, M. & Auer, P., 2009, 33rd Workshop of the Austrian Association for Pattern Recognition - Visual Learning. S. 25-36
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
Using confidence bounds for exploitation-exploration trade-offs
Auer, P., 2002, in: Journal of machine learning research (JMLR). S. 397-422
Publikationen: Beitrag in Fachzeitschrift › Artikel › Forschung › (peer-reviewed)
Veröffentlicht
Using Upper Confidence Bounds for Online Learning
Auer, P., 2000, 41th Annual Symposium on Foundations of Computer Science. S. 270-293
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
Variational Regret Bounds for Reinforcement Learning
Ortner, R., Gajane, P. & Auer, P., 2019.
Publikationen: Konferenzbeitrag › Paper › (peer-reviewed)
Veröffentlicht
Variational Regret Bounds for Reinforcement Learning
Ortner, R., Gajane, P. & Auer, P., 2019, Proceedings of The 35th Uncertainty in Artificial Intelligence Conference, UAI 2019. S. 81-90
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband

Vorherige 1...9 10 11 12 13 14 Nächste

Forschungsportal

Peter Auer

Publikationen

UCB Revisited: Improved Regret Bounds for the Stochastic Multi-Armed Bandit Problem

Understanding the Gaps in Satisficing Bandits

Unification in the Combination of Disjoint Theories

Upper-Confidence-Bound Algorithms for Active Learning in Mulit-armed Bandits

Upper-confidence-bound algorithms for active learning in multi-armed bandits

Using a spatio-temporal reasoning system to improve object models on the fly

Using confidence bounds for exploitation-exploration trade-offs

Using Upper Confidence Bounds for Online Learning

Variational Regret Bounds for Reinforcement Learning

Variational Regret Bounds for Reinforcement Learning

Kontakt

Forschungsportal

Peter Auer

Publikationen

Kontakt

Neuester Forschungsoutput