Ronald Ortner
Research output
- Published
Logarithmic online regret bounds for undiscounted reinforcement learning
Auer, P. & Ortner, R., 2006. Research output: Contribution to conference › Poster › Research › peer-review
- Published
Logarithmic online regret bounds for undiscounted reinforcement learning
Auer, P. & Ortner, R., 2007, Advances in Neural Information Processing Systems 19. MIT Press, p. 49-56. Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Near-optimal Regret Bounds for Reinforcement Learning
Auer, P., Jaksch, T. & Ortner, R., 2009, Advances in Neural Information Processing Systems 21. MIT Press, p. 89-96. Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
UCB Revisited: Improved Regret Bounds for the Stochastic Multi-Armed Bandit Problem
Auer, P. & Ortner, R., 2010, In: Periodica Mathematica Hungarica. 61, p. 55-65. Research output: Contribution to journal › Article › Research › peer-review
- Published
Near-optimal Regret Bounds for Reinforcement Learning
Auer, P., Jaksch, T. & Ortner, R., 2008. Research output: Contribution to conference › Poster › Research › peer-review
- Published
Pareto Front Identification from Stochastic Bandit Feedback
Auer, P., Chiang, C.-K., Ortner, R. & Drugan, M., 2016, Proceedings of the Nineteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2016. p. 939-947 (JMLR Workshop and Conference Proceedings). Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Adaptively Tracking the Best Arm with an Unknown Number of Distribution Changes
Auer, P., Gajane, P. & Ortner, R., 2018. Research output: Contribution to conference › Paper › peer-review
- Published
Adaptively Tracking the Best Arm with an Unknown Number of Distribution Changes
Auer, P., Gajane, P. & Ortner, R., 2018. Research output: Contribution to conference › Poster › Research › peer-review
- Published
Adaptively Tracking the Best Bandit Arm with an Unknown Number of Distribution Changes
Auer, P., Gajane, P. & Ortner, R., 27 Jun 2019. Research output: Contribution to conference › Poster › Research › peer-review
- Published
Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information
Auer, P., Chen, Y., Gajane, P., Lee, C.-W., Luo, H., Ortner, R. & Wei, C.-Y., 2019. Research output: Contribution to conference › Abstract › peer-review