Ronald Ortner

Research output

  1. Published

    Logarithmic online regret bounds for undiscounted reinforcement learning

    Auer, P. & Ortner, R., 2006.

    Research output: Contribution to conferencePosterResearchpeer-review

  2. Published

    Logarithmic online regret bounds for undiscounted reinforcement learning

    Auer, P. & Ortner, R., 2007, Advances in Neural Information Processing Systems 19. MIT Press, p. 49-56

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  3. Published

    Near-optimal Regret Bounds for Reinforcement Learning

    Auer, P., Jaksch, T. & Ortner, R., 2009, Advances in neural information processing systems 21. MIT Press, p. 89-96

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  4. Published

    UCB Revisited: Improved Regret Bounds for the Stochastic Multi-Armed Bandit Problem

    Auer, P. & Ortner, R., 2010, In: Periodica Mathematica Hungarica. 61, p. 55-65

    Research output: Contribution to journalArticleResearchpeer-review

  5. Published

    Near-optimal Regret Bounds for Reinforcement Learning

    Auer, P., Jaksch, T. & Ortner, R., 2008.

    Research output: Contribution to conferencePosterResearchpeer-review

  6. Published

    Pareto Front Identification from Stochastic Bandit Feedback

    Auer, P., Chiang, C.-K., Ortner, R. & Drugan, M., 2016, Proceedings of the Nineteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2016. p. 939-947 (JMLR Workshop and Conference Proceedings).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  7. Published

    Adaptively Tracking the Best Arm with an Unknown Number of Distribution Changes

    Auer, P., Gajane, P. & Ortner, R., 2018.

    Research output: Contribution to conferencePaperpeer-review

  8. Published

    Adaptively Tracking the Best Arm with an Unknown Number of Distribution Changes

    Auer, P., Gajane, P. & Ortner, R., 2018.

    Research output: Contribution to conferencePosterResearchpeer-review

  9. Published

    Adaptively Tracking the Best Bandit Arm with an Unknown Number of Distribution Changes

    Auer, P., Gajane, P. & Ortner, R., 27 Jun 2019.

    Research output: Contribution to conferencePosterResearchpeer-review

  10. Published

    Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information

    Auer, P., Chen, Y., Gajane, P., Lee, C.-W., Luo, H., Ortner, R. & Wei, C.-Y., 2019.

    Research output: Contribution to conferenceAbstractpeer-review