MIT Press
Publisher
1 - 5 out of 5Page size: 10
Research output
- 2012
- Published
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Ortner, R. & Ryabko, D., 2012, Advances in Neural Information Processing Systems 25. MIT Press, p. 1772-1780Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
PAC-Bayesian Analysis of Contextual Bandits
Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J. S. & Ortner, R., 2012, Advances in Neural Information Processing Systems 24. MIT Press, p. 1683-1691Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- 2009
- Published
Near-optimal Regret Bounds for Reinforcement Learning
Auer, P., Jaksch, T. & Ortner, R., 2009, Advances in neural information processing systems 21. MIT Press, p. 89-96Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- 2007
- Published
Logarithmic online regret bounds for undiscounted reinforcement learning
Auer, P. & Ortner, R., 2007, Advances in Neural Information Processing Systems 19. MIT Press, p. 49-56Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- 1996
- Published
Exponentially Many Local Minima for Single Neurons
Auer, P., Herbster, M. & Warmuth, M. K., 1996, Advances in Neural Information Processing System 8. MIT Press, p. 316-322Research output: Chapter in Book/Report/Conference proceeding › Conference contribution