Peter Auer
Research output
- 2013
- Published
Reinforcement Learning (Dagstuhl Seminar 13321)
Auer, P., 2013, In: Dagstuhl Reports. 3, p. 1-26Research output: Contribution to journal › Article › Research › peer-review
- 2012
- Published
Autonomous Exploration For Navigating In MDPs.
Lim, S. H. & Auer, P., 2012, Proceedings of the 25th Annual Conference on Learning Theory. p. 40.1-40.24Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments
Seldin, Y., Szepesvári, C., Auer, P. & Abbasi-Yadkori, Y., 2012, Proceedings of the Tenth European Workshop on Reinforcement Learning, EWRL 2012. p. 103-116 (JMLR proceedings).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits.
Seldin, Y., Cesa-Bianchi, N., Auer, P., Laviolette, F. & Shawe-Taylor, J., 2012, Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. p. 98-111Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
PAC-Bayesian Analysis of Contextual Bandits
Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J. S. & Ortner, R., 2012, Advances in Neural Information Processing Systems 24. MIT Press, p. 1683-1691Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
PAC-Bayesian Inequalities for Martingales
Seldin, Y., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, UAI 2012.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
PAC-Bayesian Inequalities for Martingales.
Seldin, Y., Laviolette, F., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, In: IEEE transactions on information theory. 58, p. 7086-7093Research output: Contribution to journal › Article › Research › peer-review
- Published
PAC Subset Selection in Stochastic Multi-armed Bandits
Kalyanakrishnan, S., Tewari, A., Auer, P. & Stone, P., 2012, Proceedings of the 29th International Conference on Machine Learning, ICML 2012.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Regret Bounds for Restless Markov Bandits
Ortner, R., Ryabko, D., Auer, P. & Munos, R., 2012, Algorithmic Learning Theory 23rd International Conference, ALT 2012, Lyon, France, October 29-31, 2012. Proceedings. p. 214-228Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- 2011
- Published
Upper-confidence-bound algorithms for active learning in multi-armed bandits
Carpentier, A., Lazaric, A., Ghavamzadeh, M., Munos, R. & Auer, P., 20 Oct 2011, Algorithmic Learning Theory - 22nd International Conference, ALT 2011, Proceedings. p. 189-203 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6925 LNAI).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution