Research output

  1. 2012
  2. Published

    Autonomous Exploration For Navigating In MDPs.

    Lim, S. H. & Auer, P., 2012, Proceedings of the 25th Annual Conference on Learning Theory. p. 40.1-40.24

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  3. Published

    Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments

    Seldin, Y., Szepesvári, C., Auer, P. & Abbasi-Yadkori, Y., 2012, Proceedings of the Tenth European Workshop on Reinforcement Learning, EWRL 2012. p. 103-116 (JMLR proceedings).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  4. Published

    Online Optimization with Gradual Variations

    Chiang, C-K., 2012, COLT 2012: Proceedings of the 25th Annual Conference on Learning Theory June 25-27, 2012, Edinburgh, Scotland. Mannor, S., Srebro, N. & Willamson, R. C. (eds.). p. 6.1-6.20 (JMLR Workshop and Conference Proceedings; vol. 23).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  5. Published

    PAC Subset Selection in Stochastic Multi-armed Bandits

    Kalyanakrishnan, S., Tewari, A., Auer, P. & Stone, P., 2012, Proceedings of the 29th International Conference on Machine Learning, ICML 2012.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  6. Published

    PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits.

    Seldin, Y., Cesa-Bianchi, N., Auer, P., Laviolette, F. & Shawe-Taylor, J., 2012, Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. p. 98-111

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  7. Published

    PAC-Bayesian Analysis of Contextual Bandits

    Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J. S. & Ortner, R., 2012, Advances in Neural Information Processing Systems 24. MIT Press, p. 1683-1691

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  8. Published

    PAC-Bayesian Inequalities for Martingales

    Seldin, Y., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, UAI 2012.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  9. Published

    PAC-Bayesian Inequalities for Martingales.

    Seldin, Y., Laviolette, F., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, In: IEEE transactions on information theory. 58, p. 7086-7093

    Research output: Contribution to journalArticleResearchpeer-review

  10. Published

    Regret Bounds for Restless Markov Bandits

    Ortner, R., Ryabko, D., Auer, P. & Munos, R., 2012, Algorithmic Learning Theory 23rd International Conference, ALT 2012, Lyon, France, October 29-31, 2012. Proceedings. p. 214-228

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  11. 2011
  12. Published

    Upper-confidence-bound algorithms for active learning in multi-armed bandits

    Carpentier, A., Lazaric, A., Ghavamzadeh, M., Munos, R. & Auer, P., 20 Oct 2011, Algorithmic Learning Theory - 22nd International Conference, ALT 2011, Proceedings. p. 189-203 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6925 LNAI).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

Previous 1 2 3 4 5 6 7 8 ...14 Next