Chair of Information Technology (150)

Organisational unit: Chair

Research output

  1. 2012
  2. Published

    Online Regret Bounds for Undiscounted Continuous Reinforcement Learning

    Ortner, R. & Ryabko, D., 2012.

    Research output: Contribution to conference › Poster › Research › peer-review

  3. Published

    PAC Subset Selection in Stochastic Multi-armed Bandits

    Kalyanakrishnan, S., Tewari, A., Auer, P. & Stone, P., 2012, Proceedings of the 29th International Conference on Machine Learning, ICML 2012.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  4. Published

    PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits.

    Seldin, Y., Cesa-Bianchi, N., Auer, P., Laviolette, F. & Shawe-Taylor, J., 2012, Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. p. 98-111

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  5. Published

    PAC-Bayesian Analysis of Contextual Bandits

    Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J. S. & Ortner, R., 2012, Advances in Neural Information Processing Systems 24. MIT Press, p. 1683-1691

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  6. Published

    PAC-Bayesian Inequalities for Martingales

    Seldin, Y., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, UAI 2012.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  7. Published

    PAC-Bayesian Inequalities for Martingales.

    Seldin, Y., Laviolette, F., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, In: IEEE Transactions on Information Theory. 58, p. 7086-7093

    Research output: Contribution to journal › Article › Research › peer-review

  8. Published

    Regret Bounds for Restless Markov Bandits

    Ortner, R., Ryabko, D., Auer, P. & Munos, R., 2012, Algorithmic Learning Theory: 23rd International Conference, ALT 2012, Lyon, France, October 29-31, 2012. Proceedings. p. 214-228

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  9. 2011
  10. Published

    Adaptive bandits: Towards the best history-dependent strategy

    Maillard, O-A., 2011, Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. p. 570-578

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  11. Published
  12. Published

    Exploration and Exploitation in Online Learning

    Auer, P., 2011, International Conference on Adaptive and Intelligent Systems - ICAIS 2011. p. 2-2

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
