Department Mathematics and Information Technology

Organisational unit: Departments and Institutes

Research output

  1. 2012
  2. Published

    Online Regret Bounds for Undiscounted Continuous Reinforcement Learning

    Ortner, R. & Ryabko, D., 2012.

    Research output: Contribution to conference > Poster > Research > peer-review

  3. Published
  4. Published

    PAC Subset Selection in Stochastic Multi-armed Bandits

    Kalyanakrishnan, S., Tewari, A., Auer, P. & Stone, P., 2012, Proceedings of the 29th International Conference on Machine Learning, ICML 2012.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  5. Published

    PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits.

    Seldin, Y., Cesa-Bianchi, N., Auer, P., Laviolette, F. & Shawe-Taylor, J., 2012, Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. p. 98-111

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  6. Published

    PAC-Bayesian Analysis of Contextual Bandits

    Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J. S. & Ortner, R., 2012, Advances in Neural Information Processing Systems 24. MIT Press, p. 1683-1691

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  7. Published

    PAC-Bayesian Inequalities for Martingales

    Seldin, Y., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, UAI 2012.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  8. Published

    PAC-Bayesian Inequalities for Martingales.

    Seldin, Y., Laviolette, F., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, In: IEEE Transactions on Information Theory. 58, p. 7086-7093

    Research output: Contribution to journal > Article > Research > peer-review

  9. Published

    Pathwise space approximations of semi-linear parabolic SPDEs with multiplicative noise

    Hausenblas, E., 2012, In: International Journal of Computer Mathematics. 89, p. 2460-2478

    Research output: Contribution to journal > Article > Research > peer-review

  10. Published

    Prognose von Produktionsmengen mit neuronalen Netzen [Forecasting Production Quantities with Neural Networks]

    Wieser, D., 2012

    Research output: Thesis > Master's Thesis

  11. Published

    Regret Bounds for Restless Markov Bandits

    Ortner, R., Ryabko, D., Auer, P. & Munos, R., 2012, Algorithmic Learning Theory 23rd International Conference, ALT 2012, Lyon, France, October 29-31, 2012. Proceedings. p. 214-228

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution