Chair of Information Technology (150)
Organisational unit: Chair
Research output
- 2012
- Published
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Ortner, R. & Ryabko, D., 2012.Research output: Contribution to conference › Poster › Research › peer-review
- Published
PAC Subset Selection in Stochastic Multi-armed Bandits
Kalyanakrishnan, S., Tewari, A., Auer, P. & Stone, P., 2012, Proceedings of the 29th International Conference on Machine Learning, ICML 2012.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits.
Seldin, Y., Cesa-Bianchi, N., Auer, P., Laviolette, F. & Shawe-Taylor, J., 2012, Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. p. 98-111Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
PAC-Bayesian Analysis of Contextual Bandits
Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J. S. & Ortner, R., 2012, Advances in Neural Information Processing Systems 24. MIT Press, p. 1683-1691Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
PAC-Bayesian Inequalities for Martingales
Seldin, Y., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, UAI 2012.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
PAC-Bayesian Inequalities for Martingales.
Seldin, Y., Laviolette, F., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, In: IEEE transactions on information theory. 58, p. 7086-7093Research output: Contribution to journal › Article › Research › peer-review
- Published
Regret Bounds for Restless Markov Bandits
Ortner, R., Ryabko, D., Auer, P. & Munos, R., 2012, Algorithmic Learning Theory 23rd International Conference, ALT 2012, Lyon, France, October 29-31, 2012. Proceedings. p. 214-228Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- 2011
- Published
Adaptive bandits: Towards the best history-dependent strategy
Maillard, O-A., 2011, Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. p. 570-578Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Analyse, Bewertung und Verbesserung der Statistiken eines Warehouse Control Systems
Schlögl, D., 2011Research output: Thesis › Master's Thesis
- Published
Exploration and Exploitation in Online Learning
Auer, P., 2011, International Conference on Adaptive and Intelligent Symstems - ICAIS 2011. p. 2-2Research output: Chapter in Book/Report/Conference proceeding › Conference contribution