Chair of Information Technology (150)

Organisational unit: Chair

91 - 100 out of 215Page size: 10

Sort by: Publication date

Research output

2012
Published
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Ortner, R. & Ryabko, D., 2012.
Research output: Contribution to conference › Poster › Research › peer-review
Published
PAC Subset Selection in Stochastic Multi-armed Bandits
Kalyanakrishnan, S., Tewari, A., Auer, P. & Stone, P., 2012, Proceedings of the 29th International Conference on Machine Learning, ICML 2012.
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits.
Seldin, Y., Cesa-Bianchi, N., Auer, P., Laviolette, F. & Shawe-Taylor, J., 2012, Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. p. 98-111
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
PAC-Bayesian Analysis of Contextual Bandits
Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J. S. & Ortner, R., 2012, Advances in Neural Information Processing Systems 24. MIT Press, p. 1683-1691
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
PAC-Bayesian Inequalities for Martingales
Seldin, Y., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, UAI 2012.
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
PAC-Bayesian Inequalities for Martingales.
Seldin, Y., Laviolette, F., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, In: IEEE transactions on information theory. 58, p. 7086-7093
Research output: Contribution to journal › Article › Research › peer-review
Published
Regret Bounds for Restless Markov Bandits
Ortner, R., Ryabko, D., Auer, P. & Munos, R., 2012, Algorithmic Learning Theory 23rd International Conference, ALT 2012, Lyon, France, October 29-31, 2012. Proceedings. p. 214-228
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
2011
Published
Adaptive bandits: Towards the best history-dependent strategy
Maillard, O-A., 2011, Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. p. 570-578
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Analyse, Bewertung und Verbesserung der Statistiken eines Warehouse Control Systems
Schlögl, D., 2011
Research output: Thesis › Master's Thesis
Published
Exploration and Exploitation in Online Learning
Auer, P., 2011, International Conference on Adaptive and Intelligent Symstems - ICAIS 2011. p. 2-2
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Previous 1...6 7 8 9 10 11 12 13 ...22 Next

Research Portal

Chair of Information Technology (150)

Research output

Online Regret Bounds for Undiscounted Continuous Reinforcement Learning

PAC Subset Selection in Stochastic Multi-armed Bandits

PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits.

PAC-Bayesian Analysis of Contextual Bandits

PAC-Bayesian Inequalities for Martingales

PAC-Bayesian Inequalities for Martingales.

Regret Bounds for Restless Markov Bandits

Adaptive bandits: Towards the best history-dependent strategy

Analyse, Bewertung und Verbesserung der Statistiken eines Warehouse Control Systems

Exploration and Exploitation in Online Learning

Contact information