Chair of Information Technology (150)
Organisational unit: Chair
Research output
- 2013
- Published
Beating Bandits in Gradually Evolving Worlds
Chiang, C-K., 2013, Conference on Learning Theory. Shalev-Shwartz, S. & Steinwart, I. (eds.). p. 210-227 (JMLR Workshop and Conference Proceedings; vol. 30).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Competing with an Infinite Set of Models in Reinforcement Learning
Nguyen, P., Maillard, O-A., Ryabko, D. & Ortner, R., 2013, JMLR Workshop and Conference Proceedings Volume 31 : Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics. p. 463-471Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Linear regression with random projections.
Maillard, O-A., 2013, In: Journal of machine learning research (JMLR). 13, p. 1-1Research output: Contribution to journal › Article › Research › peer-review
- Published
Optimal regret bounds for selecting the state representation in reinforcement learning.
Maillard, O-A., Nguyen, P., Ortner, R. & Ryabko, D., 2013, JMLR Workshop and Conference Proceedings Volume 28 : Proceedings of The 30th International Conference on Machine Learning. p. 543-551Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Reinforcement Learning (Dagstuhl Seminar 13321)
Auer, P., 2013, In: Dagstuhl Reports. 3, p. 1-26Research output: Contribution to journal › Article › Research › peer-review
- 2012
- Published
Autonomous Exploration For Navigating In MDPs.
Lim, S. H. & Auer, P., 2012, Proceedings of the 25th Annual Conference on Learning Theory. p. 40.1-40.24Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Entwicklung einer Simulation für Kommissioniersysteme
Salmutter, A., 2012Research output: Thesis › Master's Thesis
- Published
Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments
Seldin, Y., Szepesvári, C., Auer, P. & Abbasi-Yadkori, Y., 2012, Proceedings of the Tenth European Workshop on Reinforcement Learning, EWRL 2012. p. 103-116 (JMLR proceedings).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Online Optimization with Gradual Variations
Chiang, C-K., 2012, COLT 2012: Proceedings of the 25th Annual Conference on Learning Theory June 25-27, 2012, Edinburgh, Scotland. Mannor, S., Srebro, N. & Willamson, R. C. (eds.). p. 6.1-6.20 (JMLR Workshop and Conference Proceedings; vol. 23).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
- Published
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Ortner, R. & Ryabko, D., 2012, Advances in Neural Information Processing Systems 25. MIT Press, p. 1772-1780Research output: Chapter in Book/Report/Conference proceeding › Conference contribution