Chair of Information Technology (150)

Organisational unit: Chair

81 - 90 out of 215Page size: 10

Sort by: Publication date

Research output

2013
Published
Beating Bandits in Gradually Evolving Worlds
Chiang, C-K., 2013, Conference on Learning Theory. Shalev-Shwartz, S. & Steinwart, I. (eds.). p. 210-227 (JMLR Workshop and Conference Proceedings; vol. 30).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Competing with an Infinite Set of Models in Reinforcement Learning
Nguyen, P., Maillard, O-A., Ryabko, D. & Ortner, R., 2013, JMLR Workshop and Conference Proceedings Volume 31 : Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics. p. 463-471
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Linear regression with random projections.
Maillard, O-A., 2013, In: Journal of machine learning research (JMLR). 13, p. 1-1
Research output: Contribution to journal › Article › Research › peer-review
Published
Optimal regret bounds for selecting the state representation in reinforcement learning.
Maillard, O-A., Nguyen, P., Ortner, R. & Ryabko, D., 2013, JMLR Workshop and Conference Proceedings Volume 28 : Proceedings of The 30th International Conference on Machine Learning. p. 543-551
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Reinforcement Learning (Dagstuhl Seminar 13321)
Auer, P., 2013, In: Dagstuhl Reports. 3, p. 1-26
Research output: Contribution to journal › Article › Research › peer-review
2012
Published
Autonomous Exploration For Navigating In MDPs.
Lim, S. H. & Auer, P., 2012, Proceedings of the 25th Annual Conference on Learning Theory. p. 40.1-40.24
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Entwicklung einer Simulation für Kommissioniersysteme
Salmutter, A., 2012
Research output: Thesis › Master's Thesis
Published
Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments
Seldin, Y., Szepesvári, C., Auer, P. & Abbasi-Yadkori, Y., 2012, Proceedings of the Tenth European Workshop on Reinforcement Learning, EWRL 2012. p. 103-116 (JMLR proceedings).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Online Optimization with Gradual Variations
Chiang, C-K., 2012, COLT 2012: Proceedings of the 25th Annual Conference on Learning Theory June 25-27, 2012, Edinburgh, Scotland. Mannor, S., Srebro, N. & Willamson, R. C. (eds.). p. 6.1-6.20 (JMLR Workshop and Conference Proceedings; vol. 23).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Published
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Ortner, R. & Ryabko, D., 2012, Advances in Neural Information Processing Systems 25. MIT Press, p. 1772-1780
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Previous 1...5 6 7 8 9 10 11 12 ...22 Next

Research Portal

Chair of Information Technology (150)

Research output

Beating Bandits in Gradually Evolving Worlds

Competing with an Infinite Set of Models in Reinforcement Learning

Linear regression with random projections.

Optimal regret bounds for selecting the state representation in reinforcement learning.

Reinforcement Learning (Dagstuhl Seminar 13321)

Autonomous Exploration For Navigating In MDPs.

Entwicklung einer Simulation für Kommissioniersysteme

Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments

Online Optimization with Gradual Variations

Online Regret Bounds for Undiscounted Continuous Reinforcement Learning

Contact information