Publikationen

31 - 40 von 139Seitengröße: 10

Sortieren nach: Erscheinungsjahr

2016
Veröffentlicht
Pareto Front Identification from Stochastic Bandit Feedback
Auer, P., Chiang, C-K., Ortner, R. & Drugan, M., 2016, Proceedings of the Nineteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2016. S. 939-947 (JMLR Workshop and Conference Proceedings).
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
2014
Veröffentlicht
Algorithmic Learning Theory: 25th International Conference, ALT 2014 Bled, Slovenia, October 8-10, 2014 Proceedings
Auer, P., Clark, A., Zeugmann, T. & Zilles, S., 1 Jan. 2014, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer Berlin, Band 8776. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 8776).
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
Editors’ introduction
Auer, P., Clark, A., Zeugmann, T. & Zilles, S., 1 Jan. 2014, in: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8776, S. 1-7 7 S.
Publikationen: Beitrag in Fachzeitschrift › Leitartikel › (peer-reviewed)
Veröffentlicht
Regret Bounds for Restless Markov Bandits
Ortner, R., Ryabko, D., Auer, P. & Munos, R., 2014, in: Theoretical Computer Science. 558, S. 62-76
Publikationen: Beitrag in Fachzeitschrift › Artikel › Forschung › (peer-reviewed)
2013
Veröffentlicht
Reinforcement Learning (Dagstuhl Seminar 13321)
Auer, P., 2013, in: Dagstuhl Reports. 3, S. 1-26
Publikationen: Beitrag in Fachzeitschrift › Artikel › Forschung › (peer-reviewed)
2012
Veröffentlicht
Autonomous Exploration For Navigating In MDPs.
Lim, S. H. & Auer, P., 2012, Proceedings of the 25th Annual Conference on Learning Theory. S. 40.1-40.24
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments
Seldin, Y., Szepesvári, C., Auer, P. & Abbasi-Yadkori, Y., 2012, Proceedings of the Tenth European Workshop on Reinforcement Learning, EWRL 2012. S. 103-116 (JMLR proceedings).
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
PAC Subset Selection in Stochastic Multi-armed Bandits
Kalyanakrishnan, S., Tewari, A., Auer, P. & Stone, P., 2012, Proceedings of the 29th International Conference on Machine Learning, ICML 2012.
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits.
Seldin, Y., Cesa-Bianchi, N., Auer, P., Laviolette, F. & Shawe-Taylor, J., 2012, Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. S. 98-111
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
PAC-Bayesian Analysis of Contextual Bandits
Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J. S. & Ortner, R., 2012, Advances in Neural Information Processing Systems 24. MIT Press, S. 1683-1691
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband

Vorherige 1 2 3 4 5 6 7 8 ...14 Nächste

Forschungsportal

Publikationen

Pareto Front Identification from Stochastic Bandit Feedback

Algorithmic Learning Theory: 25th International Conference, ALT 2014 Bled, Slovenia, October 8-10, 2014 Proceedings

Editors’ introduction

Regret Bounds for Restless Markov Bandits

Reinforcement Learning (Dagstuhl Seminar 13321)

Autonomous Exploration For Navigating In MDPs.

Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments

PAC Subset Selection in Stochastic Multi-armed Bandits

PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits.

PAC-Bayesian Analysis of Contextual Bandits

Erweiterte Suche