Peter Auer

21 - 40 von 132Seitengröße: 20

Sortieren nach: Erscheinungsjahr

Publikationen

2018
Veröffentlicht
A Sliding-Window Approach for Reinforcement Learning in MDPs with Arbitrarily Changing Rewards and Transitions.
Gajane, P., Ortner, R. & Auer, P., 2018.
Publikationen: Konferenzbeitrag › Paper › (peer-reviewed)
2017
Veröffentlicht
Online Learning
Auer, P., 2017, Encyclopedia of Machine Learning and Data Mining.
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Buch/Sammelband › Forschung
2016
Veröffentlicht
Algorithmic Learning Theory
Auer, P., Clark, A. & Zeugmann, T., 18 Okt. 2016, in: Theoretical Computer Science. 650, S. 1-3
Publikationen: Beitrag in Fachzeitschrift › Sonderausgabe/Sonderband › Forschung › (peer-reviewed)
Veröffentlicht
Guest editors' foreword
Auer, P., Clark, A. & Zeugmann, T., 18 Okt. 2016, in: Theoretical Computer Science. 650.2016, 18 October, S. 1-3 3 S.
Publikationen: Beitrag in Fachzeitschrift › Kurzmitteilung › (peer-reviewed)
Veröffentlicht
An algorithm with nearly optimal pseudo-regret for both stochastic and adversarial bandits
Auer, P. & Chiang, C.-K., 23 Juni 2016, Proceedings of the 29th Conference on Learning Theory, COLT 2016. S. 116-120 (JMLR Workshop and Conference Proceedings; Band 49).
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
Learning with Malicious Noise
Auer, P., 22 Apr. 2016, Encyclopedia of Algorithms. Springer, S. 1086-1089
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Eintrag in Nachschlagewerk › Forschung
Veröffentlicht
Pareto Front Identification from Stochastic Bandit Feedback
Auer, P., Chiang, C.-K., Ortner, R. & Drugan, M., 2016, Proceedings of the Nineteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2016. S. 939-947 (JMLR Workshop and Conference Proceedings).
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
2014
Veröffentlicht
Algorithmic Learning Theory: 25th International Conference, ALT 2014 Bled, Slovenia, October 8-10, 2014 Proceedings
Auer, P. (Mit-Herausgeber), Clark, A. (Mit-Herausgeber), Zeugmann, T. (Mit-Herausgeber) & Zilles, S. (Mit-Herausgeber), 1 Jan. 2014, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer Berlin, Band 8776. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 8776).
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
Editors’ introduction
Auer, P., Clark, A., Zeugmann, T. & Zilles, S., 1 Jan. 2014, in: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8776, S. 1-7 7 S.
Publikationen: Beitrag in Fachzeitschrift › Leitartikel › (peer-reviewed)
Veröffentlicht
Regret Bounds for Restless Markov Bandits
Ortner, R., Ryabko, D., Auer, P. & Munos, R., 2014, in: Theoretical Computer Science. 558, S. 62-76
Publikationen: Beitrag in Fachzeitschrift › Artikel › Forschung › (peer-reviewed)
2013
Veröffentlicht
Reinforcement Learning (Dagstuhl Seminar 13321)
Auer, P., 2013, in: Dagstuhl Reports. 3, S. 1-26
Publikationen: Beitrag in Fachzeitschrift › Artikel › Forschung › (peer-reviewed)
2012
Veröffentlicht
Autonomous Exploration For Navigating In MDPs.
Lim, S. H. & Auer, P., 2012, Proceedings of the 25th Annual Conference on Learning Theory. S. 40.1-40.24
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments
Seldin, Y., Szepesvári, C., Auer, P. & Abbasi-Yadkori, Y., 2012, Proceedings of the Tenth European Workshop on Reinforcement Learning, EWRL 2012. S. 103-116 (JMLR proceedings).
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits.
Seldin, Y., Cesa-Bianchi, N., Auer, P., Laviolette, F. & Shawe-Taylor, J., 2012, Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2. S. 98-111
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
PAC-Bayesian Analysis of Contextual Bandits
Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J. S. & Ortner, R., 2012, Advances in Neural Information Processing Systems 24. MIT Press, S. 1683-1691
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
PAC-Bayesian Inequalities for Martingales
Seldin, Y., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, UAI 2012.
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
PAC-Bayesian Inequalities for Martingales.
Seldin, Y., Laviolette, F., Cesa-Bianchi, N., Shawe-Taylor, J. & Auer, P., 2012, in: IEEE transactions on information theory. 58, S. 7086-7093
Publikationen: Beitrag in Fachzeitschrift › Artikel › Forschung › (peer-reviewed)
Veröffentlicht
PAC Subset Selection in Stochastic Multi-armed Bandits
Kalyanakrishnan, S., Tewari, A., Auer, P. & Stone, P., 2012, Proceedings of the 29th International Conference on Machine Learning, ICML 2012.
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Veröffentlicht
Regret Bounds for Restless Markov Bandits
Ortner, R., Ryabko, D., Auer, P. & Munos, R., 2012, Algorithmic Learning Theory 23rd International Conference, ALT 2012, Lyon, France, October 29-31, 2012. Proceedings. S. 214-228
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
2011
Veröffentlicht
Upper-confidence-bound algorithms for active learning in multi-armed bandits
Carpentier, A., Lazaric, A., Ghavamzadeh, M., Munos, R. & Auer, P., 20 Okt. 2011, Algorithmic Learning Theory - 22nd International Conference, ALT 2011, Proceedings. S. 189-203 15 S. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 6925 LNAI).
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband

Vorherige 1 2 3 4 5 6 7 Nächste

Forschungsportal

Peter Auer

Publikationen

A Sliding-Window Approach for Reinforcement Learning in MDPs with Arbitrarily Changing Rewards and Transitions.

Online Learning

Algorithmic Learning Theory

Guest editors' foreword

An algorithm with nearly optimal pseudo-regret for both stochastic and adversarial bandits

Learning with Malicious Noise

Pareto Front Identification from Stochastic Bandit Feedback

Algorithmic Learning Theory: 25th International Conference, ALT 2014 Bled, Slovenia, October 8-10, 2014 Proceedings

Editors’ introduction

Regret Bounds for Restless Markov Bandits

Reinforcement Learning (Dagstuhl Seminar 13321)

Autonomous Exploration For Navigating In MDPs.

Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments

PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits.

PAC-Bayesian Analysis of Contextual Bandits

PAC-Bayesian Inequalities for Martingales

PAC-Bayesian Inequalities for Martingales.

PAC Subset Selection in Stochastic Multi-armed Bandits

Regret Bounds for Restless Markov Bandits

Upper-confidence-bound algorithms for active learning in multi-armed bandits

Kontakt

Forschungsportal

Peter Auer

Publikationen

Kontakt

Neuester Forschungsoutput