An algorithm with nearly optimal pseudo-regret for both stochastic and adversarial bandits

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Details

Original languageEnglish
Title of host publicationProceedings of the 29th Conference on Learning Theory, COLT 2016
Pages116-120
Publication statusPublished - 23 Jun 2016

Publication series

NameJMLR Workshop and Conference Proceedings
PublisherJMLR.org
Volume49
ISSN (electronic)1938-7228