Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Publications: Conference contribution › Poster › Research › (peer-reviewed)
Standard
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. / Ortner, Ronald; Ryabko, Daniil.
2012. Poster session presented at 26th Annual Conference on Neural Information Processing Systems, Lake Tahoe, Nevada, United States.
Harvard
Ortner, R & Ryabko, D 2012, 'Online Regret Bounds for Undiscounted Continuous Reinforcement Learning', 26th Annual Conference on Neural Information Processing Systems, Lake Tahoe, United States, 6/12/12 - 6/12/12.
APA
Ortner, R., & Ryabko, D. (2012). Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. Poster session presented at 26th Annual Conference on Neural Information Processing Systems, Lake Tahoe, Nevada, United States.
Vancouver
Ortner R, Ryabko D. Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. 2012. Poster session presented at 26th Annual Conference on Neural Information Processing Systems, Lake Tahoe, Nevada, United States.
Author
BibTeX
@conference{cfa813b1d63241de96c3f3d28a369d04,
title = "Online Regret Bounds for Undiscounted Continuous Reinforcement Learning",
author = "Ronald Ortner and Daniil Ryabko",
year = "2012",
language = "English",
note = "Advances in Neural Information Processing Systems ; Conference date: 06-12-2012 Through 06-12-2012",
}
RIS (suitable for import to EndNote)
TY - CONF
T1 - Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
AU - Ortner, Ronald
AU - Ryabko, Daniil
PY - 2012
Y1 - 2012
M3 - Poster
T2 - Advances in Neural Information Processing Systems
Y2 - 6 December 2012 through 6 December 2012
ER -