Online Regret Bounds for Undiscounted Continuous Reinforcement Learning

Publikationen: KonferenzbeitragPosterForschung(peer-reviewed)

Standard

Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. / Ortner, Ronald; Ryabko, Daniil.
2012. Postersitzung präsentiert bei 26th Annual Conference on Neural Information Processing Systems, Lake Tahoe, Nevada, USA / Vereinigte Staaten.

Publikationen: KonferenzbeitragPosterForschung(peer-reviewed)

Harvard

Ortner, R & Ryabko, D 2012, 'Online Regret Bounds for Undiscounted Continuous Reinforcement Learning', 26th Annual Conference on Neural Information Processing Systems, Lake Tahoe, USA / Vereinigte Staaten, 6/12/12 - 6/12/12.

APA

Ortner, R., & Ryabko, D. (2012). Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. Postersitzung präsentiert bei 26th Annual Conference on Neural Information Processing Systems, Lake Tahoe, Nevada, USA / Vereinigte Staaten.

Vancouver

Ortner R, Ryabko D. Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. 2012. Postersitzung präsentiert bei 26th Annual Conference on Neural Information Processing Systems, Lake Tahoe, Nevada, USA / Vereinigte Staaten.

Author

Ortner, Ronald ; Ryabko, Daniil. / Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. Postersitzung präsentiert bei 26th Annual Conference on Neural Information Processing Systems, Lake Tahoe, Nevada, USA / Vereinigte Staaten.

Bibtex - Download

@conference{cfa813b1d63241de96c3f3d28a369d04,
title = "Online Regret Bounds for Undiscounted Continuous Reinforcement Learning",
author = "Ronald Ortner and Daniil Ryabko",
year = "2012",
language = "English",
note = "Advances in Neural Information Processing Systems ; Conference date: 06-12-2012 Through 06-12-2012",

}

RIS (suitable for import to EndNote) - Download

TY - CONF

T1 - Online Regret Bounds for Undiscounted Continuous Reinforcement Learning

AU - Ortner, Ronald

AU - Ryabko, Daniil

PY - 2012

Y1 - 2012

M3 - Poster

T2 - Advances in Neural Information Processing Systems

Y2 - 6 December 2012 through 6 December 2012

ER -