Online Regret Bounds for Undiscounted Continuous Reinforcement Learning

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Standard

Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. / Ortner, Ronald; Ryabko, Daniil.
Advances in Neural Information Processing Systems 25. MIT Press, 2012. p. 1772-1780.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Harvard

Ortner, R & Ryabko, D 2012, Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. in Advances in Neural Information Processing Systems 25. MIT Press, pp. 1772-1780.

APA

Ortner, R., & Ryabko, D. (2012). Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. In Advances in Neural Information Processing Systems 25 (pp. 1772-1780). MIT Press.

Vancouver

Ortner R, Ryabko D. Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. In Advances in Neural Information Processing Systems 25. MIT Press. 2012. p. 1772-1780

Author

Ortner, Ronald ; Ryabko, Daniil. / Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. Advances in Neural Information Processing Systems 25. MIT Press, 2012. pp. 1772-1780

Bibtex - Download

@inproceedings{e17d94b65d0945d29c6beffee1cad4df,
title = "Online Regret Bounds for Undiscounted Continuous Reinforcement Learning",
author = "Ronald Ortner and Daniil Ryabko",
year = "2012",
language = "English",
pages = "1772--1780",
booktitle = "Advances in Neural Information Processing Systems 25",
publisher = "MIT Press",
address = "United States",

}

RIS (suitable for import to EndNote) - Download

TY - GEN

T1 - Online Regret Bounds for Undiscounted Continuous Reinforcement Learning

AU - Ortner, Ronald

AU - Ryabko, Daniil

PY - 2012

Y1 - 2012

M3 - Conference contribution

SP - 1772

EP - 1780

BT - Advances in Neural Information Processing Systems 25

PB - MIT Press

ER -