Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Standard
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. / Ortner, Ronald; Ryabko, Daniil.
Advances in Neural Information Processing Systems 25. MIT Press, 2012. S. 1772-1780.
Advances in Neural Information Processing Systems 25. MIT Press, 2012. S. 1772-1780.
Publikationen: Beitrag in Buch/Bericht/Konferenzband › Beitrag in Konferenzband
Harvard
Ortner, R & Ryabko, D 2012, Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. in Advances in Neural Information Processing Systems 25. MIT Press, S. 1772-1780.
APA
Ortner, R., & Ryabko, D. (2012). Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. In Advances in Neural Information Processing Systems 25 (S. 1772-1780). MIT Press.
Vancouver
Ortner R, Ryabko D. Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. in Advances in Neural Information Processing Systems 25. MIT Press. 2012. S. 1772-1780
Author
Bibtex - Download
@inproceedings{e17d94b65d0945d29c6beffee1cad4df,
title = "Online Regret Bounds for Undiscounted Continuous Reinforcement Learning",
author = "Ronald Ortner and Daniil Ryabko",
year = "2012",
language = "English",
pages = "1772--1780",
booktitle = "Advances in Neural Information Processing Systems 25",
publisher = "MIT Press",
address = "United States",
}
RIS (suitable for import to EndNote) - Download
TY - GEN
T1 - Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
AU - Ortner, Ronald
AU - Ryabko, Daniil
PY - 2012
Y1 - 2012
M3 - Conference contribution
SP - 1772
EP - 1780
BT - Advances in Neural Information Processing Systems 25
PB - MIT Press
ER -