Logarithmic online regret bounds for undiscounted reinforcement learning

Research output: Contribution to conferencePosterResearchpeer-review

Details

Translated title of the contributionLogarithmic online regret bounds for undiscounted reinforcement learning
Original languageEnglish
Publication statusPublished - 2006
EventAdvances in Neural Information Processing Systems (NIPS) 2006 - Vancouver, Canada
Duration: 4 Dec 20067 Dec 2006

Conference

ConferenceAdvances in Neural Information Processing Systems (NIPS) 2006
Country/TerritoryCanada
CityVancouver
Period4/12/067/12/06