Intrinsic motivation and mental replay enable efficient online adaptation in stochastic recurrent networks

Daniel Tanneberg; Jan Peters; Elmar Rueckert

doi:10.1016/j.neunet.2018.10.005

Publication: Contribution to journal › Article › Research › (peer-reviewed)

Authors

Daniel Tanneberg
Jan Peters
Elmar Rueckert

External organisational units

Technische Universität Darmstadt
Max-Planck Institute for Intelligent Systems
Universität Lübeck

Abstract

Autonomous robots need to interact with unknown, unstructured and changing environments, constantly facing novel challenges. Therefore, continuous online adaptation for lifelong learning and sample-efficient mechanisms for adapting to changes in the environment, the constraints, the tasks, or the robot itself are crucial. In this work, we propose a novel framework for probabilistic online motion planning with online adaptation based on a bio-inspired stochastic recurrent neural network. By using learning signals that mimic the intrinsic motivation signal cognitive dissonance, in combination with a mental replay strategy to intensify experiences, the stochastic recurrent network can learn from few physical interactions and adapt to novel environments in seconds. We evaluate our online planning and adaptation framework on an anthropomorphic KUKA LWR arm. The rapid online adaptation is demonstrated by learning unknown workspace constraints sample-efficiently from few physical interactions while following given waypoints.

Details

Original language	English
Pages (from - to)	67-80
Number of pages	14
Journal	Neural Networks
Volume	109.2019
Issue number	January
Early online date	22 Oct 2018
DOIs	https://doi.org/10.1016/j.neunet.2018.10.005
Status	Published - Jan 2019
Externally published	Yes
