← Back IROS 2024

Interactive Reinforcement Learning from Natural Language Feedback

Imene Tarakli, Samuele Vinanzi, Alessandro Di Nuovo

PDF

Abstract

Large Language Models (LLMs) are increas- ingly influential in advancing robotics. This paper introduces ECLAIR (Evaluative Corrective Guidance Language as Rein- forcement), a novel framework that leverages LLMs to interpret and incorporate diverse natural language feedback into robotic learning. ECLAIR unifies various forms of human advice into actionable insights within a Reinforcement Learning context, enabling more efficient robot instruction. Experiments with real-world users demonstrate that ECLAIR accelerates the robot’s learning process, aligning its policy closer to optimal from the outset and reducing the need for extensive human intervention. Additionally, ECLAIR effectively integrates mul- tiple types of advice and adapts well to prompt modifica- tions. It also supports multilingual instruction, broadening its applicability and fostering more inclusive human-robot in- teractions. Project website: https://sites.google.com/ view/eclairiros

Index terms

Human Factors and Human-in-the-Loop Reinforcement Learning Cognitive Modeling