Research Analyzer
← Back IROS 2024

SurrealDriver: Designing LLM-Powered Generative Driver Agent Framework Based on Human Drivers' Driving-Thinking Data

Ye Jin, Ruoxuan Yang, Zhijie Yi, Xiaoxi SHEN, Peng Huiling, Xiaoan Liu, Jingli Qin, Li Jiayang, Peizhong Gao, Guyue Zhou, Jiangtao Gong

PDF

Abstract

Leveraging advanced reasoning capabilities and extensive world knowledge of large language models (LLMs) to construct generative agents for solving complex real-world problems is a major trend. However, LLMs inherently lack embodiment as humans, resulting in suboptimal performance in many embodied decision-making tasks. In this paper, we introduce a framework for building human-like generative driving agents using post-driving self-report driving-thinking data from human drivers as both demonstration and feed- back. To capture high-quality, natural language data from drivers, we conducted urban driving experiments, recording drivers’ verbalized thoughts under various conditions to serve as chain-of-thought prompts and demonstration examples for the LLM-Agent. The framework’s effectiveness was evalu- ated through simulations and human assessments. Results indicate that incorporating expert demonstration data sig- nificantly reduced collision rates by 81.04% and increased human likeness by 50% compared to a baseline LLM-based agent. Our study provides insights into using natural language- based human demonstration data for embodied tasks. The driving-thinking dataset is available at https://github. com/AIR-DISCOVER/Driving-Thinking-Dataset.

Index terms

Intelligent Transportation Systems Embodied Cognitive Science Agent-Based Systems