Research Analyzer
← Back ICRA 2026

ProVox: Personalization and Proactive Planning for Situated Human-Robot Collaboration

Jennifer Grannen, Siddharth Karamcheti, Blake Wulfe, Dorsa Sadigh

PDF

AI summary

Key figure (auto-extracted from paper)
ProVox enables collaborative robots to personalize to individual users and proactively suggest actions, cutting task completion time by 38.7% and reducing user burden by 31.9%.
Personalization Proactive Planning Human-Robot Collaboration Large Language Models Meta-Prompting Situated Interaction

Problem

Situated human-robot collaboration requires adapting to diverse partners' unique goals and preferences, but existing systems rely on static interfaces that force users to constantly supervise and explicitly instruct the robot.

Approach

The framework introduces a meta-prompting protocol that lets users define their goals and preferred behaviors upfront, which conditions a proactive large language model planner to anticipate needs and suggest helpful actions during tasks.

Key results

  • 38.7% faster task completion times
  • 31.9% reduction in user burden
  • Improved perceived helpfulness (+18.4%) and ease of use (+27.3%)
  • Meta-prompting effectively handles cross-user diversity and improves planning accuracy

Why it matters

It makes proactive, personalized robot assistance practical for non-expert users in dynamic real-world settings like households and assistive care.

Abstract

Collaborative robots must quickly adapt to their partner’s intent and preferences to proactively identify helpful actions. This is especially true in situated settings where human partners can continually teach robots new high-level behaviors, visual concepts, and physical skills (e.g., through demonstration), growing the robot’s capabilities as the human-robot pair work together to accomplish diverse tasks. In this work, we argue that robots should be able to infer their partner’s goals from early interactions and use this information to proactively plan behaviors ahead of explicit instructions from the user. Building from the strong commonsense priors and steerability of large language models, we introduce ProVox (“Proactive Voice”), a novel framework that enables robots to efficiently personalize and adapt to individual collaborators. We design a meta-prompting protocol that empowers users to communicate their distinct preferences, intent, and expected robot behaviors ahead of starting a physical interaction. ProVox then uses the personalized prompt to condition a proactive language model task planner that anticipates a user’s intent from the current interaction context and robot capabilities to suggest helpful actions; in doing so, we alleviate user burden, minimizing the amount of time partners spend explicitly instructing and supervising the robot. We evaluate ProVox through user studies grounded in household manipulation tasks (e.g., assembling lunch bags) that measure the efficiency of the collaboration, as well as features such as perceived helpfulness, ease of use, and reliability. Our analysis suggests that both meta-prompting and proactivity are critical, resulting in 38.7% faster task completion times and 31.9% less user burden relative to non-active baselines.1

Index terms

Human-Robot Collaboration Long term Interaction Human-Robot Teaming

Related papers