← Back ICRA 2026

MICA: Multi-Agent Industrial Coordination Assistant

Di Wen, Kunyu Peng, Junwei Zheng, Yufan Chen, Yitian Shi, Jiale Wei, Ruiping LIU, Kailun Yang, Rainer Stiefelhagen

PDF

AI summary

Key figure (auto-extracted from paper)

MICA delivers reliable, real-time industrial assistance on edge hardware by combining egocentric vision, adaptive step fusion, and safety-audited multi-agent reasoning.

multi-agent systems industrial assistance edge computing adaptive step fusion egocentric vision safety auditing

Problem

Existing multi-agent LLM assistants lack perception grounding, real-time responsiveness, and privacy-preserving edge deployment required for safety-critical factory workflows.

Approach

MICA integrates depth-guided egocentric vision with a lightweight multi-agent reasoning core and Adaptive Step Fusion (ASF) to dynamically blend workflow rules, visual retrieval, and online speech feedback under strict safety auditing.

Key results

Adaptive Step Fusion improves step recognition accuracy and calibration via online speech feedback
Role-specialized multi-agent routing consistently outperforms baseline coordination topologies in task success and reliability
System achieves real-time, safety-audited guidance while running fully offline on practical edge hardware
New benchmark with Knowledge Base Alignment and Energy-per-success metrics enables standardized evaluation of industrial assistance

Why it matters

Enables deployable, privacy-preserving multi-agent assistance for dynamic factory environments where cloud offloading and large annotated datasets are infeasible.

Abstract

Industrial workflows demand adaptive and trust- worthy assistance that can operate under limited computing, connectivity, and strict privacy constraints. In this work, we present MICA (Multi-Agent Industrial Coordination Assis- tant), a perception-grounded and speech-interactive system that delivers real-time guidance for assembly, troubleshoot- ing, part queries, and maintenance. MICA coordinates five role-specialized language agents, audited by a safety checker, to ensure accurate and compliant support. To achieve ro- bust step understanding, we introduce Adaptive Step Fusion (ASF), which dynamically blends expert reasoning with online adaptation from natural speech feedback. Furthermore, we establish a new multi-agent coordination benchmark across representative task categories and propose evaluation metrics tailored to industrial assistance, enabling systematic comparison of different coordination topologies. Our experiments demon- strate that MICA consistently improves task success, reliability, and responsiveness over baseline structures, while remain- ing deployable on practical offline hardware. Together, these contributions highlight MICA as a step toward deployable, privacy-preserving multi-agent assistants for dynamic factory environments. The source code will be made publicly available at https://github.com/Kratos-Wen/MICA.

Index terms

Assembly Industrial Robots Wearable Robotics