← Back SII 2026

A ROS-Based Multi-Modal Architecture for Fall Detection and Response with a Social Robot

Kavyan Zoughalian, Imene Tarakli, Aung Htet, Joshua Bamforth, Alejandro Jimenez-Rodriguez, Jims Marchang, Alessandro Di Nuovo

PDF

Abstract

Falls are a leading cause of injury in older adults, requiring detection systems that are both sensitive and reliable. We present a multi-modal robotic framework that integrates wearable sensing, vision-based verification, and dialogue-driven assessment. A smartwatch streams inertial data, with thresholds tuned through pilot testing to maximise fall sensitivity. Vision verification is performed using a fine-tuned YOLOv11 model, while Whisper ASR and a lightweight GPT-based classifier enable simple verbal checks of user responsiveness. Our tuned thresholds outperformed published baselines (F1 = 0.857), and the vision module achieved strong accuracy (mAP@0.5 = 0.827). In integrated trials, the system reached a 90.6% success rate with a mean end-to-end response time of 43.5 seconds. These results show that combining complementary modalities enhances robustness and moves socially assistive robots toward interactive fall response in real-world care.

Index terms

Assistive Robotics Human-robot Interaction / Collaboration Decision-making systems