Research Analyzer
← Back IROS 2024

Imagine2Servo: Intelligent Visual Servoing with Diffusion-Driven Goal Generation for Robotic Tasks

Pranjali Pathre, Gunjan Gupta, Mohammad Nomaan Qureshi, Brunda Mandyam, Samarth Manoj Brahmbhatt, Madhava Krishna

PDF

Abstract

Visual servoing, the method of controlling robot motion through feedback from visual sensors, has seen signif- icant advancements with the integration of optical flow-based methods. However, its application remains limited by inherent challenges, such as the necessity for a target image at test time, the requirement of substantial overlap between initial and target images, and the reliance on feedback from a single camera. This paper introduces Imagine2Servo†, an innovative approach leveraging diffusion-based image editing techniques to enhance visual servoing algorithms by generating intermediate goal images. This methodology allows for the extension of visual servoing applications beyond traditional constraints, enabling tasks like long-range navigation and manipulation without pre- defined goal images. We propose a pipeline that synthesizes subgoal images grounded in the task at hand, facilitating ser- voing in scenarios with minimal initial and target image overlap and integrating multi-camera feedback for comprehensive task execution. Our contributions demonstrate a novel application of image generation to robotic control, significantly broadening the capabilities of visual servoing systems. Real-world experiments validate the effectiveness and versatility of the Imagine2Servo framework in accomplishing a variety of tasks, marking a notable advancement in the field of visual servoing.

Index terms

Visual Servoing Vision-Based Navigation Task and Motion Planning