Research Analyzer
← Back ICRA 2024

GraspGPT: Leveraging Semantic Knowledge from a Large Language Model for Task-Oriented Grasping

Chao Tang, Dehao Huang, Wenqi Ge, Weiyu Liu, Hong Zhang

PDF

Abstract

Task-oriented grasping (TOG) refers to the problem of predicting grasps on an object that enable subsequent manipu- lation tasks. To model the complex relationships between objects, tasks, and grasps, existing methods incorporate semantic knowl- edge as priors into TOG pipelines. However, the existing semantic knowledge is typically constructed based on closed-world concept sets, restraining the generalization to novel concepts out of the pre-defined sets. To address this issue, we propose GraspGPT, a large language model (LLM) based TOG framework that lever- ages the open-end semantic knowledge from an LLM to achieve zero-shotgeneralizationtonovelconcepts.Weconductexperiments on Language Augmented TaskGrasp (LA-TaskGrasp) dataset and demonstrate that GraspGPT outperforms existing TOG methods on different held-out settings when generalizing to novel concepts out of the training set. The effectiveness of GraspGPT is further validated in real-robot experiments.

Index terms

Grasping Perception for Grasping and Manipulation Deep Learning in Grasping and Manipulation