Research Analyzer
← Back ICRA 2024

EfficientDPS: Efficient and End-To-End Depth-Aware Panoptic Segmentation

Shengkai Wu, Liangliang Ren, linfeng gao, Yupeng Li, Wenyu Liu

PDF
Key figure (auto-extracted from paper)

Abstract

Depth-aware panoptic segmentation (DPS) com- bines image segmentation and monocular depth estimation in a single model to achieve semantic and geometry perception simultaneously. DPS task has important applications in the robot area but the previous DPS models are too heavy to be applied. Thus, we propose EfficientDPS, an efficient, end- to-end, and unified model for DPS. In our method, query features extracted with convolution networks are used to represent things/stuff. In this way, different vision tasks such as classification, segmentation, and depth estimation can be realized in a unified manner, leading to a compact and efficient model. EfficientDPS can be trained and tested in an end-to-end manner via bipartite matching and complex post-process is not needed at inference. To enhance the supervision signal, group query representation is proposed, leading to better performance without affecting the inference speed. Extensive experiments on Cityscapes-DPS and SemKITTI-DPS show that EfficientDPS can achieve the best trade-off between speed and accuracy than the state-of-the-art methods.

Index terms

Object Detection Segmentation and Categorization Semantic Scene Understanding Deep Learning for Visual Perception