Omnidirectional Dense SLAM for Back-To-Back Fisheye Cameras
Weijian Xie, Guanyi Chu, Quanhao Qian, Yihao Yu, Shangjin Zhai, Danpeng Chen, Nan Wang, Hujun Bao, Guofeng Zhang
Abstract
We propose a real-time visual-inertial dense SLAM system that utilizes the online data streams from back- to-back dual fisheye cameras setup, providing 360◦coverage of the environment. Firstly, we employ a sliding-window-based front-end to estimate real-time poses from the binocular fisheye images and IMU data. Then, we implement a lightweight panoramic depth completion network based on multi-basis depth representation. The network takes panoramic images (obtained by stitching dual-fisheye images with extrinsics and intrinsic parameters) and sparse depths (generated by the front- end local tracking) as input and predicts multiple depth bases along with corresponding confidence as output. The final dense depth is the linear combination of the multiple depth bases. Thanks to the multi-basis depth representation, we can contin- uously optimize the 360◦depth with the traditional optimizer to achieve higher global consistency in depth. We conducted experiments on both simulated and real-world datasets to evaluate our method. The results demonstrate that the proposed method outperforms SoTA methods in terms of depth prediction and 3D reconstruction. In addition, we develop a demo that can run on a mobile to demonstrate the real-time capabilities of our method.