S-Graphs 2.0 � a Hierarchical-Semantic Optimization and Loop Closure for SLAM
Hriday Bavle, Jose Luis Sanchez-Lopez, Muhammad Shaheer, Javier Civera, Holger Voos
AI summary
Problem
Existing SLAM and scene graph methods often ignore structural hierarchy during optimization, causing high computational costs and false loop closures due to geometric aliasing across different building floors.
Approach
The system constructs a four-layer graph of keyframes, walls, rooms, and floors, using floor-level segmentation to enable targeted loop closure and hierarchical optimization strategies that marginalize redundant nodes to reduce computational load.
Key results
- Floor detection and stairway segmentation module
- Floor-based loop closure eliminating cross-floor false positives
- Hierarchical optimization framework marginalizing redundant keyframes
- Up to 10× faster estimation with state-of-the-art accuracy in multi-floor settings
Why it matters
Provides a scalable, real-time SLAM solution for complex multi-floor buildings, crucial for robotics navigation and spatial understanding.
Abstract
The hierarchical nature of 3D scene graphs aligns well with the structure of man-made environments, making them highly suitable for representation purposes. Beyond this, however, their embedded semantics and geometry could also be leveraged to improve the efficiency of map and pose optimization, an opportunity that has been largely overlooked by existing methods. We introduce Situational Graphs 2.0 (S-Graphs 2.0), that effectively uses the hierarchical structure of indoor scenes for efficient data management and optimization. Our approach builds a four-layer situational graph comprising Keyframes, Walls, Rooms, and Floors. Our first contribution lies in the front-end, which includes a floor detection module capable of identifying stairways and assigning floor-level semantic relations to the underlying layers (Keyframes, Walls, and Rooms). Floor-level semantics allows us to propose a floor-based loop closure strategy, that effectively rejects false positive closures that typically appear due to aliasing between different floors of a building. Our second novelty lies in leveraging our representation hierarchy in the optimization. Our proposal consists of: (1) local optimization over a window of recent keyframes and their connected components across the four representation layers, (2) floor-level global opti- mization, which focuses only on keyframes and their connections within the current floor during loop closures, and (3) room-level local optimization, marginalizing redundant keyframes that share observations within the room, which reduces the computational footprint. We validate our algorithm extensively in different real multi-floor environments. Our approach shows state-of-the- art accuracy metrics in large-scale multi-floor environments, estimating hierarchical representations up to 10× faster, in average, than competing baselines. Our code is open-sourced at: https://github.com/snt-arg/lidar situational graphs