OP-SLAM: An RGB-D SLAMMOT Method Leveraging the Constraints of Object Planar Features

Yingli WANG; Yang LIU; Chi GUO

doi:10.1051/wujns/2026311045

Open Access

Issue		Wuhan Univ. J. Nat. Sci. Volume 31, Number 1, February 2026


Page(s)		45 - 57
DOI		https://doi.org/10.1051/wujns/2026311045
Published online		06 March 2026

Wuhan University Journal of Natural Sciences, 2026, Vol.31 No.1, 45-57

Deep Learning and Intelligent Perception

CLC number: TP368.2

OP-SLAM: An RGB-D SLAMMOT Method Leveraging the Constraints of Object Planar Features

OP-SLAM：一种融合物体平面特征约束的RGB-D SLAMMOT方法

Yingli WANG (王滢莉)¹^,2, Yang LIU (刘阳)³ and Chi GUO (郭迟)¹^,4^†

¹ GNSS Research Center, Wuhan University, Wuhan 430072, Hubei, China (武汉大学　卫星导航定位技术研究中心，湖北　武汉　430072)
² School of Geodesy and Geomatics, Wuhan University, Wuhan 430072, Hubei, China (武汉大学　测绘学院，湖北　武汉　430072)
³ School of Electronic Information, Wuhan University, Wuhan 430072, Hubei, China (武汉大学　电子信息学院，湖北　武汉　430072)
⁴ Artificial Intelligence Institute, Wuhan University, Wuhan 430072, Hubei, China (武汉大学　人工智能研究院，湖北　武汉　430072)

^† Corresponding author. E-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.

Received: 10 October 2025

Abstract

By integrating self-localization, environment mapping, and dynamic object tracking into a unified framework, visual simultaneous localization and mapping with multiple object tracking (SLAMMOT) enhances decision-making and interaction capabilities in applications such as autonomous driving, robotic navigation, and augmented reality. While numerous outstanding visual SLAMMOT methods have been proposed, the majority rely only on point features, overlooking the abundant and stable planar features in artificial objects that can provide valuable constraints. To address this limitation, we propose OP (object planar) -SLAM, an RGB-D SLAMMOT system that leverages planar features to improve object pose estimation and reconstruction accuracy. Specifically, we introduce an accurate object planar feature extraction and association method using normal images, alongside a novel object bundle adjustment framework that incorporates planar constraints for enhanced optimization. The proposed system is evaluated on both synthetic and public real-world datasets, including Oxford multimotion dataset (OMD) and KITTI tracking dataset. Especially on the OMD, where planar features are prominent, our method improves object pose estimation accuracy by approximately 60%. Extensive experiments demonstrate its effectiveness in enhancing object pose estimation and reconstruction, achieving notable performance compared with existing methods. Furthermore, OP-SLAM runs in real time, making it suitable for practical robots and augmented reality applications.

摘要

视觉SLAMMOT(同时定位与建图与多目标跟踪)将自定位、环境建图和动态物体跟踪集成于统一框架，在自动驾驶、机器人导航和增强现实等应用中提升决策与交互能力。尽管已有多种优秀的视觉SLAMMOT方法，但它们大多数仅依赖点特征，忽略了人工物体中丰富且稳定的平面特征，而这些特征可提供宝贵的约束信息。为解决这一局限，本文提出OP-SLAM，一种利用平面特征提升物体位姿估计与重建精度的RGB-D SLAMMOT系统。具体而言，我们基于法向图像提出了精确的物体平面特征提取与关联方法，并设计了一种融合平面约束的新型物体捆集调整框架以增强优化效果，所提出的方法在合成数据集及公开真实数据集(包括OMD和KITTI tracking)上进行了验证。大量实验结果表明，该方法在物体位姿估计与重建方面均取得了显著效果，并优于现有方法。尤其是在平面特征显著的OMD数据集上，我们的方法将物体位姿估计精度提升约60%。此外，OP-SLAM可实时运行，适用于实际机器人和增强现实应用。

Key words: visual simultaneous localization and mapping (SLAM) / multiple object tracking (MOT) / dynamic scenes / planar feature

关键字 : 视觉SLAM / 多目标跟踪 / 动态场景 / 平面特征

Cite this article:WANG Yingli, LIU Yang, GUO Chi. OP-SLAM: An RGB-D SLAMMOT Method Leveraging the Constraints of Object Planar Features[J]. Wuhan Univ J of Nat Sci, 2026, 31(1): 45-57.

Biography: WANG Yingli, female, Master candidate, research direction: dynamic VSLAM and robot navigation. E-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.

Foundation item: Supported by Major Science and Technology Project of Hubei Province (2022AAA009)

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.