Page 114 - 《武汉大学学报（信息科学版）》2025年第10期

P. 114

第 50 卷第 10 期张晨阳等：一种自适应点线特征和 IMU 耦合的视觉 SLAM 方法 2049

point detection in images. Additionally, the fast line segment detector algorithm makes it easy to detect
short lines and broken line features, and the performance of the algorithm is affected by the change of illumi‑
nation, resulting in over-extraction or wrong-extraction of line features. Accordingly, an adaptive algo‑
rithm for extracting line features was introduced, utilizing edge-detected binary images and incorporating
the removal of erroneous lines based on the geometry characteristics of the vanishing point. Following this,
the algorithm integrates the visual measurements from point-line features with the pre-integration measure‑
ment of IMU to yield reliable outcomes for front-end pose estimation and initialization parameters in a
loosely coupled manner. In the back-end section of our proposed SLAM method, a unified nonlinear mini‑
mization residual function is established for visual and IMU measurements through tight coupling, optimizing
for obtaining precise pose of the image or camera. Results: The proposed SLAM method has been validated
and tested on publicly available benchmarks, showcasing its performance through ablation experiments and
qualitative as well as quantitative comparative analyses against several state-of-the-art visual-inertial
SLAM algorithms. Conclusions: The results indicate that the proposed algorithm improves average localiza‑
tion accuracy by at least 12% and displays significant robustness.
Key words： SLAM； adaptive point-line features； IMU； loose coupling, tight coupling

当前，即时定位与建图（simultaneous localiza‑ 限于实验室场景。当相机置身于低/弱纹理、光
tion and mapping， SLAM）被认为是解决未知环照变化的场景中，或者受图像质量、平台快速旋
［1］
境下机器人感知和定位问题的有效手段之一。转移动等因素影响时，视觉特征的关联准确性和
SLAM 这一概念自 1986 年在美国旧金山举行的鲁棒性会下降，这直接导致视觉 SLAM 算法定位
机器人与自动化会议上提出以来，由于其重要的精度的退化。
理论和应用价值，被认为是实现机器人自主性的惯性测量单元（inertial measurement unit，
关键。SLAM 作为机器人实际生产和生活应用 IMU）不受上述外界因素的影响，其与视觉相机
（如定位导航、自动驾驶、仓储物流等）中的核心的数据互补性显现出显著的优势。近年来，通过
基础［2-3］，对其进行深入研究具有重要意义。此耦合 IMU 和相机的视觉惯导定位导航算法（visual
外，作为近景摄影测量学的重要研究领域， inertial navigation system，VINS）得到广泛关注，
SLAM 的发展受社会信息化和智能化浪潮的推并发展成为多传感器耦合 SLAM 领域研究的重
动，也极大地促进了测绘科学技术向智能化方向点和热点。视觉惯导耦合的方式分主要为紧耦
的转型升级。合和松耦合两类。松耦合是指 IMU 和相机分别
［4］
SLAM 是指机器人搭载特定传感器，在外界估计自身运动状态，再对两者的位姿估计结果融
环境先验信息未知的情况下，通过构建环境感知合，该方法算法复杂度较低；紧耦合则是基于
测量模型并估计自身位姿，同时实现场景地图的 IMU 和相机测量数据共同构建运动和观测方程，
［5］
构建。相机具有成本低、体积小以及能实时捕并联合完成状态估计。与松耦合相比，紧耦合的
捉场景中结构纹理信息等优势，因此 SLAM 早期计算复杂度较高，在定位精度和鲁棒性方面具有
算法基本是以视觉图像作为数据源输入逐渐发明显的优势［15］。早期紧耦合多是通过滤波来实
展起来的，并在国内外取得显著进展。典型的代现的，比如采用扩展的卡尔曼滤波 MSCKF 算
表算法如西班牙萨拉戈萨大学的 ORB- 法［16］，该算法在标准扩展的卡尔曼滤波框架下采
［6-7］［8］
SLAM 、苏黎世联邦理工学院的 DSO 、慕尼用滑动窗口来进行姿态估计。基于优化的紧耦
［9］
黑工业大学的 DVO 和 LSD-SLAM ［10］等。国内合算法是将传感器的测量数据建模为一个非线
前沿研究有浙江大学计算机视觉组提出的 RK- 性问题，并用历史图像帧来进行优化求解，是视
SLAM ［11］、上海交通大学感知导航研究所结合场觉惯导最主要的耦合方式。OKVIS ［17］是最早一
景结构信息提出的 Struct-SLAM ［12］、中山大学数个基于非线性优化的视觉惯导 SLAM，算法构建
据科学与计算机学院的 RGB-T SLAM ［13］、香港非线性函数通过滑动窗口优化相机位姿，但缺少
科技大学机器人研究所基于事件相机的 ES‑ 闭环检测，容易导致运动轨迹的误差累积；后续
VO ［14］等相关的经典（state of the art，SOTA）推出的 OKVIS2 具有回环检测功能，并在开源数
SLAM 算法。然而，这些视觉 SLAM 方案多数受据基准 EuRoC 上测试验证，结果表明其精度和稳

109 110 111 112 113 114 115 116 117 118 119