Page 56 - 《武汉大学学报（信息科学版）》2025年第6期

P. 56

1078 武汉大学学报（信息科学版） 2025 年 6 月

low-level features, thereby mitigating the impact of background noise on the latter. Furthermore, it facili‑
tates a more comprehensive integration of the semantic information of high-level features and the detailed
information of low-level features. This integration significantly enhances the capacity of network to distin‑
guish between background and foreground information in UAV images. Finally, a context-aware module is
designed to address the issue of limited feature information associated with small objects. This module inte‑
grates environmental information surrounding the object, including local features, contextual information,
and global context, thus enhancing the contextual features of small objects and improving the final object
detection accuracy. Results: To verify the effectiveness of the proposed method, case experiment analysis
is conducted on public datasets. The proposed method achieves mean average precision (mAP) of 68.9%,
with precision of 75.5% and recall of 67.7%. Compared to the conventional object detection algorithms,
the proposed method shows improvements ranging from 3.1% to 29.5% in mAP, from 0.9% to 8.6% in
precision, and from 1.0% to 57.9% in recall. The results indicate that the proposed method exhibits a high
level of accuracy. Additionally, the generalization and applicability are confirmed through testing on UAV
images in various scenarios and under different weather conditions. Conclusions: The proposed method can
effectively solve the challenge of detecting small persons in UAV images and significantly improve the
detection accuracy. It has a wide range of application prospects in the fields of emergency rescue and social
security, and possesses good generalization ability for UAV image detection tasks in different environ‑
ments.
Key words： UAV imagery； small object； person detection； selective feature fusion； context-aware

随着无人机遥感技术的快速发展，其凭借测结果。单阶段检测算法，如 SSD ［15］和 YO‑
快速响应、广域覆盖和全景视角等独特优势， LO ［16］，能够直接对目标进行定位，输出目标的
在紧急搜救和执法追踪等领域正发挥着日益类别检测信息。小目标由于占比面积小、分辨
重要的作用［1-4］。通过将无人机技术与先进的率低、可用特征少，其检测精度相比大中目标有
人员检测算法相结合，能够在复杂多变的环境所下降。因此，学者们提出了多种改进方案，文
中实时获取精确的感知数据。在自然灾害发献［17-19］利用多尺度特征融合以保留更多小目
生后的搜救行动中，无人机能够迅速覆盖整个标特征；文献［20-21］通过上下文学习来更好地
灾区，及时发现并定位被困人员的位置［5-6］。同
理解小目标与周围环境的关系；文献［22-23］采
样，在执法行动中，无人机展现出高效追踪逃
用注意力机制以提高对小目标的关注；文献
犯或失踪人员的能力，极大地增强了执法部门
［24-25］通过超分辨率技术解决小目标特征不足
的监控与应对能力［7］。这不仅显著提升了任务
的问题。与其他视角相比，无人机视角下的广
执行的效率和准确性，还有效降低了人力成本
阔视场提供了丰富的信息，但也伴随着更为复
和操作风险，因此，高效的无人机人员检测技
杂的背景和更多噪声的干扰。同时，无人机影
术对于加强公共安全和提升应急救援能力具
像中的目标尺度变化大，且小目标的比例远高
有重要意义。
于自然场景图像。
人员检测由于其应用的广泛性已成为计算
针对上述问题，本文提出面向小目标的无人
机视觉领域中的一个重要研究方向［8-10］。传统
机影像人员检测方法，利用空间深度转换卷积
的人员检测方法如方向梯度直方图（histogram
of oriented gradient， HOG）［11］和支持向量机（space-to-depth convolution，SPD-Conv）提高检
（support vector machine，SVM）［12］，首先通过手测过程中对于小目标的保持能力，设计选择性特
工设计特征描述算子提取图像特征，然后通过征融合模块，更好地融合高层特征的语义信息和
分类器检索图像内的人员目标。基于深度学习低层特征的细节信息，提高对于背景的抗干扰能
的人员检测算法可以分为两阶段和单阶段检测力，通过上下文感知模块综合局部、全局及周围
算法。两阶段检测算法，如 Faster R-CNN ［13］和上下文，增强网络对于小目标人员的特征提取能
Cascade R-CNN ［14］，先通过区域提议网络选取力，提升无人机影像小目标人员检测的准确性与
生成候选区域，再对其进行分类和回归得到检泛化性。

51 52 53 54 55 56 57 58 59 60 61