改进YOLOv11s的无人机航拍小目标检测算法

Improved YOLOv11s Algorithm for Small Target Detection in UAV Aerial Imagery

  • 摘要: 为解决无人机航拍场景中小目标易出现漏检、误检的检测难题,提升小目标检测精度与实时性,本文提出一种基于YOLOv11s的全链路协同优化检测算法。算法构建多尺度特征提取与跨尺度融合协同模块,通过嵌入MEESA(multi-scale edge enhancement module)强化小目标边缘细节与局部特征,依托FSFP(focused small target feature strengthening pyramid)完成轻量化跨尺度特征交互,弥补深浅层特征信息缺失;优化网络特征交互与检测预测机制,采用AIFI-RepBN(adaptive interactive feature interaction-reparameterized batch normalization)替换传统SPPF模块,提升特征交互稳定性与推理效率,通过Detect-LQE(detection-localization quality estimation)机制修正检测头独立预测缺陷,实现定位与类别得分的关联优化;同时设计Inner-EIoU(inner-enhanced intersection over union)损失函数,优化小目标边框定位效果与模型收敛速度。在VisDrone2019数据集上的测试结果表明,优化后算法的mAP@0.5与mAP@0.5:0.95指标较原始YOLOv11s基准模型分别提升2.5%、1.6%,且参数量维持11×106的轻量化状态。所提算法有效解决了无人机航拍小目标检测漏误检问题,兼顾检测精度与推理效率,能够适配无人机交通管理、应急救援等实际场景的实时检测需求。

     

    Abstract: To address the challenges of missed detection and false detection for small objects in unmanned aerial vehicle (UAV) aerial imagery and improve the detection accuracy and real-time performance of small-object detection, this paper proposes a full-link collaborative optimization detection algorithm based on YOLOv11s. A collaborative module for multi-scale feature extraction and cross-scale feature fusion is constructed in the proposed algorithm. The Multi-scale Edge Enhancement Module (MEESA) is embedded to strengthen the edge details and local features of small objects, and the Focused Small Target Feature Strengthening Pyramid (FSFP) module is adopted to implement lightweight cross-scale feature interaction and compensate for the missing feature information between shallow and deep network layers. Furthermore, the feature interaction and prediction mechanism of the network are optimized: the conventional SPPF module is replaced with the AIFI-RepBN (adaptive interactive feature interaction-reparameterized batch normalization) module to enhance the stability of feature interaction and inference efficiency; the Detect-LQE (detection-localization quality estimation) mechanism is introduced to rectify the independent prediction defects of the detection head and realize correlated optimization for bounding box localization and classification confidence scores. Meanwhile, an Inner-EIoU (inner-enhanced intersection over union) loss function is designed to refine the bounding box localization performance of small objects and accelerate model convergence. Experimental results on the VisDrone2019 dataset demonstrate that compared with the original YOLOv11s baseline model, the optimized algorithm achieves 2.5% and 1.6% improvements in mAP@0.5 and mAP@0.5:0.95 respectively, while keeping a lightweight parameter size of 11×106. The proposed algorithm effectively mitigates the missed and false detection issues of small objects in UAV aerial images with balanced detection precision and inference speed, which can satisfy the real-time detection requirements of practical application scenarios including UAV-based traffic management and emergency rescue.

     

/

返回文章
返回