基于多尺度注意力融合的无人机海上溢油分割与校准

罗金格; 闫敬; 曹文强; 杨晛; 严雪飞

doi:10.13976/j.cnki.xk.2025.3413

基于多尺度注意力融合的无人机海上溢油分割与校准

Marine Oil Spill Segmentation and Calibration from UAV Imagery via Multi-Scale Attention Fusion

摘要

摘要: 为提升无人机（UAV）影像中海上溢油的分割精度，提出一种基于多尺度注意力融合的分割与校准方法。首先，利用 YOLOv5 对高分辨率影像进行快速定位与裁剪；随后，构建多尺度注意力网络（MSA-UNet），通过融合空间金字塔池化的全局语义信息与卷积注意力的局部细节提升特征表达能力；进一步，引入色调饱和度区域原型引导校准机制（HSV-PGRM），对油、水及其他类别进行重标定，以减轻光照与云雾干扰。实验结果显示，在De Kerf港口数据集上，本方法的平均交并比（mIoU）达到68.89%，F1分数为74.08%，其中F1分数相比 U-Net 基线模型提升3%；在独立湖面测试集上本方法也表现出较好的特征识别能力。

Abstract: The segmentation accuracy of marine oil spills from unmanned aerial vehicle (UAV) images is aimed to be enhanced. A segmentation and calibration method based on multi-scale attention fusion is proposed to achieve this. First, high-resolution images are rapidly detected and cropped using YOLOv5. Then, a multi-scale attention U-shaped network (MSA-UNet) is constructed, which integrates global semantic information from spatial pyramid pooling and local details from convolutional attention to enhance feature representation. Furthermore, the segmentation results for oil, water, and other classes are refined through the proposed hue-saturation-value region prototype-guided calibration mechanism (HSV-PGRM), which reduces interference from illumination variations and cloud cover. Experimental results on the De Kerf port dataset show that the proposed method achieves a mean intersection over union (mIoU) of 68.89% and an F1-score of 74.08%, with the F1-score improving by 3% over the U-shaped network (U-Net) baseline model. Good feature recognition capability is also demonstrated on an independently collected lake surface test set.

HTML全文

参考文献(22)

施引文献

资源附件(0)