YOLOv8-FT：一种融合高效卷积与多尺度注意力的轻量化关键点检测模型

莫宛霖; 华云松; 王伟

doi:10.13976/j.cnki.xk.2025.4233

YOLOv8-FT：一种融合高效卷积与多尺度注意力的轻量化关键点检测模型

YOLOv8-FT: A Lightweight Keypoint Detection Model Integrating Efficient Convolution and Multi-scale Attention

摘要

摘要: 针对现有轻量化姿态估计模型在足踝等精细关键点检测精度不足的问题，提出一种改进的轻量化检测模型YOLOv8-FT。模型以YOLOv8n-pose为基线，核心改进包括：引入基于部分卷积的FasterNet模块，降低模型复杂度；嵌入高效多尺度注意力机制（EMA），增强对细微特征的提取能力；设计HRNet_FT多分辨率特征融合网络，优化足踝关键点特征的保留与融合。在自建足踝关键点检测数据集上，YOLOv8-FT的平均精度（mAP50-95）达到85.2%，较基线提升11.6%，踝关节角度测量误差小于3°。实验结果表明，该模型在保持轻量化与实时性的同时，显著提升足踝关键点检测精度，可在复杂运动场景下实现稳定跟踪，为移动端运动分析与康复评估提供了一种兼顾精度与效率的轻量化解决方案。

Abstract: To address the issue of insufficient detection accuracy of lightweight pose estimation models for fine-grained keypoints such as the ankle joint, we propose an improved lightweight keypoint detection model, YOLOv8-FT. Built upon the YOLOv8n-pose baseline, the model incorporates three core improvements: introducing the FasterNet module based on partial convolution to reduce model complexity; embedding the efficient multi-scale attention (EMA) mechanism to enhance the extraction of subtle features; designing the HRNet_FT multi-resolution feature fusion network to optimize the retention and fusion of ankle keypoint features. Evaluated on a self-constructed ankle keypoint detection dataset, YOLOv8-FT achieves an average precision (mAP50-95) of 85.2%, an 11.6% improvement over the baseline, with an ankle joint angle measurement error of less than 3°. Experimental results demonstrate that the model significantly improves the detection accuracy of ankle keypoints while maintaining lightweight and real-time performance, enabling stable tracking in complex motion scenes. It provides a lightweight solution that balances accuracy and efficiency for mobile motion analysis and rehabilitation assessment.

HTML全文

参考文献(21)

施引文献

资源附件(0)