基于改进YOLOv8n的煤矿井下受限场景目标检测算法

An improved YOLOv8n-based object detection algorithm for constrained scenarios in underground coal mines

  • 摘要: 在煤矿井下受限场景中由于目标尺度变化复杂、目标部分被遮挡和有效特征提取困难,导致目标检测精度低。针对上述问题,提出一种基于改进YOLOv8n的煤矿井下受限场景目标检测算法。在主干特征提取网络采用感受野注意力卷积(RFAConv),更好地处理受限环境下的目标空间位置信息,并根据特征的重要性动态调整权重,从而更关注目标的关键特征;在颈部网络引入高效多尺度注意力(EMA)模块,融合不同尺度的特征信息,提高了对尺度变化目标的检测精度;将新型可变形卷积(DCNv3)与动态检测头(Dynamic Head)结合,通过将尺度感知注意力、空间感知注意力和任务感知注意力相统一,有助于模型关注空间尺度信息和适应不同的检测任务,提高了对多尺度目标和部分被遮挡目标的检测能力;引入考虑预测框权重分配的Unified−IOU(U−IOU)损失函数,通过动态调整在不同质量预测框上的关注度,使模型更专注于高质量预测框,提高模型的收敛速度和精度。实验结果表明,针对CUMT−BelT数据集,改进YOLOv8n在煤矿井下输送带异物检测中的mAP@0.5相较于YOLOv8n提高了5.6%;针对DsLMF数据集,改进YOLOv8n在不同综采工作面作业场景下的总体mAP@0.5相较于YOLOv8n提高了4.8%,有效减少了误检和重复检测的情况。

     

    Abstract: In constrained scenarios of underground coal mines, object detection accuracy is low due to complex object scale variation, partial occlusion of objects, and difficulty in extracting effective features. To address these problems, an improved YOLOv8n-based object detection algorithm for constrained scenarios in underground coal mines was proposed. In the backbone feature extraction network, Receptive Field Attention Convolution (RFAConv) was adopted to better process the spatial location information of objects in constrained environments and was used to dynamically adjust weights according to the importance of features, thereby focusing more on the key features of objects. In the neck, the Efficient Multiscale Attention (EMA) module was introduced to fuse feature information at different scales, which improved the detection accuracy of objects with scale variation. The new Deformable Convolutional Networks v3 (DCNv3) and Dynamic Head were combined, integrating scale-aware attention, spatial-aware attention, and task-aware attention, which helped the model focus on spatial scale information and adapt to different detection tasks, thus enhancing the detection ability for multi-scale and partially occluded objects. A Unified-IoU (U-IoU) loss function that considers the weight distribution of prediction boxes was introduced. By dynamically adjusting the attention on prediction boxes of different qualities, the model focused more on high-quality prediction boxes, improving the convergence speed and accuracy. Experimental results showed that the improved YOLOv8n achieved a 5.6% improvement in mAP@0.5 compared with YOLOv8n in underground coal mine conveyor belt foreign object detection for the CUMT-BelT dataset; in different fully mechanized mining face operation scenarios, the overall mAP@0.5 increased by 4.8% compared with YOLOv8n for the DsLMF dataset, effectively reducing false detections and duplicate detections.

     

/

返回文章
返回