基于YOLOv8n-ASAM的井下复杂场景人员多目标检测

Multi-target detection of personnel in complex underground scenarios based on YOLOv8n-ASAM

  • 摘要: 为了解决煤矿井下监控视频存在的光照不均匀、目标尺度不一致、存在遮挡等复杂情况,提出一种基于YOLOv8n-ASAM的井下人员多目标检测算法。首先,对YOLOv8n的Neck层进行改进,添加MultiSEAM注意力机制,以增强对遮挡目标的检测性能;其次,在C2f模块中引入MLCA注意力机制,构建C2f-MLCA模块,以融合局部和全局特征信息,提高特征表达能力;再次,在Head层检测头中嵌入了自适应空间特征融合ASFF模块,以增强对小尺度目标的检测性能;最后,在煤矿井下生产监控视频数据集上对YOLOv8n-ASAM进行验证。实验结果表明:相较于Faster R-CNN、SSD、RT-DETR、YOLOv5s、YOLOv7、YOLOv8n、YOLOv8s等对比模型,YOLOv8n-ASAM模型在准确率、mAP50与mAP50-95指标上取得了较好的效果,比YOLOv8n分别提升了5.3%、1.8%和1.1%;说明YOLOv8n-ASAM模型具有较好的鲁棒性,在小尺度目标和遮挡目标上表现出良好的检测效果,在光照不均和多目标稀疏分布情况下具有更高的检测置信度分数。

     

    Abstract: In response to the complex conditions of uneven lighting, inconsistent target scales, and occlusions in underground coal mine monitoring videos, a multi-target detection algorithm based on YOLOv8n-ASAM is proposed. First, the Neck layer of YOLOv8n is improved by adding a MultiSEAM attention mechanism to enhance the detection performance for occluded targets. Secondly, the MLCA attention mechanism is introduced into the C2f module, constructing the C2f-MLCA module to fuse local and global feature information, thereby improving feature expression capability. Additionally, an Adaptive Spatial Feature Fusion (ASFF) module is embedded in the detection head of the Head layer to enhance the detection performance for small-scale targets. Finally, the YOLOv8n-ASAM model is validated on a coal mine underground production monitoring video dataset. Experimental results show that compared to the Faster R-CNN, SSD, RT-DETR, YOLOv5s, YOLOv7, YOLOv8n, and YOLOv8s models, the YOLOv8n-ASAM model achieves better performance in accuracy, mAP50, and mAP50-95 metrics. Specifically, compared to the YOLOv8n model, the YOLOv8n-ASAM model improves accuracy, mAP50, and mAP50-95 by 5.3%, 1.8%, and 1.1%, respectively. The YOLOv8n-ASAM model demonstrates good robustness, exhibiting excellent detection results for small-scale and occluded targets, and achieving higher detection confidence scores in conditions of uneven lighting and sparse multi-target distribution.

     

/

返回文章
返回