基于改进YOLOv11的煤矿井下人员安全帽佩戴检测方法

Safety helmet wearing detection method for underground coal mine personnel based on improved YOLOv11

  • 摘要: 煤矿井下环境中存在光照不均、粉尘弥漫、多重遮挡及背景纹理复杂等因素,导致基于计算机视觉技术的人员安全帽佩戴检测精度低,且井下防爆边缘计算终端算力受限,对检测模型大小及推理效率要求较高。针对该问题,提出一种融合频域增强与高效轻量化机制的改进YOLOv11模型:在YOLOv11主干网络引入基于小波卷积的C3k2_WTConv模块,利用多分辨率分析解耦光照噪声与纹理,增强特征提取能力;在颈部网络构建基于GSConv轻量化算子与VoVGSCSP拓扑的SlimNeck特征融合网络,在降低计算冗余的同时,维持跨尺度特征交互;设计参数共享的Detect_Efficient检测头,以提高推理效率;采用基于指数移动平均的参数平滑与多源域适应迁移学习策略,解决井下极端工况违规样本稀缺难题,增强模型在非结构化环境中的跨域泛化能力与鲁棒性。采用改进YOLOv11模型进行煤矿井下人员安全帽佩戴检测,通过CUMT−Helmet与DsLMF+Helmet数据集验证了该模型的mAP@0.5达97.5%,优于主流的单阶段检测模型YOLOv7−tiny,YOLOv8s,YOLOv11及改进模型MH−YOLO,WAM−YOLO,单帧耗时仅为10.2 ms,并在强光干扰、远距离目标尺度微小、目标动态模糊等极端工况下展现出更高的置信度与更低的漏检率。

     

    Abstract: Uneven illumination, diffuse dust, multiple occlusions, and complex background textures in underground coal mine environments lead to low accuracy in safety helmet wearing detection for personnel based on computer vision technology, while the limited computing power of underground explosion-proof edge computing terminals imposes high requirements on detection model size and inference efficiency. To address this problem, an improved YOLOv11 model integrating frequency-domain enhancement and efficient lightweight mechanisms was proposed. A C3k2_WTConv module based on wavelet convolution was introduced into the YOLOv11 backbone, and multiresolution analysis was used to decouple illumination noise and texture, thereby enhancing feature extraction capability. A SlimNeck feature fusion network based on the GSConv lightweight operator and VoVGSCSP topology was constructed in the neck network to reduce computational redundancy while maintaining cross-scale feature interaction. A parameter-sharing Detect_Efficient detection head was designed to improve inference efficiency. Parameter smoothing based on exponential moving average and a multi-source domain adaptation transfer learning strategy were adopted to solve the scarcity of violation samples under extreme underground working conditions, enhancing the cross-domain generalization capability and robustness of the model in unstructured environments. The improved YOLOv11 model was used for safety helmet wearing detection of underground coal mine personnel. Verification on the CUMT−Helmet and DsLMF+Helmet datasets showed that the mAP@0.5 of the model reached 97.5%, outperforming mainstream single-stage detection models YOLOv7-tiny, YOLOv8s, and YOLOv11 and improved models MH−YOLO and WAM−YOLO; the single-frame processing time was only 10.2 ms, and the model exhibited higher confidence and lower missed detection rates under extreme working conditions such as strong light interference, small-scale distant targets, and dynamic target blur.

     

/

返回文章
返回