复杂视觉退化工况下带式输送机大块物料鲁棒检测方法

Robust detection method for large lump material on belt conveyors under complex visual degradation conditions

  • 摘要: 带式输送机运行过程中输送带上夹杂的大块物料易引发转载点堵塞和输送带损伤,对系统安全运行构成潜在威胁。针对井下复杂工况下基于视觉的大块物料检测模型易出现漏检和误检的问题,提出了一种复杂视觉退化工况下带式输送机大块物料鲁棒检测方法。基于成像退化机理,构建检测驱动的自适应图像增强模块,将可学习卷积单元嵌入目标检测网络前端,以替代传统Retinex中依赖人工参数设定的显式建模过程,实现退化图像的自适应增强。该机制在检测损失函数约束下完成端到端联合优化,无需人为设定增强参数或区分退化类型。选用YOLOv11作为基础检测框架,引入自适应图像增强模块,并使用SIoU损失函数,利用更强几何约束的边界框回归策略对检测性能进行辅助优化。实验结果表明:改进YOLOv11模型在雾气、低照度及其混合工况下均具有良好的检测稳定性与鲁棒性;与原始YOLOv11模型相比检测精度提升了1.3%,在退化图像中的漏检现象明显减少;在模型复杂度与实时性方面,自适应图像增强模块仅引入0.1×106个额外参数和6.3 GFLOPs的计算开销;改进YOLOv11模型对图像的推理速度达106 帧/s,对视频的推理速度达42 帧/s,表明所提方法在保证检测精度提升的同时,仍具备良好的实时性。

     

    Abstract: Large lump material mixed on conveyor belts during belt conveyor operation can easily cause transfer point blockage and conveyor belt damage, posing a potential threat to safe system operation. To address the missed and false detections that may occur in vision-based large lump material detection models under complex underground conditions, a robust detection method for large lump material on belt conveyors under complex visual degradation conditions was proposed. Based on the imaging degradation mechanism, a detection-driven adaptive image enhancement module was constructed. A learnable convolutional unit was embedded at the front end of the object detection network to replace the explicit modeling process in traditional Retinex that depended on manually set parameters, thereby realizing adaptive enhancement of degraded images. This mechanism completed end-to-end joint optimization under the constraint of the detection loss function and did not require manual setting of enhancement parameters or distinction between degradation types. YOLOv11 was selected as the baseline detection framework, the adaptive image enhancement module was introduced, and the SIoU loss function was used to assist detection performance optimization through a bounding box regression strategy with stronger geometric constraints. Experimental results showed that the improved YOLOv11 model had good detection stability and robustness under foggy, low-illumination, and mixed conditions. Compared with the original YOLOv11 model, detection accuracy was improved by 1.3%, and missed detections in degraded images were significantly reduced. In terms of model complexity and real-time performance, the adaptive image enhancement module introduced only 0.1×106 additional parameters and 6.3 GFLOPs of computational overhead. The inference speed of the improved YOLOv11 model reached 106 frames/s for images and 42 frames/s for videos, indicating that the proposed method maintained good real-time performance while improving detection accuracy.

     

/

返回文章
返回