基于改进YOLOv5的煤矸识别研究

张释如; 黄综浏; 张袁浩; 章鳌; 季亮

doi:10.13272/j.issn.1671-251x.2022060052

基于改进YOLOv5的煤矸识别研究

Coal and gangue recognition research based on improved YOLOv5

摘要

摘要: 现有基于深度学习的煤矸识别方法应用于井下复杂环境中时易出现误检和漏检情况，且对小目标煤矸的识别精度低。针对该问题，提出一种改进YOLOv5模型，并基于该模型实现煤矸识别。对采集的煤与矸石数据进行数据增强，以丰富数据集，提高数据利用率；在空间金字塔池化（SPP）模块中引入空洞卷积和残差块，得到残差ASPP模块，可在不损失图像信息的前提下，增大卷积输出感受野，强化模型对深层特征的提取；采用AdaBelief优化算法代替YOLOv5原有的Adam优化算法，提高模型的收敛速度与识别精度。实验结果表明：AdaBelief优化算法和残差ASPP模块可有效提高YOLOv5模型的精确率、召回率和平均精度均值（mAP）；改进YOLOv5模型的mAP达到94.43%，比原始YOLOv5模型提高了2.27%，帧率降低了0.03 帧/s，性能优于SSD，Faster R−CNN，YOLOv3，YOLOv4等主流目标检测模型；在极端黑暗的环境中，改进YOLOv5模型也能准确划定目标边界，识别效果优于其他改进YOLOv5模型。

Abstract: The existing deep learning-based coal and gangue recognition methods are prone to false detection and missed detection when applied to underground complex environments. The recognition precision of small target coal and gangue is low. In order to solve this problem, an improved YOLOv5 model is proposed, and coal and gangue recognition is realized based on that model. Data enhancement is carried out on the collected coal and gangue data to enrich the data set and improve the data utilization rate. The atrous convolution and residual block are introduced into the spatial pyramid pooling (SPP) module to obtain the residual ASPP module. On the premise of not losing image information, the convolution output receptive field can be increased to enhance the extraction of deep features from the model. The AdaBelief optimization algorithm is used to replace the original Adam optimization algorithm of YOLOv5 to improve the convergence speed and recognition precision of the model. The experimental results show that the AdaBelief optimization algorithm and residual ASPP module can effectively improve the precision, recall rate and mean average precision (mAP) of the YOLOv5 model. The mAP of the improved YOLOv5 model reaches 94.43%, which is 2.27% higher than that of original YOLOv5 model. The frame rate is reduced by 0.03 frames/s. The performance of the improved YOLOv5 model is superior to SSD, Faster R-CNN, YOLOv3, YOLOv4 and other mainstream target detection models. In extremely dark environments, the improved YOLOv5 model can also accurately delineate the target boundary, and the recognition effect is better than other improved YOLOv5 models.

HTML全文

参考文献(16)

施引文献

资源附件(0)