基于CenterNet−GhostNet的选煤厂危险区域人员检测

张翼翔; 林松; 李雪

doi:10.13272/j.issn.1671-251x.2021080058

基于CenterNet−GhostNet的选煤厂危险区域人员检测

Personnel detection in dangerous area of coal preparation plant based on CenterNet-GhostNet

摘要

摘要: 选煤厂危险区域人员全身目标检测因粉尘、雾气干扰难以准确与生产环境背景区分，而人员头部特征相对易于辨识，且人头在监控视角下被遮挡的可能性较低，因此危险区域人员检测使用人员头部检测代替人员全身目标检测。目前基于深度学习的轻量化目标检测模型在特征提取时信息损失多，对人头目标的检测能力有限。针对该问题，提出了轻量化人员检测模型CenterNet−GhostNet。该模型以CenterNet网络为基础框架，将轻量化网络GhostNet与特征金字塔相结合作为特征提取网络，GhostNet对输入图像进行特征提取，提升网络特征表达能力，特征金字塔对GhostNet提取的不同分辨率的特征图中包含的信息进行融合，在提取高层语义特征的同时保留较多的细节信息；在较高分辨率的单个输出特征图上使用3个相互独立的卷积操作分支进行解码计算，以充分利用特征图包含的细节信息。实验结果表明：CenterNet−GhostNet模型对佩戴安全帽和未佩戴安全帽两类人头目标的检测精度分别为93.7%和91.7%，均优于通用的轻量化模型SSD−MobileNet、YOLOv4 Tiny和CenterNet−Res18；CenterNet−GhostNet模型部署在NVIDIA Jetson Nano上的单帧检测耗时为67 ms，满足选煤厂危险区域人员高精度、实时检测要求。

Abstract: Due to the dust and fog interference, it is difficult to distinguish accurately the whole body target of personnel in dangerous areas of coal preparation plant from the production environment background. Moreover, the head features of personnel are relatively easy to be identified, and the possibility of head being blocked in the monitoring perspective is low. Therefore, the head detection of personnel in dangerous areas is used instead of the whole body target detection of personnel. At present, the lightweight target detection model based on deep learning has a lot of information loss in feature extraction, and its detection capability of human head target is limited. In order to solve this problem, a lightweight personnel detection model CenterNet-GhostNet is proposed. The model takes CenterNet network as the basic framework, and combines the lightweight network GhostNet and the feature pyramid network(FPN) as the feature extraction network. GhostNet extracts the features of the input image and improves the network feature expression capability. And the FPN fuses the information contained in the feature maps with different resolutions extracted by GhostNet, so that more detailed information is reserved while extracting the high-level semantic features. Three independent convolution operation branches are used to decode and calculate the single output feature map with higher resolution, so as to make full use of the detailed information contained in the feature map. The experimental results show that the detection precision of CenterNet-GhostNet model is 93.7% and 91.7% respectively for the two types of head targets with and without helmet, which are better than the general lightweight models SSD-MobileNet, YOLOv4 Tiny and CenterNet-Res18. The single frame detection time of CenterNet-GhostNet model deployed on NVIDIA Jetson Nano is 67 ms, which meets the requirements of high-precision and real-time detection of personnel in dangerous areas of coal preparation plant.

HTML全文

参考文献(14)

施引文献

资源附件(0)