基于DRCA−GCN的矿工动作识别模型

Miner action recognition model based on DRCA-GCN

  • 摘要: 井下“三违”行为给煤矿生产带来严重安全隐患,提前感知并预防井下工作人员的不安全动作具有重要意义。针对因煤矿监控视频质量不佳导致基于图像的动作识别方法准确率受限的问题,构建了基于密集残差和组合注意力的图卷积网络(DRCA−GCN),提出了基于DRCA−GCN的矿工动作识别模型。首先利用人体姿态识别模型OpenPose提取人体关键点,并对缺失关键点进行补偿,以降低因视频质量不佳造成关键点缺失的影响,然后利用DRCA−GCN识别矿工动作。DRCA−GCN在时空初始图卷积网络(STIGCN)基础上引入组合注意力机制和密集残差网络:通过组合注意力机制提升模型中每个网络层对重要时间序列、空间关键点和通道特征的提取能力;通过密集残差网络对提取的动作特征进行信息补偿,加强各网络间的特征传递,进一步提升模型对矿工动作特征的识别能力。实验结果表明:① 在公共数据集NTU−RGB+D120上,以Cross-Subject(X−Sub)和Cross-Setup(X−Set)作为评估协议时,DRCA−GCN的识别精度分别为83.0%和85.1%,相比于STIGCN均提高了1.1%,且高于其他主流动作识别模型;通过消融实验验证了组合注意力机制和密集残差网络的有效性。② 在自建矿井人员动作(MPA)数据集上,进行缺失关键点补偿后,DRCA−GCN对下蹲、站立、跨越、横躺和坐5种动作的平均识别准确率由94.2%提升到96.7%;DRCA−GCN对每种动作的识别准确率均在94.2%以上,与STIGCN相比,平均识别准确率提升了6.5%,且对相似动作不易误识别。

     

    Abstract: The underground "three violations" behavior brings serious safety hazards to coal mine production. It is of great significance to perceive and prevent unsafe actions of underground personnel in advance. The poor video quality in coal mine monitoring leads to limited accuracy of image based action recognition methods. In order to solve the above problem, a dense residual and combined attention-graph convolutional network (DRCA-GCN) is constructed. A miner action recognition model based on DRCA-GCN is proposed. Firstly, the human pose recognition model OpenPose is used to extract human key points. The missing key points are compensated to reduce the impact of missing key points caused by poor video quality. Secondly, DRCA-GCN is used to identify the miner actions. DRCA-GCN introduces a combined attention mechanism and a dense residual network on the basis of the spatio-temporal inception graph convolutional network (STIGCN). By using the combined attention mechanism, the capability of each network layer in the model to extract important time series, spatial key points and channel features is enhanced. By using the dense residual network to compensate for the extracted action features, the feature transmission between different networks is strengthened. It further enhances the model's recognition capability for miner action features. The experimental results indicate the following points. ① On the public dataset NTU-RGB+D120, when using Cross-Subject(X-Sub) and Cross-Setup(X-Set) as evaluation protocols, the recognition precision of DRCA-GCN is 83.0% and 85.1%, respectively. It is 1.1% higher than the precision of STIGCN, and higher than other mainstream action recognition models. The effectiveness of the combined attention mechanism and dense residual network is verified through ablation experiments. ② After compensating for missing key points, on the self built mine personnel action (MPA) dataset, the average recognition accuracy of DRCA-GCN for squatting, standing, crossing, lying down and sitting movements increases from 94.2% to 96.7%. The recognition accuracy of DRCA-GCN for each type of action is above 94.2%. Compared with STIGCN, the average recognition accuracy has been improved by 6.5%. It is not likely to misrecognize similar actions.

     

/

返回文章
返回