WANG Gechen, YAN Yuhan, LIU Xiaowen, et al. Research on visual semantic method of mine personnel behavior[J]. Industry and Mine Automation, 2021, 47(5): 40-45. doi: 10.13272/j.issn.1671-251x.17775

Research on visual semantic method of mine personnel behavior

doi: 10.13272/j.issn.1671-251x.17775
  • Publication Date: 2021-05-20
  • Detecting personnel behavior in underground coal mines is a focus of sensor mine construction. However, existing detection methods based on electromagnetic waves, wearable devices and computer vision cannot combine time, location, behavior, environment and other factors to judge whether the behavior of mine personnel is safe. This paper proposes a visual semantic method for mine personnel behavior that generates sentences describing personnel behavior in videos through four stages: feature extraction, semantic detection, feature reconstruction and decoding. The InceptionV4 network and the I3D network extract the static and dynamic features of the video images, respectively, and a parallel dual attention mechanism, built from a spatial position attention model and a channel attention model, is introduced into the InceptionV4 network to improve its feature extraction ability. To address the inconsistency between video content and visual semantics, a semantic detection network adds high-level semantic tags to the video features to generate embedded features. The embedded features are fed into the decoder together with the video features and semantic features, and a feature reconstruction module is introduced into the decoding process. Reconstructing the video features from the hidden-layer states of the decoder strengthens the correlation between the video features and the description sentences and improves the accuracy of visual semantic generation. Experiments on the public MSVD and MSR-VTT datasets and on a self-built mine video dataset show that the method has good semantic consistency, accurately captures the key semantics in the video, and better reflects the true meaning of the video.
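
The parallel dual attention idea mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes a feature map flattened to C channels by N spatial positions, omits the learned convolution projections that a real network would use, and all function names and the `gamma` weights are hypothetical. It only shows how a position branch and a channel branch can run in parallel on the same features and be summed.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def transpose(m):
    """Transpose a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def add_residual(out, x, gamma):
    """Return gamma * out + x, element by element."""
    return [[gamma * o + v for o, v in zip(o_row, x_row)]
            for o_row, x_row in zip(out, x)]


def channel_attention(x, gamma=1.0):
    """x is C x N. Each channel becomes a weighted mix of all channels."""
    attn = [softmax(r) for r in matmul(x, transpose(x))]  # C x C weights
    return add_residual(matmul(attn, x), x, gamma)


def position_attention(x, gamma=1.0):
    """x is C x N. Each position becomes a weighted mix of all positions."""
    xt = transpose(x)                                      # N x C
    attn = [softmax(r) for r in matmul(xt, x)]             # N x N weights
    return add_residual(transpose(matmul(attn, xt)), x, gamma)


def dual_attention(x, gamma_pos=1.0, gamma_chan=1.0):
    """Run both branches in parallel on the same input and sum the results."""
    pos = position_attention(x, gamma_pos)
    chan = channel_attention(x, gamma_chan)
    return [[p + c for p, c in zip(p_row, c_row)]
            for p_row, c_row in zip(pos, chan)]


# Toy feature map: 2 channels, 3 spatial positions (assumed values).
features = [[1.0, 0.0, 2.0],
            [0.5, 1.5, 0.0]]
fused = dual_attention(features)
```

The output keeps the C x N shape of the input, so in a real network it could be passed on to later layers unchanged. Setting both `gamma` values to zero switches the attention terms off and leaves only the two residual copies of the input.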

     

