YANG Jun. Aggregation enhanced coal-gangue video recognition model based on long and short-term storage [J]. Journal of Mine Automation,2023,49(3):39-44, 62. DOI: 10.13272/j.issn.1671-251x.18058
Citation: YANG Jun. Aggregation enhanced coal-gangue video recognition model based on long and short-term storage [J]. Journal of Mine Automation,2023,49(3):39-44, 62. DOI: 10.13272/j.issn.1671-251x.18058

Aggregation enhanced coal-gangue video recognition model based on long and short-term storage

More Information
  • Received Date: November 21, 2022
  • Revised Date: February 19, 2023
  • Available Online: March 26, 2023
  • Some key targets will be missed when using coal-gangue image recognition technology to recognize coal-gangue. Compared with the image target recognition model, the video target recognition model is closer to the requirements of the coal-gangue recognition and separation scene. The coal-gangue features in the video data can be extracted more widely and deeply. However, the influence of frame repetition, frame similarity and contingency of key frame on the model performance is not considered in the current coal-gangue video target recognition technology. In order to solve the above problems, this paper proposes an aggregation enhanced coal-gangue video recognition model based on long and short-term storage (LSS) model. Firstly, the key frames and non-key frames are used to screen the massive information. Multi-frame aggregation is carried out on the video frame sequence of the coal-gangue. The feature information of the key frame and the adjacent frame is aggregated through temporal relation networks (TRN), and a long-term video frame and a short-term video frame are established. The calculation amount of the model is reduced while the key feature information is not lost. Secondly, the feature weights among the long-term video frames, the short-term video frames and the keyframes are reallocated by using an attention mechanism that integrates semantic similarity weights, learnable weights and region of interest (ROI) similarity weights. Finally, the LSS module is designed to store the effective features of long-term video frames and short-term video frames. The module fuses them in the key frame recognition to enhance the characterization capability of the key frame features, so as to realize coal-gangue recognition. The model is tested based on the coal-gangue video data set in Zaoquan Coal Preparation Plant. The results show that in comparison with the memory enhanced global-local aggregation (MEGA) network, the flow-guided feature aggregation for video object detection (FGFA), the relation distillation networks (RDN) and deep feature flow for video recognition (DFF) model for video recognition, the mean average precision of the aggregation enhanced coal-gangue video recognition model based on LSS is 77.12 % and better than that of other models. The recognition precision of the modes is negatively correlated with the moving speed of the target in the video. The recognition precision of the model in this paper is 83.82% for the slow-moving target detection, and the performance is the best.
  • [1]
    SHARMA V,GUPTA M,KUMAR A,et al. Video processing using deep learning techniques:a systematic literature review[J]. IEEE Access,2021,9:139489-139507. DOI: 10.1109/ACCESS.2021.3118541
    [2]
    AICH A, ZHENG M, KAEANAM S, et al. Spatio-temporal representation factorization for video-based person re-identification[C]. International Conference on Computer Vision, Montreal, 2021: 152-162.
    [3]
    孙立新. 基于卷积神经网络的煤矸石识别方法研究[D]. 邯郸: 河北工程大学, 2020.

    SUN Lixin. Research on coal gangue recognition method based on convolutional neural network[D]. Handan: Hebei University of Engineering, 2020.
    [4]
    PAN Hongguang,SHI Yuhong,LEI Xinyu,et al. Fast identification model for coal and gangue based on the improved tiny YOLO V3[J]. Journal of Real-Time Image Processing,2022,19(3):687-701. DOI: 10.1007/s11554-022-01215-1
    [5]
    ZHU Xizhou, WANG Yujie, DAI Jifeng, et al. Flow-guided feature aggregation for video object detection[C]. IEEE International Conference on Computer Vision, Venice, 2017, 408-417.
    [6]
    张勇. 基于视频处理的煤矸石识别研究[D]. 徐州: 中国矿业大学, 2018.

    ZHANG Yong. Research on gangue identification based on video processing[D]. Xuzhou: China University of Mining and Technology, 2018.
    [7]
    程健,王东伟,杨凌凯,等. 一种改进的高斯混合模型煤矸石视频检测方法[J]. 中南大学学报(自然科学版),2018,49(1):118-123.

    CHENG Jian,WANG Dongwei,YANG Lingkai,et al. An improved Gaussian mixture model for coal gangue video detection[J]. Journal of Central South University (Science and Technology),2018,49(1):118-123.
    [8]
    LEI Xinyu,PAN Hongguang,HUANG Xiangdong. A dilated CNN model for image classification[J]. IEEE Access,2019,7:124087-124095. DOI: 10.1109/ACCESS.2019.2927169
    [9]
    PAN Hongguang,WEN Fan,HUANG Xiangdong,et al. The enhanced deep plug-and-play super-resolution algorithm with residual channel attention networks[J]. Journal of Intelligent & Fuzzy Systems:Applicationgs in Engineering and Technology,2021,41(2):4069-4078.
    [10]
    ZHU Xizhou, DAI Jifeng, YUAN Lu, et al. Towards high performance video object detection[C]. The IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, 2018: 7210-7218.
    [11]
    WANG Shiyao, ZHOU Yucong, YAN Junjie, et al. Fully motion-aware network for video object detection[C]. Proceedings of the European Conference on Computer Vision, Munich, 2018: 542-557.
    [12]
    WU Haiping, CHEN Yuntao, WANG Naiyan, et al. Sequence level semantics aggregation for video object detection[C]. The IEEE/CVF International Conference on Computer Vision, Seoul, 2019: 9217-9225.
    [13]
    FEICHTENHOFER C, PINZ A, ZISSERMAN A. Detect to track and track to detect[C]. The IEEE International Conference on Computer Vision, Venice, 2017: 3038-3046.
    [14]
    ZHOU Bolei, ANDONIAN A, TORRALBA A. Temporal relational reasoning in videos[C]. European Conference on Computer Vision, Munich, 2018: 803-818.
    [15]
    LIN T Y, DOLLAR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]. IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, 2017: 936-944.
    [16]
    HE Kaiming, GKIOXARI G, DOLLAR P, et al. Mask R-CNN[C]. International Conference on Computer Vision, Venice, 2017: 2961-2969.
    [17]
    REN Shaoqing,HE Kaiming,GIRSHICK R,et al. Faster R-CNN:towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149. DOI: 10.1109/TPAMI.2016.2577031
    [18]
    CHEN Yihong, CAO Yue, HU Han, et al. Memory enhanced global-local aggregation for video object detection[C]. IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, 2020: 10337-10346.
    [19]
    GIRSHICK R. Fast R-CNN[C]. IEEE International Conference on Computer Vision, Santiago, 2015, 1440-1448.
    [20]
    HOCHREITER S,SCHMIDHUBER J. Long short-term memory[J]. Neural Computation,1997,9(8):1735-1780. DOI: 10.1162/neco.1997.9.8.1735
    [21]
    DENG Jiajun, PAN Yingwei, YAO Ting, et al. Relation distillation networks for video object detection[C]. The IEEE/CVF International Conference on Computer Vision, Seoul, 2019: 7023-7032.
    [22]
    ZHU Xizhou, XIONG Yuwen, DAI Jifeng, et al. Deep feature flow for video recognition[C]. The IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, 2017: 2349-2358.
  • Related Articles

    [1]DU Jingyi, DANG Mengke, QIAO Lei, WEI Meiting, HAO Le. Drill pipe counting method based on improved spatial-temporal graph convolution neural network[J]. Journal of Mine Automation, 2023, 49(1): 90-98. DOI: 10.13272/j.issn.1671-251x.2022030098
    [2]YE Ou, DOU Xiaoyi, FU Yan, DENG Jun. Coal block detection method integrating lightweight network and dual attention mechanism[J]. Journal of Mine Automation, 2021, 47(12): 75-80. DOI: 10.13272/j.issn.1671-251x.2021030075
    [3]GONG Xiaoyan, PENG Gaogao, SONG Tao, FENG Xiong, CHEN Fei, LIU Hui, XIE Pei, XUE He. Study on air flow control parameters of long-pressure and short-extraction ventilation air outlets in heading face[J]. Journal of Mine Automation, 2021, 47(9): 45-52.. DOI: 10.13272/j.issn.1671-251x.2021060008
    [4]XIANG Zhencai . Application of short-drilling rapid drainage technology on long-distance excavation face[J]. Journal of Mine Automation, 2020, 46(2): 29-33. DOI: 10.13272/j.issn.1671-251x.2019070008
    [5]FENG Mei, QIAO Tiezhu. Research on short-term prediction method of coal flow on conveyor belt[J]. Journal of Mine Automation, 2019, 45(4): 72-76. DOI: 10.13272/j.issn.1671-251x.2018100075
    [6]PAN Lei, LI Li-juan, DING Ting-ting, LIU Dui. Forecasting of Short-term Power Load Based on Improved PSO Algorithm and LS-SVM[J]. Journal of Mine Automation, 2012, 38(9): 55-59.
    [7]SU Po, CHEN Qing, . Analysis of Mine Low-voltage Leakage Protection in Situation of Great Disparity between Long and Short Lines and Big Unbalanced Current[J]. Journal of Mine Automation, 2010, 36(10): 32-35.
    [8]SU Po, CHEN Qing. Analysis of Mine Low-voltage Leakage Protection in Situation of Great Disparity between Long and Short Lines and Big Unbalanced Current[J]. Journal of Mine Automation, 2010, 36(10): 12-15.
    [9]MA Ying-kui. Research of Multi-step Forecasting for Short-term Wind Speed[J]. Journal of Mine Automation, 2010, 36(9): 54-58.
    [10]DIAO Xiao-yan, ZHANG Duo. Application of ADO Technology in Data Storage of Monitoring and Control System[J]. Journal of Mine Automation, 2007, 33(5): 23-25.
  • Cited by

    Periodical cited type(2)

    1. 张卫,李振伟,杨晓利. 一种结合Gabor滤波器与GLCM的盲道检测方法. 计算机技术与发展. 2023(01): 68-73 .
    2. 刘晓阳,刘晶,张向阳,申利飞,邓志钢,马新彦,王地. 基于二维Gabor滤波器的胶带撕裂检测. 工矿自动化. 2021(04): 103-107 . 本站查看

    Other cited types(1)

Catalog

    Article Metrics

    Article views (163) PDF downloads (28) Cited by(3)
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return