基于时空动态聚合的井下钻场无参考视频质量评价

No-Reference Video Quality Assessment for Downhole Drilling Sites Based on Spatiotemporal Dynamic Aggregation

  • 摘要: 在煤矿智能化转型中,现有无参考视频质量评价方法多基于井上自然场景设计,未能针对钻场中煤尘与振动引起的复合失真得到有效解决,导致其在此类特殊干扰环境下泛化能力弱,无法满足智能矿山系统对高可靠性视频质量评估的需求。因此,针对现有视频质量评价方法难以有效评估煤矿井下钻场因煤尘扩散和钻机振动导致的视频退化问题,本文提出一种基于时空动态聚合的无参考视频质量评价方法。该方法采用双流特征提取网络,分别从空间结构与运动特性两方面对钻场监控视频进行建模。空间特征提取分支基于Swin Transformer架构,引入局部感知增强模块,以强化煤尘干扰下的纹理与边缘细节表征能力;运动特征提取分支则通过在3D ResNet中嵌入可变形卷积,实现对钻机运动轨迹与煤尘扩散动态特征的精准捕捉。为进一步应对钻场复合质量退化的动态特性,本文构建了时空动态聚合评价方法,通过动态分配时空权重,在静态场景中增强空间特征,以提升对粉尘污染的感知灵敏,并在动态场景中强化运动特征以有效捕捉设备异常位移,实现对不同失真类型与程度的判别性表达。最终,上述提取的判别性特征通过回归模块构建视频质量评价模型。本文方法在Coal-DB数据集开展实验,结果相比其他主流方法的指标SROCC、PLCC以及KROCC平均分别提升了10.5%、10.8%和12.3%,RMSE平均降低20.0%,表明该方法在煤矿井下钻场环境中的视频质量评价具有较高的准确性。

     

    Abstract: In the context of intelligent transformation in coal mining, most existing no-reference video quality assessment methods are designed for above-ground natural scenes and fail to effectively address compound distortions caused by coal dust and vibrations in underground drilling sites. This results in poor generalization in such challenging environments and an inability to meet the demand for highly reliable video quality assessment in intelligent mining systems. To tackle the difficulty of evaluating video degradation due to coal dust diffusion and drilling rig vibrations in underground coal drilling environments, this paper proposes a no-reference video quality assessment method based on spatio-temporal dynamic aggregation.The method employs a two-stream feature extraction network to model surveillance videos from both spatial structure and motion characteristics. The spatial feature branch, built on Swin Transformer, incorporates a local perception enhancement module to improve the representation of textures and edge details under coal dust interference. The motion feature branch integrates deformable convolutions into a 3D ResNet to accurately capture the motion trajectory of the drilling rig and dynamic characteristics of coal dust diffusion.To further handle the dynamic nature of compound quality degradation in drilling environments, a spatio-temporal dynamic aggregation strategy is introduced. It adaptively allocates spatio-temporal weights, enhancing spatial features in static scenarios to increase sensitivity to dust pollution, and strengthening motion features in dynamic scenarios to effectively detect abnormal equipment displacement. This enables discriminative representation of different distortion types and degrees. The extracted features are then fed into a regression module to construct the video quality assessment model.Experiments conducted on the Coal-DB dataset demonstrate that the proposed method outperforms other state-of-the-art approaches, achieving average improvements of 10.5% in SROCC, 10.8% in PLCC, and 12.3% in KROCC, while reducing RMSE by 20.0%. These results indicate that the method offers high accuracy in video quality assessment for underground coal drilling environments.

     

/

返回文章
返回