融合多模态数据和UAViT的带式输送机托辊故障诊断方法

Fault diagnosis method for belt conveyor rollers integrating multi-modal data and UAViT

  • 摘要: 为解决现有带式输送机托辊故障信息表征不显著、缺乏多模态信息有效融合利用机制且融合效果不稳定导致模型泛化能力差、故障辨识度低的问题,提出一种融合多模态数据和统一聚合视觉变换模型(UAViT)的带式输送机托辊故障诊断方法。首先,通过MEMS光纤振动传感器和GSD5矿用本安型声传感器采集带式输送机托辊运行过程中的原始振动和音频信号。其次,划分信号区间,通过小波阈值降噪、Wiener滤波以及STFT(短时傅里叶变换)处理原始信号,提取振动信号的时频图和音频信号的窄带语谱图,制作数据集。随后,构建包含聚合知识提取模块(AKEM)和隐式先验知识迁移模块(IPKTM)的UAViT模型:AKEM采用碎片交叉注意力机制(FCAM)融合多模态信息,增强多模态信息互补性和信息表征显著性,并生成隐式先验知识库;IPKTM通过结合新样本表征与隐式先验知识,进一步增强信息表征显著性,同时解决融合效果不稳定和模型泛化能力弱的问题。最后,采用两阶段有监督学习策略对UAViT模型进行训练:第一阶段仅包含AKEM模块,用于训练聚合知识提取能力;第二阶段引入IPKTM模块,固定AKEM权重,进一步优化模型性能,并在测试集上输出托辊故障诊断结果。在某矿区的带式输送机托辊实采数据上进行实验,消融实验结果表明,所提出的UAViT模型的准确率为99.67%。当去掉IPKTM后,准确率降低至95.56%;进一步去掉FCAM后,准确率进一步降低至92.71%,实验结果验证了IPKTM和FCAM的有效性。对比实验结果表明,与TFM-MI-1DCNN(融合信号-多输入一维卷积神经网络)和DWT-LMD-BPNN(离散小波变换-局部平均分解-反向传播神经网络)相比,UAViT模型的准确率分别提升1.27%和6.67%,实验结果验证了多模态信息表征的有效性。此外,将训练好的UAViT模型直接用于其他矿区的带式输送机托辊实采数据,结果显示托辊故障诊断的准确率为95.45%,验证了模型良好的泛化能力。

     

    Abstract: Knowledge Transfer Module (IPKTM). The AKEM employs the Fragmented Cross-Attention Mechanism (FCAM) to fuse multi-modal information, enhancing complementarity and representation significance while generating an implicit prior knowledge repository. The IPKTM further boosts information representation significance by integrating new sample representations with implicit prior knowledge, simultaneously addressing unstable fusion results and weak model generalization.Finally, we adopt a two-stage supervised learning strategy to train the UAViT model. In the first stage, only the AKEM module is included to train the capability of aggregating knowledge. In the second stage, the IPKTM module is introduced with fixed AKEM weights to further optimize model performance, ultimately producing fault diagnosis results for roller systems on the test dataset.Experiments were conducted using actual data from belt conveyor rollers in a specific mine. Ablation experiments demonstrated that the proposed UAViT model achieved an accuracy of 99.67%. Removing the IPKTM reduced accuracy to 95.56%, and further removing the FCAM decreased accuracy to 92.71%, thereby validating the effectiveness of both IPKTM and FCAM. Comparative experiments showed that compared to FM-MI-1DCNN (Fused Signal-Multi Input 1D Convolutional Neural Network) and DWT-LMD-BPNN (Discrete Wavelet Transform-Local Mean Decomposition-Backpropagation Neural Network), the UAViT model improved accuracy by 1.27% and 6.67%, respectively, confirming the effectiveness of multi-modal information representation. Additionally, applying the trained UAViT model directly to actual data from belt conveyor rollers in other mines yielded an average accuracy of 95.45%, demonstrating the model's excellent generalization capability.

     

/

返回文章
返回