基于退化核扩散的煤矿井下图像盲超分辨率重建方法

任才; 李明亮

doi:10.13272/j.issn.1671-251x.2025100002

基于退化核扩散的煤矿井下图像盲超分辨率重建方法

任才,
李明亮

Blind super-resolution reconstruction method for underground coal mine images based on degradation kernel diffusion

摘要

摘要: 现有图像超分辨率重建方法难以应对煤矿井下煤尘散射与非均匀模糊等多源耦合退化，受限于局部感受野而难以捕捉全局结构，因模型复杂度过高而难以满足煤矿井下边缘设备的轻量化部署要求。针对上述问题，提出了一种基于退化核扩散的煤矿井下图像盲超分辨率重建方法。在退化建模阶段引入退化核扩散建模，利用扩散概率模型的反向采样过程显式模拟煤矿井下复杂场景的退化核分布，从源头修正因退化估计偏差导致的重建伪影；在图像重建阶段设计混合Transformer−CNN编码器与动态可逆解码器，通过并行双分支结构互补提取局部纹理与全局依赖，并结合动态比例融合机制实现退化特征与图像内容的自适应交互，在保证深层特征无损传输的同时降低模型参数量；通过结合L1损失、结构相似性（SSIM）损失与感知损失构建多指标联合损失函数，在保证像素级精度的同时提高图像的感知质量。在煤矿井下图像数据集CMUID及公共基准数据集上进行了实验，结果表明：在客观评价指标上，所提方法在保持较低参数量的同时，峰值信噪比与SSIM均整体优于对比方法；在主观视觉效果上，所提方法有效抑制了低照度噪声，锐化了边缘结构，清晰恢复了煤矿井下输送带与煤块的纹理细节。

Abstract: Existing image super-resolution reconstruction methods have difficulty coping with multi-source coupled degradations such as coal dust scattering and non-uniform blur in underground coal mine environments, and they are limited by local receptive fields, making it difficult to capture global structures, while excessive model complexity prevents lightweight deployment on underground edge devices. To address these issues, a blind super-resolution reconstruction method for underground coal mine images based on degradation kernel diffusion was proposed. In the degradation modeling stage, degradation kernel diffusion modeling was introduced, and the reverse sampling process of a diffusion probabilistic model was used to explicitly simulate the degradation kernel distribution in complex underground coal mine scenes, thereby correcting reconstruction artifacts caused by degradation estimation bias at the early stage. In the image reconstruction stage, a hybrid Transformer-CNN encoder and a dynamic invertible decoder were designed, in which a parallel dual-branch structure was used to complementarily extract local textures and global dependencies, and a dynamic proportional fusion mechanism was employed to achieve adaptive interaction between degradation features and image content, reducing the number of model parameters while ensuring lossless transmission of deep features. By combining L1 loss, Structural Similarity (SSIM) loss, and perceptual loss, a multi-metric joint loss function was constructed to enhance perceptual image quality while ensuring pixel-level accuracy. Experiments were conducted on the CMUID underground coal mine image dataset and public benchmark datasets. The results showed that, in terms of objective evaluation metrics, the proposed method achieved overall superior performance in peak signal-to-noise ratio and SSIM compared with competing methods while maintaining a lower parameter count. In terms of subjective visual quality, the proposed method effectively suppressed low-light noise, sharpened edge structures, and clearly restored texture details of conveyor belts and coal blocks in underground coal mine scenes.

HTML全文

参考文献(25)

施引文献

资源附件(0)