基于改进融合注意力卷积网络的矿井低照度图像增强算法

郑爽; 王维柏

基于改进融合注意力卷积网络的矿井低照度图像增强算法

郑爽,
王维柏

An Enhanced Low-Illumination Mine Image Enhancement Algorithm Based on Improved Fusion Attention Convolutional Network

摘要

摘要: 煤矿井下复杂环境中，图像普遍存在照度不足、非均匀光照、粉尘散射模糊及噪声干扰等问题，严重影响监控预警与智能识别系统的稳定性。针对上述问题，提出一种基于改进融合注意力卷积网络的矿井低照度图像增强算法。该方法采用编码–解码结构作为主干网络，在各阶段引入残差密集块（RDB），通过多层特征级联与局部残差融合增强特征复用能力并缓解梯度退化；在解码阶段设计通道–空间融合注意力模块（CSA），结合高效通道注意力（ECA）与深度可分离空间注意力（DSA），实现对关键区域的自适应特征重标定，有效抑制背景噪声；同时构建包含均方误差损失与感知损失的复合损失函数，在保证像素一致性的同时提升结构保真度与视觉自然度。基于自建CMUHL矿井数据集开展对比实验，结果表明：所提算法的PSNR达到30.544dB，SSIM达到0.914，均优于FLOL、AnlightenDiff、Retinexformer等先进算法；模型参数量仅4.37M，在保证增强性能的同时有效降低计算复杂度。真实场景验证结果显示，该算法在NIQE与BRISQUE无参考指标上取得最优和次优表现，增强图像亮度均衡、细节清晰且无明显色偏现象，具有良好的工程应用价值。消融实验进一步验证了RDB、CSA与复合损失函数的协同增益作用。

Abstract: In the complex environment of underground coal mines, images often suffer from insufficient illumination, non-uniform lighting, dust scattering blurring, and noise interference, which seriously affect the stability of monitoring and early warning as well as intelligent recognition systems. To address these issues, an image enhancement algorithm for low-illumination mine images based on an improved fusion attention convolutional network is proposed. This method employs an encoder-decoder structure as the backbone network and introduces residual dense blocks (RDB) at each stage to enhance feature reusability and alleviate gradient degradation through multi-level feature concatenation and local residual fusion. In the decoding stage, a channel-space fusion attention module (CSA) is designed, combining efficient channel attention (ECA) and depthwise separable spatial attention (DSA), to achieve adaptive feature recalibration of key regions and effectively suppress background noise. Additionally, a composite loss function consisting of mean squared error loss and perceptual loss is constructed to ensure pixel consistency while improving structural fidelity and visual naturalness. Comparative experiments were conducted based on the self-built CMUHL mine dataset, and the results show that the proposed algorithm achieves a PSNR of 30.544 dB and an SSIM of 0.914, outperforming advanced algorithms such as FLOL, AnlightenDiff, and Retinexformer. The model has only 4.37M parameters, effectively reducing computational complexity while maintaining enhancement performance. Real-scene verification results indicate that the algorithm achieves the best and second-best performance in the NIQE and BRISQUE no-reference metrics, with enhanced images featuring balanced brightness, clear details, and no obvious color bias, demonstrating good engineering application value. Ablation experiments further verify the synergistic benefits of RDB, CSA, and the composite loss function.

HTML全文

参考文献(0)

施引文献

资源附件(0)