面向露天矿山无人机遥感影像机械小目标识别的CDMG−YOLO模型

CDMG-YOLO model for mechanical small object detection in UAV remote sensing images of open-pit mines

  • 摘要: 露天矿山作业环境复杂,无人机遥感图像中机械目标往往存在尺度差异显著、背景干扰繁杂及小目标易漏检等问题。针对上述问题,提出了一种面向露天矿山无人机遥感影像机械小目标识别的轻量化模型——CDMG−YOLO。该模型以YOLOv11n为基础进行改进:在特征感知阶段,新增P2检测头与主干浅层特征协同构建细粒度信息通道,增强小目标细节特征;在特征建模阶段,通过引入可变形大卷积核注意力(DLKA)的D−C3k2模块扩展感受野并提升尺度泛化能力;在特征融合阶段,双重注意力机制——卷积与注意力融合模块(CAFM)与多维协作注意力模块(MCAM)在通道与空间层面实现特征的多维交互与自适应加权,从而突出目标特征并抑制背景干扰;在训练优化阶段,采用结合梯度协调机制损失函数(GHM Loss)的复合损失函数,平衡了难易样本在训练过程中的梯度贡献,有效缓解了正负样本不均衡问题。实验结果表明:在自建的覆盖多类露天矿山机械小目标的无人机遥感数据集上,CDMG−YOLO的精确率和mAP@0.5分别达0.859和0.697,且参数量仅为3.1×106,每秒浮点运算次数为12.7×109,在保证高精度的同时兼顾了轻量化与高效性;在低对比度、目标严重遮挡及多目标密集分布等复杂露天矿山作业场景中,CDMG−YOLO模型实现了对目标的精准定位与识别;在公开LEVIR数据集上,CDMG−YOLO能准确识别不同类型目标,具有良好的泛化能力。

     

    Abstract: The operating environment of open-pit mines is complex, and mechanical targets in UAV remote-sensing images often exhibit significant scale variations, complex background interference, and a high risk of missed detections of small targets. To address these problems, a lightweight model named CDMG-YOLO was proposed for small mechanical target detection in UAV remote sensing images of open-pit mines. The model was improved based on YOLOv11n. In the feature perception stage, a P2 detection head was added and combined with shallow backbone features to construct a fine-grained information pathway, enhancing the detail features of small targets. In the feature modeling stage, the D-C3k2 module incorporating Deformable Large Kernel Attention (DLKA) was introduced to expand the receptive field and improve scale generalization. In the feature fusion stage, a dual-attention mechanism—namely the Convolution and Attention Fusion Module (CAFM) and the Multidimensional Collaborative Attention Module (MCAM)—achieved multidimensional interaction and adaptive weighting of features along the channel and spatial dimensions, thereby highlighting target features and suppressing background interference. In the training optimization stage, a composite loss function combining the Gradient Harmonizing Mechanism Loss (GHM Loss) was adopted, which balanced the gradient contributions of hard and easy samples during training and effectively alleviated the positive–negative sample imbalance problem. Experimental results showed that on a self-built UAV remote-sensing dataset covering multiple types of small mechanical targets in open-pit mines, CDMG-YOLO achieved a precision of 0.859 and an mAP@0.5 of 0.697, with only 3.1×106 parameters and 12.7 GFLOPs, achieving a balance between high accuracy, lightweight design, and efficiency. In low-contrast scenes, severe occlusion situations, and densely distributed multi-target scenarios in complex open-pit mine operations, the CDMG-YOLO model achieved accurate target localization and recognition. On the public LEVIR dataset, CDMG-YOLO accurately identified different types of targets and demonstrated good generalization capability.

     

/

返回文章
返回