基于DYCS−YOLOv8n的井下无人驾驶电机车多目标检测

许谨辉; 王文善; 王爽; 王文钺; 赵婷婷

doi:10.13272/j.issn.1671-251x.2024100036

基于DYCS−YOLOv8n的井下无人驾驶电机车多目标检测

Multi-object detection for underground unmanned locomotives based on DYCS-YOLOv8n

摘要

摘要: 针对井下无人驾驶电机车因光线暗、噪声大及运动模糊等因素导致图像特征难提取、细节易丢失、小尺寸目标难识别等问题，提出了一种基于DYCS−YOLOv8n的井下无人驾驶电机车多目标检测模型。在YOLOv8n的基础上引入卷积注意力模块（CBAM），通过空间和通道双重注意力机制，提高了对关键特征的提取能力；增加小目标检测层，由原来的3层增加到4层，从而更好地提取细小特征，提升了对小尺寸目标的检测性能；采用动态上采样算子DySample，根据输入特征自适应地调整采样策略，更好地保留图像中的边缘和局部细节，避免了图像关键信息损失。采用自建的井下无人驾驶电机车数据集进行实验，结果表明：① DYCS−YOLOv8n模型的平均精度均值(mAP@0.5)达97.5%，较YOLOv8n模型提高了3.4%，且检测速度达46.35帧/s，满足实时性检测需求。② 与YOLO系列主流目标检测模型相比，DYCS−YOLOv8n模型的mAP@0.5最优，在保持轻量化的同时保证了较快的计算速度。③ 在噪声、低光照等复杂井下场景下，DYCS−YOLOv8n模型对行人、轨道、信号灯的平均检测置信度较高，未出现漏与误检情况。

Abstract: To address the challenges in underground unmanned locomotive image feature extraction—such as poor lighting, high noise, and motion blur, which result in the loss of image details and difficulty in identifying small targets—a multi-object detection model for underground unmanned locomotives based on DYCS-YOLOv8n was proposed. Based on YOLOv8n, the Convolutional Block Attention Module (CBAM) was introduced, enhancing the extraction of key features through spatial and channel attention mechanisms. A small-object detection layer was added, increasing the original three layers to four, thereby improving the extraction of fine features and enhancing detection performance for small-sized targets. The dynamic upsampling operator DySample was employed to adaptively adjust the sampling strategy according to the input features, better preserving edges and local details in the images and avoiding the loss of critical information. Experiments conducted on a self-constructed underground unmanned locomotive dataset showed that: ① The DYCS-YOLOv8n model achieved a mean Average Precision (mAP@0.5) of 97.5%, an improvement of 3.4% over the YOLOv8n model, with a detection speed of 46.35 frames per second, meeting the requirements for real-time detection. ② Compared with mainstream YOLO series object detection models, DYCS-YOLOv8n achieved the optimal mAP@0.5, maintaining a lightweight structure while ensuring high computational speed. ③ In complex underground scenarios with noise and low illumination, the DYCS-YOLOv8n model exhibited high average detection confidence for pedestrians, tracks, and signal lights, with no cases of missed or false detections.

HTML全文

参考文献(25)

施引文献

资源附件(0)