Abstract:
Currently, obstacle perception for autonomous heavy-duty trucks in open-pit mines is achieved primarily by processing LiDAR point clouds and camera images separately, which fails to combine the perceptual strengths of both modalities into a comprehensive judgment. Meanwhile, existing object detection algorithms that fuse point clouds and images operate only at the decision level, so the complementarity of the multimodal data remains low. To address these issues, a feature-level fusion obstacle detection algorithm for autonomous mining trucks is proposed. The detection modules of the Voxel R-CNN and YOLO-v5 models are optimized to extract point-cloud and image features of the detection targets, respectively; a sparse convolution method then reconstructs the critical cuboid of each target from the combined features, completing object detection. A procedure for determining the spatial position of targets is also designed. On the basis of joint hardware calibration, a training and validation dataset is constructed to train and validate the model. Experimental results show that, compared with the Voxel R-CNN and YOLO-v5 models, the fusion model achieves higher precision, recall, bounding-box (bbox) accuracy, and 3D accuracy in both daytime and nighttime conditions. Moreover, the recall of the fusion model exceeds its precision, indicating strong resistance to missed detections. Precision, recall, bbox accuracy, and 3D accuracy all decrease gradually with increasing detection distance: within 80 m the decline in precision and recall is slow, but beyond 80 m it accelerates. The fusion model meets the high-precision, high error-tolerance requirements of obstacle detection for unmanned driving in open-pit mines.