基于多模态感知与深度强化学习的履带式机器人煤矿狭窄非结构化巷道动态避障

A mobile robot for coal mines with narrow and unstructured tunnels, based on multimodal perception and deep reinforcement learning, for dynamic obstacle avoidance

  • 摘要: 煤矿巷道环境狭窄、复杂,包含大量非结构化障碍物,导致目标感知-标定-决策链路中的多重不确定性较大,履带式机器人难以实现平衡、高精度的避障,由此,提出一种履带式机器人煤矿狭窄非结构化巷道动态避障方法。利用可见光波段的高分辨率成像与红外波段的热辐射敏感特性,多模态感知亮度异常区和热辐射隐匿目标;针对煤矿狭窄非结构化巷道中的障碍物,引入Mean Shift算法对标定对象进行核密度估计,通过迭代收敛于概率密度函数的位姿模点,解算亮度异常区和热辐射隐匿目标的三维空间坐标,缓解空间上的多重不确定性;考虑位姿模点相对位置关系,结合球体包络法构建三维空间坐标对应的安全势场边界,反复优化执行器位姿优化奖励、本体碰撞规避奖励与任务终止奖励,生成并更新避障策略。实验结果表明,高粉尘浓度环境还是强光、弱光环境中,跨模态结构相似性均值的波动性都较为平衡,且障碍物标定精度更高,最小距离始终大于安全阈值,在整个避障任务过程中保持安全裕度的能力较优秀。

     

    Abstract: The environment of coal mine tunnels is narrow and complex, containing a large number of unstructured obstacles, resulting in multiple uncertainties in the target perception calibration decision chain. It is difficult for tracked robots to achieve balanced and high-precision obstacle avoidance. Therefore, a dynamic obstacle avoidance method for tracked robots in narrow and unstructured coal mine tunnels is proposed. By utilizing high-resolution imaging in the visible light band and thermal radiation sensitivity in the infrared band, multimodal perception of areas with abnormal brightness and hidden targets due to thermal radiation can be achieved; In response to obstacles in narrow and unstructured coal mine tunnels, the Mean Shift algorithm is introduced to estimate the kernel density of the calibration object. By iteratively converging to the pose model points of the probability density function, the three-dimensional spatial coordinates of the brightness anomaly area and thermal radiation hidden targets are solved to alleviate multiple uncertainties in space; Considering the relative position relationship of pose model points, combined with the sphere envelope method, a safe potential field boundary corresponding to three-dimensional spatial coordinates is constructed. The actuator pose optimization reward, ontology collision avoidance reward, and task termination reward are repeatedly optimized to generate and update obstacle avoidance strategies. The experimental results show that the fluctuation of cross modal structural similarity mean is relatively balanced in high dust concentration environments or strong and weak light environments, and the obstacle calibration accuracy is higher. The minimum distance is always greater than the safety threshold, and the ability to maintain safety margin throughout the entire obstacle avoidance task is excellent.

     

/

返回文章
返回