ZHU Mei-qiang, LI Ming, ZHANG Qia. A Dyna_Q-learning Algorithm Used in Underground Path Planning[J]. Industry and Mine Automation, 2012, 38(12): 71-76.
Citation: ZHU Mei-qiang, LI Ming, ZHANG Qia. A Dyna_Q-learning Algorithm Used in Underground Path Planning[J]. Industry and Mine Automation, 2012, 38(12): 71-76.

A Dyna_Q-learning Algorithm Used in Underground Path Planning

  • Publish Date: 2012-12-10
  • The Euclidean distance is usually used in heuristic planning of Dyna_Q-learning based on reinforcement learning tasks of goal position. But it is not suitable for these tasks whose state space is not continuous in Euclidean space such as path planning of disaster rescue robot in underground coal mine. For the problem, the paper introduced the Laplacian Eigenmap whose computational complexity is lower in manifold learning, then proposed an improved Dyna_Q-learning algorithm based on manifold distance metric. The proposed algorithm is simulated in grid world that is similar to underground environment. The simulation results verified validity of the algorithm.

     

  • loading
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (41) PDF downloads(4) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return