融合快速遍历随机树和Q强化学习的煤矿轮式机器人路径规划关键技术

    Key technologies for path planning of coal mine wheeled robots integrating fast traversal random trees and Q-reinforcement learning

    • 摘要: 煤炭开采过程中存在大量危险环境和大体积运输任务,改善煤矿设备的工作质量是煤矿智能化建设的关键。为了提高煤矿生产的效率和运输任务的安全性,提出了一种基于强化学习的煤矿轮式机器人路径规划方法;方法使用贪婪策略进行随机树扩展方向指引,利用马尔科夫决策减少扩展时生成的无效节点,通过三阶贝塞尔曲线平滑化路径轨迹,在学习过程中加入专家经验回放池提高计算效率。试验结果表明:在规划路径长度测试中,研究方法生成的全局路径长度能够比其他算法缩短最少10.71%;进行多障碍物场景规划时间测试时,研究方法的规划时间仅为0.452 s;在避障效果分析中,研究方法规划路径能够有效避开静态障碍物和动态障碍物;研究方法具有更快的路径规划效率,能够生成更安全的机器人运行路径。

       

      Abstract: There are a lot of dangerous environment and large volume transportation tasks in the coal mining process. Improving the working quality of coal mine equipment is the key to the intelligent construction of coal mine. In order to improve the efficiency of coal mine production and the safety of transportation tasks, a path planning method of coal mine wheeled robot based on reinforcement learning is proposed. This method uses greedy strategy to guide the direction of random tree expansion, uses Markov decision to reduce the invalid nodes generated during expansion, smooths the path trajectory through third-order Bessel curve, and adds expert experience playback pool to improve the computational efficiency. The experimental results show that the global path length generated by the research method can be reduced by at least 10.71% compared with other algorithms in the planned path length test. In the multi-obstacle scenario planning time test, the planning time of the research method is only 0.452 s. In the analysis of obstacle avoidance effect, the research method can effectively avoid static obstacle and dynamic obstacle. The research method has faster path planning efficiency and can generate safer robot running paths.

       

    /

    返回文章
    返回