基于多智能体强化学习的煤矿智能瓦斯巡检机器人网络化控制研究

    Networked control of intelligent gas inspection robots in coal mines based on multi-agent reinforcement learning

    • 摘要: 为了提高煤矿井下瓦斯巡检的安全性和实时性,降低人工巡检带来的危险与效率低下问题,面向复杂巷道和高风险工况构建了一种融合感知与决策的智能化、网络化控制方法。以多智能体强化学习为总体框架,构建了嵌入式目标识别和协同路径规划双模块。感知层采用了结构剪枝与8 bit量化的轻量化目标检测网络,通过多尺度特征融合与注意机制实现了对瓦斯管道、阀门装置、障碍物的高精度识别;决策层在蚁群优化算法基础上引入了局部最优引导、交叉变异与信息素自适应更新机制,形成了改进型蚁群优化算法以完成路径规划。在Gazebo三维矿井仿真数据集上,对所建模型与对比模型进行对比,并在真实矿井环境开展了24 h小范围连续巡检试验,试验指标涵盖检测精度、均方根误差、路径规划成功率、收敛代数、系统故障率、能耗与数据丢包率等。仿真结果表明,模型在50次迭代后路径适应度达到0.92,均方根误差0.15,检测精度88.3%,平均精度均值90.1%;数据量800帧时的训练时间为1.8 s,处理时延3.2 s,较传统模型分别缩短了28%和20%。在真实矿井测试中,检测精度保持在87.9%,综合故障率1.8%,通信中断率0.6%,路径规划失败率0.7%,平均响应时延2.8 s,能耗14.6 Wh,数据丢包率0.4%;在弱光与高湿度条件下检测精度仍保持在85%以上,路径规划成功率在95%左右。研究结果显示,该方法实现了高精度、低延迟的煤矿瓦斯自主巡检。

       

      Abstract: To enhance the safety and real-time performance of underground gas inspection in coal mines and to overcome the low efficiency and high risk associated with manual inspection, a networked intelligent control approach integrating perception and decision-making was developed. Relying on multi-agent reinforcement learning (MARL), the framework constructs an embedded dual-module system that combines object recognition with cooperative path planning. The perception module utilizes the lightweight YOLO-Float detection network, which is optimized through structural pruning and 8-bit quantization to balance computational efficiency and accuracy. Furthermore, by integrating multi-scale feature fusion and attention mechanisms, the model achieves high-precision recognition of gas pipelines, valves, and dynamic obstacles. The decision-making layer introduced local optimal guidance, crossover mutation and information self-adaptive update mechanism based on the ant colony optimization algorithm, and formed an improved ant colony optimization (IACO) algorithm to complete the path planning. On the Gazebo three-dimensional mine simulation dataset, the constructed model was compared with the reference model. A 24-hour small-scale continuous inspection test was also conducted in the real mine environment. The test indicators covered detection accuracy, root mean square error (RMSE), path planning success rate, convergence generations, system failure rate, energy consumption, and data packet loss rate, etc. The simulation results show that after 50 iterations, the path fitness of the model reached 0.92, the root mean square error was 0.15, the detection accuracy was 88.3%, the average accuracy mean was 90.1%, and the training time with 800 frames of data was 1.8 seconds, and the processing delay was 3.2 seconds. Compared with the traditional model, it was shortened by 28% and 20% respectively. In the actual mine tests, the detection accuracy remained at 87.9%, the comprehensive failure rate was 1.8%, the communication interruption rate was 0.6%, the path planning failure rate was 0.7%, the average response delay was 2.8 seconds, the energy consumption was 14.6 Wh, and the data packet loss rate was 0.4%. Even under weak illumination and high humidity conditions, the detection accuracy still remained above 85%, and the path planning success rate was around 95%. The research results show that this method has achieved high-precision and low-latency autonomous gas inspection in coal mines.

       

    /

    返回文章
    返回