首页> 外文会议>IEEE International Conference on Real-time Computing and Robotics >Research on Virtual Path Planning Based on Improved DQN
【24h】

Research on Virtual Path Planning Based on Improved DQN

机译:基于改进DQN的虚拟路径规划研究

获取原文

摘要

An end-to-end approach based on the theory of Deep Reinforcement Learning has been proven to be able to meet or exceed human-level strategic capabilities. Applying this learning algorithm to path planning methods can make robots self-contained learning ability and environment interaction ability, and increased generalization ability. In this paper, Deep Q Network (DQN) as the typical Deep Reinforcement Learning method is improved. Improvement points can be divided into two steps. Firstly, the two steps of the selection of actions in the current network and how to calculate the target Q value are decoupled to eliminate overestimation caused by the rapid optimization of Q value in the possible direction. Then, considering that the action value function can bring benefits in addition to the action with the greatest value made by the agent, the static environment also has certain influence, the final result is a linear combination of two parts, which is to estimate the value functions of the upper, lower, left and right actions of the neural network output and the value of the environment state itself. Under the same experimental conditions, the improved DQN network is compared with the original DQN network, the result shows that the estimated final target value function of improved DQN network is more accurate and effective for virtual path planning tasks.
机译:已被证明能够满足或超过人级战略能力的基于深度加强学习理论的端到端方法。将该学习算法应用于路径规划方法,可以使机器人自包含的学习能力和环境交互能力,以及增加的泛化能力。在本文中,改进了深度Q网络(DQN)作为典型的深度增强学习方法。改进点可以分为两个步骤。首先,在当前网络中选择动作的两个步骤以及如何计算目标Q值,以消除由可能方向上的Q值的快速优化引起的高度估计。然后,考虑到动作价值函数除了由代理商所做的最大价值之外的动作来带来益处,静态环境也具有一定的影响,最终结果是两部分的线性组合,这是估计值神经网络输出的上部,较低,左,左右动作的功能和环境状态本身的值。在相同的实验条件下,将改进的DQN网络与原始DQN网络进行比较,结果表明,改进的DQN网络的估计最终目标值函数更准确,对虚拟路径规划任务有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号