Research on Virtual Path Planning Based on Improved DQN

机译：基于改进DQN的虚拟路径规划研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

An end-to-end approach based on the theory of Deep Reinforcement Learning has been proven to be able to meet or exceed human-level strategic capabilities. Applying this learning algorithm to path planning methods can make robots self-contained learning ability and environment interaction ability, and increased generalization ability. In this paper, Deep Q Network (DQN) as the typical Deep Reinforcement Learning method is improved. Improvement points can be divided into two steps. Firstly, the two steps of the selection of actions in the current network and how to calculate the target Q value are decoupled to eliminate overestimation caused by the rapid optimization of Q value in the possible direction. Then, considering that the action value function can bring benefits in addition to the action with the greatest value made by the agent, the static environment also has certain influence, the final result is a linear combination of two parts, which is to estimate the value functions of the upper, lower, left and right actions of the neural network output and the value of the environment state itself. Under the same experimental conditions, the improved DQN network is compared with the original DQN network, the result shows that the estimated final target value function of improved DQN network is more accurate and effective for virtual path planning tasks.

机译：已被证明能够满足或超过人级战略能力的基于深度加强学习理论的端到端方法。将该学习算法应用于路径规划方法，可以使机器人自包含的学习能力和环境交互能力，以及增加的泛化能力。在本文中，改进了深度Q网络（DQN）作为典型的深度增强学习方法。改进点可以分为两个步骤。首先，在当前网络中选择动作的两个步骤以及如何计算目标Q值，以消除由可能方向上的Q值的快速优化引起的高度估计。然后，考虑到动作价值函数除了由代理商所做的最大价值之外的动作来带来益处，静态环境也具有一定的影响，最终结果是两部分的线性组合，这是估计值神经网络输出的上部，较低，左，左右动作的功能和环境状态本身的值。在相同的实验条件下，将改进的DQN网络与原始DQN网络进行比较，结果表明，改进的DQN网络的估计最终目标值函数更准确，对虚拟路径规划任务有效。

著录项

来源
《IEEE International Conference on Real-time Computing and Robotics》|2020年|387-392|共6页
会议地点
作者
Cheng Yi; Meng Qi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Path planning; Reinforcement learning; Robots; Neural networks; Planning; Markov processes; Correlation;

机译：路径规划;加强学习;机器人;神经网络;规划;马尔可夫进程;相关性;

相似文献

外文文献
中文文献
专利

1. A Twofold-interpolation-based Path Planning Algorithm and Its Path Following Based on Improved Virtual Vehicle Method [J] . Zhiwei Liang, Songhao Zhu, Fang Fang International Journal of Control, Automation, and Systems . 2012,第1期

机译：基于改进虚拟车方法的基于二次插值的路径规划算法及其路径跟踪
2. A twofold-interpolation-based path planning algorithm and its path following based on improved virtual vehicle method [J] . Zhiwei Liang, Songhao Zhu, Fang Fang International Journal of Control, Automation and Systems . 2012,第1期

机译：基于改进的虚拟车辆方法的基于双重插值的路径规划算法及其路径跟踪
3. Multi-robot path planning based on a deep reinforcement learning DQN algorithm [J] . Yang Yang, Li Juntao, Peng Lingling CAAI Transactions on Intelligence Technology . 2020,第3期

机译：基于深度加强学习DQN算法的多机器人路径规划
4. A Deep Q-network (DQN) Based Path Planning Method for Mobile Robots [C] . Siyu Zhou, Xin Liu, Yingfu Xu, IEEE International Conference on Information and Automation . 2018

机译：基于深度Q网络（DQN）的移动机器人路径规划方法
5. Utilizing a Technology-Based Meal Planning Engine, EatLove, Simultaneously with Virtual Cooking Classes and Virtual Guidance to Eliminate Provider Barriers and Promote Improved Nutrition Choices by Patients at Neucare Clinic [D] . Weyer, Timmie. 2020

机译：利用基于技术的膳食计划发动机，同时使用虚拟烹饪课程和虚拟指导，消除提供者障碍，并通过Neucare诊所的患者提高营养选择
6. Path Planning for Mobile Robot Based on Improved Bat Algorithm [O] . Xin Yuan, Xinwei Yuan, Xiaohu Wang 2021

机译：基于改进BAT算法的移动机器人路径规划
7. Path Planning via an Improved DQN-Based Learning Policy [O] . Liangheng Lv, Sunjie Zhang, Derui Ding, 2019

机译：路径规划通过改进的基于DQN的学习政策

Research on Virtual Path Planning Based on Improved DQN

摘要

著录项

相似文献

相关主题

期刊订阅