SAE New Energy Intelligent Connected Vehicle Technology Conference

Autopilot Strategy Based on Improved DDPG Algorithm



Abstract

Deep Deterministic Policy Gradient (DDPG) is a Deep Reinforcement Learning algorithm. Because it performs well in continuous motion control, the DDPG algorithm has been applied to the field of self-driving. To address the instability of the DDPG algorithm during training, its low training efficiency, and its slow convergence rate, an improved DDPG algorithm based on segmented experience replay is presented. Building on the DDPG algorithm, segmented experience replay selects training experience by importance according to the training progress, improving the training efficiency and stability of the trained model. The algorithm was tested in TORCS, an open-source 3D car racing simulator. The simulation results demonstrate that training stability is significantly improved compared with the DDPG and DQN algorithms, and that the average return is about 46% higher than the DDPG algorithm and about 55% higher than the DQN algorithm.
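The abstract names the technique but not its details. Below is a minimal sketch of what a segmented replay buffer of this kind might look like, assuming two segments split by a reward threshold and a sampling mix that shifts toward the high-importance segment as training progresses. The class name SegmentedReplayBuffer, the reward_threshold parameter, and the progress-based mixing rule are illustrative assumptions, not the paper's exact method.

```python
import random
from collections import deque

class SegmentedReplayBuffer:
    """Sketch of segmented experience replay (assumed design, not the
    paper's exact scheme): transitions are split into a high-reward
    and a low-reward segment, and the fraction of each batch drawn
    from the high-reward segment grows with training progress."""

    def __init__(self, capacity=100_000, reward_threshold=0.0):
        self.high = deque(maxlen=capacity // 2)  # "important" transitions
        self.low = deque(maxlen=capacity // 2)   # ordinary transitions
        self.reward_threshold = reward_threshold

    def add(self, state, action, reward, next_state, done):
        # Assign each transition to a segment by its immediate reward.
        transition = (state, action, reward, next_state, done)
        if reward > self.reward_threshold:
            self.high.append(transition)
        else:
            self.low.append(transition)

    def sample(self, batch_size, progress):
        """progress in [0, 1]: fraction of total training completed."""
        # Early in training, sample mostly ordinary experience; later,
        # bias toward the high-importance segment.
        n_high = min(int(batch_size * progress), len(self.high))
        n_low = min(batch_size - n_high, len(self.low))
        batch = (random.sample(list(self.high), n_high)
                 + random.sample(list(self.low), n_low))
        random.shuffle(batch)
        return batch  # may be smaller than batch_size if segments are sparse
```

In a DDPG training loop, progress would typically be step / total_steps, so early updates draw mostly from the ordinary segment while later updates increasingly draw the high-importance transitions.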
