IEEE International Conference on Software Engineering and Service Science

Orthogonal Policy Gradient and Autonomous Driving Application



Abstract

One less-addressed issue in deep reinforcement learning is the lack of generalization capability to new states and new targets. For complex tasks, it is necessary to give the correct strategy and to evaluate all possible actions for the current state. Fortunately, deep reinforcement learning has enabled enormous progress on both subproblems: giving the correct strategy and evaluating all actions based on the state. In this paper we present an approach called orthogonal policy gradient descent (OPGD) that enables an agent to learn the policy gradient based on the current state and the action set, by which the agent can learn a policy network with generalization capability. We evaluate the proposed method in the 3D autonomous driving environment TORCS against a baseline model; detailed analyses of the experimental results and proofs are also given.
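For orientation, the update the abstract builds on is the standard policy gradient, grad_theta J(theta) = E[grad_theta log pi_theta(a|s) * G]. The sketch below is a minimal REINFORCE-style implementation of that generic update on a toy task, assuming a tabular softmax policy; the abstract does not describe OPGD's orthogonalization step, so this is a baseline illustration rather than the authors' method, and all names (n_states, sample_episode, etc.) are hypothetical.

```python
import numpy as np

# Minimal REINFORCE sketch on a toy task (illustrative, not the paper's OPGD).
rng = np.random.default_rng(0)
n_states, n_actions = 4, 3
theta = np.zeros((n_states, n_actions))  # logits of a tabular softmax policy

def softmax(z):
    z = z - z.max()  # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def sample_episode(T=10):
    # Toy dynamics: states are drawn i.i.d.; reward is 1 when the action
    # matches the state modulo n_actions, else 0.
    s = int(rng.integers(n_states))
    traj = []
    for _ in range(T):
        probs = softmax(theta[s])
        a = int(rng.choice(n_actions, p=probs))
        r = 1.0 if a == s % n_actions else 0.0
        traj.append((s, a, r))
        s = int(rng.integers(n_states))
    return traj

alpha, gamma = 0.1, 0.99
for _ in range(2000):
    G = 0.0
    for s, a, r in reversed(sample_episode()):
        G = r + gamma * G                 # return-to-go
        probs = softmax(theta[s])
        grad_log = -probs                 # gradient of log pi(a|s) w.r.t. logits
        grad_log[a] += 1.0
        theta[s] += alpha * G * grad_log  # vanilla REINFORCE ascent step
```

After training, softmax(theta[s]) concentrates on the rewarded action for each state; OPGD, per the abstract, modifies how this gradient is computed from the current state and action set to improve generalization.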
