Cooperative Multiagent Deep Deterministic Policy Gradient (CoMADDPG) for Intelligent Connected Transportation with Unsignalized Intersection

Tianhao Wu; Mingzhi Jiang; Lin Zhang

Abstract

Unsignalized intersection control is one of the most critical issues in intelligent transportation systems; it requires connected and automated vehicles to support more frequent information exchange and on-board computing. Introducing reinforcement learning (RL) into unsignalized intersection control is very promising. However, existing multiagent reinforcement learning algorithms, such as multiagent deep deterministic policy gradient (MADDPG), struggle to handle a dynamic number of vehicles and therefore cannot meet the needs of real road conditions. To solve this problem, this paper proposes a Cooperative MADDPG (CoMADDPG) for connected vehicles at an unsignalized intersection. First, the scenario of multiple vehicles passing through an unsignalized intersection is formulated as a multiagent RL problem. Second, MADDPG is redefined to accommodate a dynamic number of agents: each vehicle selects reference vehicles to construct a partially stationary environment, which is necessary for RL. Third, the paper incorporates a novel vehicle selection method, which projects the reference vehicles onto a virtual lane and selects the vehicles with the largest impact to construct the environment. Finally, an intersection simulation platform is developed to evaluate the proposed method. According to the simulation results, CoMADDPG reduces average travel time by 39.28% compared with other optimization-based methods, which indicates that CoMADDPG is a promising approach to unsignalized intersection control.
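The abstract does not spell out how reference vehicles are projected or how "impact" is measured, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes each vehicle can be reduced to a single virtual-lane coordinate (here, its remaining distance to the intersection centre) and that the vehicles with the largest impact are the ones closest to the ego vehicle on that lane. The names `Vehicle`, `select_reference_vehicles`, `build_observation`, and `pad_gap` are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Vehicle:
    vehicle_id: str
    distance_to_center: float  # metres left to the intersection centre (assumed virtual-lane coordinate)
    speed: float               # metres per second


def select_reference_vehicles(ego: Vehicle, others: List[Vehicle], k: int) -> List[Vehicle]:
    """Pick the k vehicles nearest to the ego vehicle on the virtual lane.

    Assumption: "impact" is approximated by the absolute gap between
    virtual-lane coordinates; the paper may use a different measure.
    """
    ranked = sorted(others, key=lambda v: abs(v.distance_to_center - ego.distance_to_center))
    return ranked[:k]


def build_observation(ego: Vehicle, others: List[Vehicle], k: int,
                      pad_gap: float = 1000.0) -> List[float]:
    """Build a fixed-length observation regardless of how many vehicles are present."""
    refs = select_reference_vehicles(ego, others, k)
    obs = [ego.distance_to_center, ego.speed]
    for v in refs:
        # Relative position and relative speed of each reference vehicle.
        obs.extend([v.distance_to_center - ego.distance_to_center, v.speed - ego.speed])
    # Pad with "far away, same speed" placeholders when fewer than k vehicles exist.
    obs.extend([pad_gap, 0.0] * (k - len(refs)))
    return obs
```

Because the observation always has `2 + 2 * k` entries, a fixed-size actor and critic network could consume it even as vehicles enter and leave the intersection, which is the property the abstract attributes to selecting reference vehicles.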

Bibliographic details

Source: Mathematical Problems in Engineering: Theory, Methods and Applications, 2020, Issue 1, 12 pages
Authors: Tianhao Wu; Mingzhi Jiang; Lin Zhang
Original format: PDF
