Policy Optimization in Automated Point Merge Trajectory Planning: An Artificial Intelligence-based Approach

机译：自动点合并轨迹规划中的政策优化：基于人工智能的方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Air Traffic Management (ATM) is a complex decision-making process. Air traffic controllers' decision on aircraft trajectory control actions directly leads to the efficiency of traffic flow management. In the Automated Point Merge Trajectory Planning (APMTP) problem, it aims to realize an automated routine trajectory management in Terminal Manoeuvring Area (TMA) with an intelligent decision-making agent. An Artificial Intelligence-based approach, mainly Reinforcement Learning (RL) algorithm, is applied to adaptively and smartly integrate four types of de-conflict actions for solving conflicts with fewer delays on the environment. In this paper, we will mainly discuss the policy optimization in APMTP, focus on improving the agent's learning quality and exploration efficiency. Firstly, application of RL in adaptive trajectory planning is presented. APMTP problem is adaptively divided into several sub-problems. For each sub-problem, an online policy π is applied to guide the simulation and optimization modules to find out the conflict-free and less-delay solution. The online policy π is a scale of weight distribution for choosing desirable actions. It follows the rule of Roulette-wheel selection with weighted probability. The highest desirable decision variable has the largest share of the roulette wheel, while the lowest desirable decision variable has the smallest share of the roulette wheel. The RL direct policy optimization algorithm is designed to update the online policy π, Finally, experiments are built up for validation of the proposed policy optimization algorithm for the intelligent decision-making in APMTP. The results in the test environment show that learning agent with different exploration and exploitation ability will result in different system performance in conflict resolution and delay.

机译：空中交通管理（ATM）是一个复杂的决策过程。空中交通控制器对飞机轨迹控制行动的决定直接导致交通流管理的效率。在自动点合并轨迹规划（APMTP）问题中，它旨在通过智能决策代理实现终端操纵区域（TMA）中的自动常规轨迹管理。基于人工智能的方法，主要是加强学习（RL）算法应用于自适应，并自适应地整合四种类型的脱冲突动作，以解决环境的延迟较少的冲突。在本文中，我们将主要讨论APMTP中的政策优化，重点是提高代理商的学习质量和勘探效率。首先，介绍了RL在自适应轨迹规划中的应用。 APMTP问题是自适应地分为几个子问题。对于每个子问题，应用在线策略π指导模拟和优化模块，以找出无冲突和较少延迟的解决方案。在线策略π是选择所需动作的权重分配规模。它遵循Roulette-Wheel选择具有加权概率的规则。最高理想的决策变量具有轮盘赌轮的最大份额，而最低期望的决策变量具有轮盘赌轮的最小份额。 RL直接策略优化算法旨在更新在线策略π，最后，建立实验以验证APMTP中智能决策的提议策略优化算法。测试环境中的结果表明，具有不同勘探和开发能力的学习代理将导致冲突解决和延迟的不同系统性能。

著录项

来源
《IEEE/AIAA Digital Avionics Systems Conference》|2019年|1 v.|共8页
会议地点
作者
Man Liang; Weigang Li; Daniel Delahaye; Philippe Notry;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
—air traffic management; decision making; artificial intelligence; reinforcement learning; policy optimization;

机译：- 公式交通管理;决策;人工智能;强化学习;政策优化;

相似文献

外文文献
中文文献
专利

1. An artificial intelligence-based approach to deal with argumentation applied to food quality in a public health policy [J] . Jean-Remi Bourguet, Rallou Thomopoulos, Marie-Laure Mugnier, Expert Systems with Application . 2013,第11期

机译：基于人工智能的方法来处理适用于公共卫生政策中食品质量的论证
2. Motion Planning for Highly Automated Road Vehicles with a Hybrid Approach Using Nonlinear Optimization and Artificial Neural Networks [J] . Hegedus Ferenc, Becsi Tamas, Aradi Szilard, Journal of Mechanical Engineering . 2019,第3期

机译：具有非线性优化和人工神经网络的混合方法的高度自动化道路车辆的运动规划
3. Computer-Aided Reconfiguration Planning: An Artificial Intelligence-Based Approach [J] . Li Tang, Yoram Koren, Derek M. Yip-Hoi, Journal of Computing and Information Science in Engineering . 2006,第3期

机译：计算机辅助重配置计划：一种基于人工智能的方法
4. Policy Optimization in Automated Point Merge Trajectory Planning: An Artificial Intelligence-based Approach [C] . Man Liang, Weigang Li, Daniel Delahaye, IEEE/AIAA Digital Avionics Systems Conference . 2019

机译：自动点合并轨迹规划中的策略优化：基于人工智能的方法
5. Profile merging and code versioning for automated profile guided optimization systems. [D] . Saxena, Rahul. 2007

机译：概要文件合并和代码版本控制，用于自动概要文件引导的优化系统。
6. Optimizing Trajectories for Cranial Laser Interstitial Thermal Therapy Using Computer-Assisted Planning: A Machine Learning Approach [O] . Kuo Li, Vejay N. Vakharia, Rachel Sparks, 2019

机译：使用计算机辅助计划优化颅骨激光间质热疗法的轨迹：一种机器学习方法
7. Policy Optimization in Automated Point Merge Trajectory Planning: An Artificial Intelligence-based Approach [O] . Man Liang, Weigang Li, Daniel Delahaye, 2019

机译：自动点合并轨迹规划中的政策优化：基于人工智能的方法

Policy Optimization in Automated Point Merge Trajectory Planning: An Artificial Intelligence-based Approach

摘要

著录项

相似文献

相关主题

期刊订阅