Optimal UAV Base Station Trajectories Using Flow-Level Models for Reinforcement Learning

Saxena Vidit; Jalden Joakim; Klessig Henrik

首页> 外文期刊>IEEE Transactions on Cognitive Communications and Networking >Optimal UAV Base Station Trajectories Using Flow-Level Models for Reinforcement Learning

【24h】

Optimal UAV Base Station Trajectories Using Flow-Level Models for Reinforcement Learning

机译：最佳UAV基站轨迹使用流量模型进行强化学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Cellular base stations (BS) and remote radio heads can be mounted on unmanned aerial vehicles (UAV) for flexible, traffic-aware deployment. These UAV base station networks (UAVBSN) promise an unprecendented degree of freedom that can be exploited for spectral efficiency gains as well as optimal network utilization. However, the current literature lacks realistic radio and traffic models for UAVBSN deployment planning and for performance evaluation. In this paper, we propose flow-level models (FLM) for realistically characterizing the UAVBSN performance in terms of a broad range of flow- and system-level metrics. Further, we propose a deep reinforcement learning (DRL) approach that relies on the UAVBSN FLM for learning the optimal traffic-aware UAV trajectories. For a given user traffic density and starting UAV locations, our RL approach learns the optimal UAV trajectories offline that maximizes a cumulative performance metric. We then execute the learned UAV trajectories in a discrete event simulator to evaluate online UAVBSN performance. For M = 9 UAVs deployed in a simulated Downtown San Francisco model, where the UAV trajectories are defined by N = 20 discrete actions, our approach achieves approximately a three-fold increase in the average user throughput compared to the initial UAV placement, while simultaneously balancing traffic loads across the BSs.

机译：蜂窝基站（BS）和远程无线电头可以安装在无人驾驶飞行器（UAV）上，用于灵活，流量感知部署。这些无人机基站网络（UAVBSN）承诺可以利用频谱效率增益以及最佳网络利用率的不合格自由度。然而，目前的文献缺乏用于UAVBSN部署规划和绩效评估的现实无线电和交通模型。在本文中，我们提出了流动级模型（FLM），以便在广泛的流量和系统级度量方面进行实际描述UAVBSN性能。此外，我们提出了一种深度增强学习（DRL）方法，依赖于UADBSN FLM学习最佳的交通感知UAV轨迹。对于给定的用户流量密度和启动UAV位置，我们的RL方法会学习离线的最佳UAV轨迹，最大化累积性能度量。然后，我们在离散事件模拟器中执行学习的UAV轨迹，以评估在线UAVBSN性能。对于M = 9，在模拟的旧金山模型中部署的UAV，其中UAV轨迹由n = 20个离散动作定义，我们的方法与初始UAV放置相比，我们的方法在平均用户吞吐量上实现了大约三倍的增加，而同时平衡BSS的流量负载。

著录项

来源
《IEEE Transactions on Cognitive Communications and Networking》 |2019年第4期|1101-1112|共12页
作者
Saxena Vidit; Jalden Joakim; Klessig Henrik;
展开▼
作者单位

KTH Royal Inst Technol Dept Informat Sci & Engn S-17734 Stockholm Sweden|Ericsson AB Ericsson Res S-16480 Stockholm Sweden;

KTH Royal Inst Technol Dept Informat Sci & Engn S-17734 Stockholm Sweden;

Int Comp Sci Inst Edge Comp Serv Continu & Aerial Wireless Network Berkeley CA 94704 USA|Univ Calif Berkeley Berkeley CA 94720 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
UAV base stations; flow-level models; reinforcement learning; proximal policy optimization;

机译：UAV基站;流量级模型;加固学习;近端政策优化;

相似文献

外文文献
中文文献
专利

1. Neural Combinatorial Deep Reinforcement Learning for Age-Optimal Joint Trajectory and Scheduling Design in UAV-Assisted Networks [J] . Ferdowsi Aidin, Abd-Elmagid Mohamed A., Saad Walid, IEEE Journal on Selected Areas in Communications . 2021,第5期

机译：无人机辅助网络中年龄最优关节轨迹和调度设计的神经组合深度加固学习
2. Intelligent Trajectory Design for Secure Full- Duplex MIMO-UAV Relaying Against Active Eavesdroppers: A Model-Free Reinforcement Learning Approach [J] . Milad Tatar Mamaghani, Yi Hong Quality Control, Transactions . 2021,第1期

机译：智能轨迹设计，用于激活窃听器的安全全双工MIMO-UAV：一种无模型加强学习方法
3. Multi-Agent Deep Reinforcement Learning-Based Trajectory Planning for Multi-UAV Assisted Mobile Edge Computing [J] . Wang Liang, Wang Kezhi, Pan Cunhua, IEEE Transactions on Cognitive Communications and Networking . 2021,第1期

机译：基于多功能的深度加强学习基于学习的多UAV辅助移动边缘计算的轨迹规划
4. Optimal Trajectory Learning for UAV-BS Video Provisioning System: A Deep Reinforcement Learning Approach [C] . Dohyun Kwon, Joongheon Kim International Conference on Information Networking . 2019

机译：UAV-BS视频预配系统的最佳轨迹学习：一种深度强化学习方法
5. Understanding Model-Based Reinforcement Learning and its Application in Safe Reinforcement Learning [D] . Hu, Dingcheng . 2019

机译：了解基于模型的强化学习及其在安全强化学习中的应用
6. Intelligent Land-Vehicle Model Transfer Trajectory Planning Method Based on Deep Reinforcement Learning [O] . Lingli Yu, Xuanya Shao, Yadong Wei, 2018

机译：基于深度强化学习的智能陆车模型转移轨迹规划方法
7. Neural Combinatorial Deep Reinforcement Learning for Age-Optimal Joint Trajectory and Scheduling Design in UAV-Assisted Networks [O] . Aidin Ferdowsi, Mohamed A. Abd-Elmagid, Walid Saad, 2021

机译：无人机辅助网络中年龄最优关节轨迹和调度设计的神经组合深度加固学习

Optimal UAV Base Station Trajectories Using Flow-Level Models for Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅