Internet of Things Journal, IEEE

Reinforcement Learning for Decentralized Trajectory Design in Cellular UAV Networks With Sense-and-Send Protocol



Abstract

Recently, unmanned aerial vehicles (UAVs) have been widely used in real-time sensing applications over cellular networks. The performance of a UAV is determined by both its sensing and transmission processes, which are influenced by the trajectory of the UAV. However, it is challenging for a UAV to determine its trajectory, since it operates in a dynamic environment where other UAVs determine their trajectories dynamically and compete for the limited spectrum resources at the same time. To tackle this challenge, we adopt reinforcement learning to solve the UAV trajectory design problem in a decentralized manner. To coordinate multiple UAVs performing real-time sensing tasks, we first propose a sense-and-send protocol and analyze the probability of successful valid data transmission using nested Markov chains. Then, we propose an enhanced multi-UAV Q-learning algorithm to solve the decentralized UAV trajectory design problem. Simulation results show that the proposed algorithm converges faster and achieves higher utilities for the UAVs, compared to traditional single- and multi-agent Q-learning algorithms.
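The Q-learning approach mentioned in the abstract can be illustrated with a minimal tabular sketch. This is not the paper's enhanced multi-UAV algorithm; it is a single-agent toy in which a UAV on a one-dimensional grid learns a trajectory toward a sensing target. All names, rewards, and parameters here are illustrative assumptions.

```python
import random

GRID = 5           # positions 0..4 along a 1-D flight corridor (assumed)
TARGET = 4         # cell where the sensing task succeeds (assumed)
ACTIONS = [-1, 1]  # move left / move right

def train(episodes=500, alpha=0.5, gamma=0.9, eps=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    Q = {(s, a): 0.0 for s in range(GRID) for a in ACTIONS}
    for _ in range(episodes):
        s = 0                                  # UAV starts at cell 0
        for _ in range(20):                    # bounded episode length
            # epsilon-greedy action selection
            if rng.random() < eps:
                a = rng.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda act: Q[(s, act)])
            s2 = min(max(s + a, 0), GRID - 1)  # clip to the grid
            r = 1.0 if s2 == TARGET else -0.1  # reward: reach target, pay per step
            # Q-learning temporal-difference update:
            # Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
            best_next = max(Q[(s2, b)] for b in ACTIONS)
            Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
            s = s2
            if s == TARGET:
                break
    return Q

Q = train()
# Greedy policy recovered from the learned Q-table, one action per non-target cell.
policy = [max(ACTIONS, key=lambda a: Q[(s, a)]) for s in range(GRID - 1)]
print(policy)
```

In the paper's setting, the state would instead encode each UAV's position and the shared spectrum occupancy, and multiple such learners would update concurrently; this sketch only shows the core temporal-difference update they build on.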


