IEEE Transactions on Communications

Function Approximation Based Reinforcement Learning for Edge Caching in Massive MIMO Networks

Abstract

Caching popular contents in advance is an important technique for achieving low latency and reduced backhaul congestion in future wireless communication systems. In this article, a multi-cell massive multiple-input multiple-output (MIMO) system is considered, where the locations of base stations are distributed as a Poisson point process. Assuming probabilistic caching, the average success probability (ASP) of the system is derived for a known content popularity (CP) profile, which in practice is time-varying and not known in advance. Further, modeling CP variations across time as a Markov process, reinforcement Q-learning is employed to learn the content placement strategy that optimizes the long-term discounted ASP and the average cache refresh rate. In Q-learning, the number of Q-value updates is large and proportional to the numbers of states and actions. To reduce the space complexity and update requirements and thus make Q-learning scalable, two novel function-approximation-based Q-learning approaches (one linear, one non-linear) are proposed, in which only a constant number of variables (4 and 3, respectively) needs to be updated, irrespective of the numbers of states and actions. The convergence of these approximation-based approaches is analyzed. Simulations verify that both approaches converge and learn a similar best content placement, which demonstrates the applicability and scalability of the proposed approximated Q-learning schemes.
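As a rough illustration of the linear function-approximation idea in the abstract, the Python sketch below runs semi-gradient Q-learning with a constant-size (4-component) feature vector, so only 4 weights are updated regardless of the numbers of states and actions. The feature map, reward model, CP dynamics, and all parameter values are illustrative assumptions for the sketch, not the paper's exact formulation.

```python
import numpy as np

rng = np.random.default_rng(0)

N_STATES, N_ACTIONS = 8, 8          # CP profiles and candidate cache placements (assumed sizes)
ALPHA, GAMMA, EPS = 0.05, 0.9, 0.1  # step size, discount factor, exploration rate

def features(s, a):
    """Constant-size feature vector phi(s, a): 4 components, echoing the
    '4 variables' of the linear scheme (the map itself is illustrative)."""
    return np.array([1.0, s / N_STATES, a / N_ACTIONS,
                     (s * a) / (N_STATES * N_ACTIONS)])

theta = np.zeros(4)  # the only learned parameters

def q(s, a):
    """Approximate action value Q(s, a) = phi(s, a) . theta."""
    return features(s, a) @ theta

def reward(s, a):
    # Placeholder reward: higher "ASP" when the placement a matches the
    # CP profile s, minus a small cache-refresh penalty (both assumed).
    return 1.0 - abs(s - a) / N_STATES - 0.1 * (a / N_ACTIONS)

def step(s):
    # CP profile evolving as a simple Markov chain (assumed dynamics).
    return int((s + rng.choice([-1, 0, 1])) % N_STATES)

s = 0
for t in range(20000):
    # epsilon-greedy selection over cache placements
    if rng.random() < EPS:
        a = int(rng.integers(N_ACTIONS))
    else:
        a = max(range(N_ACTIONS), key=lambda x: q(s, x))
    s_next = step(s)
    r = reward(s, a)
    # semi-gradient Q-learning update touching only the 4 weights
    td_target = r + GAMMA * max(q(s_next, x) for x in range(N_ACTIONS))
    theta += ALPHA * (td_target - q(s, a)) * features(s, a)
    s = s_next

print("learned weights:", theta)
print("greedy placement per CP profile:",
      [max(range(N_ACTIONS), key=lambda x: q(p, x)) for p in range(N_STATES)])
```

Note the contrast with tabular Q-learning, which would maintain and update N_STATES x N_ACTIONS entries: here the per-step update cost and storage stay constant, which is the scalability argument the abstract makes.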