IEEE Transactions on Vehicular Technology

Cooperative Caching and Fetching in D2D Communications - A Fully Decentralized Multi-Agent Reinforcement Learning Approach

Abstract

To satisfy the increasing demands of cellular traffic, cooperative content caching at the network edge (e.g., on User Equipment) has become a promising paradigm for next-generation cellular networks. Device-to-Device (D2D) communications can improve content caching and fetching performance without deploying additional infrastructure. In this paper, we investigate the joint optimization of cooperative caching and fetching in a dynamic D2D environment to minimize the overall content fetching delay. We formulate the problem as a decentralized partially observable Markov game in which each agent seeks an optimal policy. To solve it, we propose a Fully Decentralized Soft Multi-Agent Reinforcement Learning (FDS-MARL) algorithm, which extends the soft actor-critic framework to a non-stationary multi-agent environment for fully decentralized learning. FDS-MARL comprises three major design components: Graph Attention Network based self-attention for cooperative inter-agent coordination; a consensus communication mechanism that reduces information loss and the non-stationarity of the environment while maintaining gradual global consensus; and an influence-based transmission scheduling mechanism for effective credit assignment and for alleviating potential transmission contentions among agents. Simulation results show that FDS-MARL significantly improves content caching and fetching performance compared with representative work in the literature.
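
The abstract describes the architecture only at a high level, so below is a minimal, hypothetical sketch of what one FDS-MARL-style agent could look like: a discrete soft actor-critic whose observation encoder applies GAT-style self-attention over neighbor embeddings, plus a naive parameter-averaging stand-in for the consensus communication step. It is written in PyTorch; every name here (GATLayer, CacheAgent, consensus_step, obs_dim, n_actions, the mixing weight mix) is an illustrative assumption, not taken from the paper, and the influence-based transmission scheduling component is omitted.

```python
# Hypothetical sketch of an FDS-MARL-style agent; not the paper's implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class GATLayer(nn.Module):
    """Single-head graph-attention aggregation over an agent's neighbors."""
    def __init__(self, dim):
        super().__init__()
        self.W = nn.Linear(dim, dim, bias=False)    # shared linear transform
        self.a = nn.Linear(2 * dim, 1, bias=False)  # attention scoring vector

    def forward(self, h_self, h_neigh):
        # h_self: (dim,); h_neigh: (n_neighbors, dim)
        h_all = torch.cat([h_self.unsqueeze(0), h_neigh], dim=0)
        hw = self.W(h_all)
        # score each (self, j) pair, then softmax into attention weights
        pairs = torch.cat([hw[0].expand_as(hw), hw], dim=-1)
        alpha = F.softmax(F.leaky_relu(self.a(pairs)).squeeze(-1), dim=0)
        return F.elu((alpha.unsqueeze(-1) * hw).sum(dim=0))

class CacheAgent(nn.Module):
    """One agent: local observation encoder + soft (entropy-regularized) policy."""
    def __init__(self, obs_dim, n_actions, dim=64, alpha=0.05):
        super().__init__()
        self.encode = nn.Sequential(nn.Linear(obs_dim, dim), nn.ReLU())
        self.attend = GATLayer(dim)
        self.pi = nn.Linear(dim, n_actions)  # actor head over cache/fetch actions
        self.q = nn.Linear(dim, n_actions)   # critic head
        self.alpha = alpha                   # entropy temperature

    def act(self, obs, neigh_embs):
        h = self.attend(self.encode(obs), neigh_embs)
        dist = torch.distributions.Categorical(logits=self.pi(h))
        a = dist.sample()
        return a, dist.log_prob(a), h

    def soft_value(self, h):
        # Soft state value: V(s) = E_pi[ Q(s,a) - alpha * log pi(a|s) ]
        probs = F.softmax(self.pi(h), dim=-1)
        logp = torch.log(probs + 1e-8)
        return (probs * (self.q(h) - self.alpha * logp)).sum(-1)

def consensus_step(agents, mix=0.5):
    # Gradual consensus by mixing each agent's parameters toward the mean of
    # its peers; a crude stand-in for the paper's consensus communication.
    with torch.no_grad():
        avg = [torch.stack(ps).mean(0)
               for ps in zip(*[list(a.parameters()) for a in agents])]
        for a in agents:
            for p, m in zip(a.parameters(), avg):
                p.mul_(1 - mix).add_(mix * m)
```

In a real deployment each agent would mix parameters only with its one-hop D2D neighbors and train the actor and critic with the usual soft actor-critic losses; the sketch only shows how attention-weighted neighbor information and gradual parameter consensus can be wired into a fully decentralized agent.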