CBMoS: Combinatorial Bandit Learning for Mode Selection and Resource Allocation in D2D Systems

Ortiz Andrea; Asadi Arash; Engelhardt Max; Klein Anja; Hollick Matthias

Abstract

The complexity of the mode selection and resource allocation (MS&RA) problem has hampered the commercialization progress of Device-to-Device (D2D) communication in 5G networks. Furthermore, the combinatorial nature of MS&RA has forced the majority of existing proposals to focus on constrained scenarios or offline solutions to contain the size of the problem. Given the real-time constraints in actual deployments, a reduction in computational complexity is necessary. Adaptability is another key requirement for mobile networks that are exposed to constant changes such as channel quality fluctuations and mobility. In this article, we propose an online learning technique (i.e., CBMoS) which leverages combinatorial multi-armed bandits (CMAB) to tackle the combinatorial nature of MS&RA. Furthermore, our two-stage CMAB design results in a tight model, which eliminates the theoretically feasible but practically invalid options from the solution space. We prototype the first SDR-based D2D testbed to verify the performance of CBMoS under real-world conditions. The simulations confirm that the fast learning speed of CBMoS leads to outperforming the benchmark schemes by up to 132%. In experiments, CBMoS exhibits even higher performance (up to 142%) than in the simulations. This stems from the adaptability/fast learning speed of CBMoS in the presence of high channel dynamics which cannot be captured via statistical channel models used in the simulators.
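To give a rough feel for what a combinatorial bandit formulation of MS&RA can look like, the following is a minimal Python sketch of a generic UCB-style learner. It is not the CBMoS algorithm and does not reproduce the two-stage design mentioned in the abstract. Everything in it is an assumption made for illustration: the mode set, the three resource blocks, the rule that each resource block serves at most one pair per round, the greedy super-arm selection, the exploration constant, and the synthetic Gaussian rewards.

```python
import math
import random
from itertools import product

MODES = ("cellular", "d2d")  # hypothetical mode set
RBS = (0, 1, 2)              # hypothetical resource blocks


class CombinatorialUCB:
    """Generic UCB learner over base arms (pair, mode, resource block).

    A super arm assigns one (mode, resource block) to every pair.
    Requires n_pairs <= len(RBS) because blocks are not shared here.
    """

    def __init__(self, n_pairs):
        self.n_pairs = n_pairs
        self.arms = list(product(MODES, RBS))
        self.counts = {(p, a): 0 for p in range(n_pairs) for a in self.arms}
        self.means = {key: 0.0 for key in self.counts}
        self.t = 0

    def _index(self, key):
        n = self.counts[key]
        if n == 0:
            return float("inf")  # force one play of every base arm
        return self.means[key] + math.sqrt(1.5 * math.log(self.t) / n)

    def select(self):
        """Greedily build a super arm: each pair takes its best free arm."""
        self.t += 1
        used, choice = set(), []
        for p in range(self.n_pairs):
            free = [a for a in self.arms if a[1] not in used]
            best = max(free, key=lambda a: self._index((p, a)))
            used.add(best[1])
            choice.append(best)
        return choice

    def update(self, choice, rewards):
        """Semi-bandit feedback: one observed reward per played base arm."""
        for p, (arm, r) in enumerate(zip(choice, rewards)):
            key = (p, arm)
            self.counts[key] += 1
            self.means[key] += (r - self.means[key]) / self.counts[key]


def run(n_pairs=2, rounds=500, seed=0):
    """Simulate the learner against synthetic, clipped Gaussian rewards."""
    rng = random.Random(seed)
    true_mean = {(p, a): rng.random()
                 for p in range(n_pairs) for a in product(MODES, RBS)}
    learner = CombinatorialUCB(n_pairs)
    total = 0.0
    for _ in range(rounds):
        choice = learner.select()
        rewards = [min(1.0, max(0.0, rng.gauss(true_mean[(p, a)], 0.1)))
                   for p, a in enumerate(choice)]
        learner.update(choice, rewards)
        total += sum(rewards)
    return learner, total / rounds


if __name__ == "__main__":
    learner, avg = run()
    print("average reward per round:", round(avg, 3))
    print("current assignment:", learner.select())
```

The greedy per-pair selection is only a stand-in for a proper combinatorial oracle; it keeps the sketch short but can miss the best joint assignment when pairs compete for the same resource block.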

Bibliographic record

Source: IEEE Journal on Selected Areas in Communications, 2019, Issue 10, pp. 2225-2238 (14 pages)
Authors: Ortiz Andrea; Asadi Arash; Engelhardt Max; Klein Anja; Hollick Matthias
Affiliations:

Tech Univ Darmstadt Commun Engn Lab D-64283 Darmstadt Germany;

Tech Univ Darmstadt Secure Mobile Networking Lab SEEMOO D-64283 Darmstadt Germany;

Tech Univ Darmstadt Secure Mobile Networking Lab SEEMOO D-64283 Darmstadt Germany|Vector Informat GmbH Stuttgart Germany;

Original format: PDF
Language: English
Keywords:
Device-to-device communications; combinatorial multi-armed bandits; mode selection and resource allocation; online learning;

Similar literature

1. Energy-Efficient Mode Selection and Resource Allocation for D2D-Enabled Heterogeneous Networks: A Deep Reinforcement Learning Approach [J]. Zhang Tao, Zhu Kun, Wang Junhua. IEEE Transactions on Wireless Communications, 2021, Issue 2.
2. Graph-Theory-Based Resource Allocation and Mode Selection in D2D Communication Systems: The Role of Full-Duplex [J]. Jeon Hong-Bae, Koo Bon-Hong, Park Sung-Ho. Wireless Communications Letters, IEEE, 2021, Issue 2.
3. A Two-Stages Relay Selection and Resource Allocation with Throughput Balance Scheme in Relay-Assisted D2D System [J]. Gu Xinyu, Zhao Ming, Ren Luming. Mobile Networks & Applications, 2017, Issue 6.
4. Performance Analysis of Multi-Armed Bandit Based Resource Allocation Algorithms for D2D Communication Systems [C]. Rahul Shanbhag, Rahul Bajpai, Naveen Gupta. International Conference on Telecommunications, 2020.
5. Market-based model predictive control for survivable distributed information systems: Resource allocation and algorithm selection [D]. Lee, Seokcheon. 2005.
6. Mode Selection and Spectrum Allocation in Coexisting D2D and Cellular Networks with Cooperative Precoding [O]. Yu-Wei Chan, Feng-Tsun Chien, Chao-Tung Yang. 2019.
7. Mode Selection and Resource Allocation Algorithm in Energy-Harvesting D2D Heterogeneous Network [O]. Jie Yan, Zhufang Kuang, Fan Yang. 2019.
