...
首页> 外文期刊>IEEE Journal on Selected Areas in Communications >CBMoS: Combinatorial Bandit Learning for Mode Selection and Resource Allocation in D2D Systems
【24h】

CBMoS: Combinatorial Bandit Learning for Mode Selection and Resource Allocation in D2D Systems

机译:CBMoS:D2D系统中模式选择和资源分配的组合式强盗学习

获取原文
获取原文并翻译 | 示例

摘要

The complexity of the mode selection and resource allocation (MS&RA) problem has hampered the commercialization progress of Device-to-Device (D2D) communication in 5G networks. Furthermore, the combinatorial nature of MS&RA has forced the majority of existing proposals to focus on constrained scenarios or offline solutions to contain the size of the problem. Given the real-time constraints in actual deployments, a reduction in computational complexity is necessary. Adaptability is another key requirement for mobile networks that are exposed to constant changes such as channel quality fluctuations and mobility. In this article, we propose an online learning technique (i.e., CBMoS) which leverages combinatorial multi-armed bandits (CMAB) to tackle the combinatorial nature of MS&RA. Furthermore, our two-stage CMAB design results in a tight model, which eliminates the theoretically feasible but practicality invalid options from the solution space. We prototype the first SDR-based D2D testbed to verify the performance of CBMoS under real-world conditions. The simulations confirm that the fast learning speed of CBMoS leads to outperforming the benchmark schemes by up to 132%. In experiments, CBMoS exhibits even higher performance (up to 142%) than in the simulations. This stems from the adaptability/fast learning speed of CBMoS in presence of high channel dynamics which cannot be captured via statistical channel models used in the simulators.
机译:模式选择和资源分配(MS&RA)问题的复杂性阻碍了5G网络中设备到设备(D2D)通信的商业化进程。此外,MS&RA的组合性质迫使大多数现有建议集中于受约束的方案或脱机解决方案以控制问题的规模。考虑到实际部署中的实时约束,必须降低计算复杂性。适应性是面临不断变化(例如信道质量波动和移动性)的移动网络的另一个关键要求。在本文中,我们提出了一种在线学习技术(即CBMoS),该技术利用组合式多臂土匪(CMAB)来解决MS&RA的组合性问题。此外,我们的两阶段CMAB设计产生了一个紧模型,从而从解决方案空间中排除了理论上可行但实用的无效选择。我们制作了第一个基于SDR的D2D测试平台的原型,以验证CBMoS在实际条件下的性能。仿真结果表明,CBMoS的快速学习速度导致其性能比基准方案高出132%。在实验中,CBMoS表现出比模拟更高的性能(高达142%)。这是由于CBMoS在高通道动态情况下的适应性/快速学习速度所致,而高通道动态无法通过模拟器中使用的统计通道模型来捕获。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号