首页> 外文会议>International Conference on Information and Communication Technology Convergence >Combinatorial multi-armed bandits in cognitive radio networks: A brief overview
【24h】

Combinatorial multi-armed bandits in cognitive radio networks: A brief overview

机译:认知无线电网络中的组合式多臂匪:简要概述

获取原文

摘要

Combinatorial multi-armed bandit (MAB) problem can be used to formulate sequential decision problems with exploration-exploitation tradeoff. Dynamic spectrum access (DSA) in cognitive radio (CR) networks is one of important applications. In this work, we briefly overview combinatorial MAB problems with its possible applications to CR networks. We first investigate the standard MAB problems where a single player either explores an arm to gather information to improve its decision strategy, or exploits the arm based on the information that it has collected at each round. Then, we study the taxonomy of combinatorial MAB problems, in particular for multi-player scenarios with independent and identically distributed (i.i.d.) rewards. Finally, we discuss limitations of existing works and interesting open problems.
机译:组合式多臂匪(MAB)问题可用于制定具有勘探与开发权衡的顺序决策问题。认知无线电(CR)网络中的动态频谱访问(DSA)是重要的应用之一。在这项工作中,我们简要概述了组合式MAB问题及其在CR网络中的可能应用。我们首先研究标准的MAB问题,其中单个玩家要么探索一支手臂来收集信息以改善其决策策略,要么根据每一轮收集的信息来利用该手臂。然后,我们研究组合式MAB问题的分类法,特别是对于具有独立且均等分布(即i.d.)奖励的多玩家场景。最后,我们讨论了现有作品的局限性和有趣的开放性问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号