【24h】

Multi-channel Opportunistic Access: A Case of Restless Bandits with Multiple Plays

机译:多渠道机会访问:多玩的不安土匪案例

获取原文
获取外文期刊封面目录资料

摘要

This paper considers the following stochastic control problem that arises in opportunistic spectrum access: a system consists of n channels where the state ("good" or "bad") of each channel evolves as independent and identically distributed Markov processes. A user can select exactly k channels to sense and access (based on the sensing result) in each time slot. A reward is obtained whenever the user senses and accesses a "good" channel. The objective is to design a channel selection policy that maximizes the expected discounted total reward accrued over a finite or infinite horizon. In our previous work we established the optimality of a greedy policy for the special case of k = 1 (i.e., single channel access) under the condition that the channel state transitions are positively correlated over time. In this paper we show under the same condition the greedy policy is optimal for the general case of k ≥ 1; the methodology introduced here is thus more general. This problem may be viewed as a special case of the restless bandit problem, with multiple plays. We discuss connections between the current problem and existing literature on this class of problems.
机译:本文考虑了机会频谱访问中出现的以下随机控制问题:系统由N个通道组成,其中每个通道的状态(“良好”或“坏”)作为独立和相同分布的Markov进程演变。用户可以精确地选择k个通道以在每个时隙中感测和访问(基于感测结果)。每当用户感知并访问“良好”频道时,获得奖励。目的是设计一个渠道选择策略,最大化有限或无限地平线上的预期折扣总奖励。在我们之前的工作中,我们在信道状态转换随时间呈正相关的条件下建立了k = 1(即单通道访问)的特殊情况的贪婪策略的最优性。在本文中,我们在相同的条件下显示贪婪政策对于k≥1的一般情况是最佳的。因此,介绍的方法更加一般。该问题可以被视为一个特殊的案例,具有焦躁的强盗问题,具有多个播放。我们讨论当前问题与现有文学之间的联系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号