Multi-channel Opportunistic Access: A Case of Restless Bandits with Multiple Plays

机译：多渠道机会访问：多玩的不安土匪案例

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper considers the following stochastic control problem that arises in opportunistic spectrum access: a system consists of n channels where the state ("good" or "bad") of each channel evolves as independent and identically distributed Markov processes. A user can select exactly k channels to sense and access (based on the sensing result) in each time slot. A reward is obtained whenever the user senses and accesses a "good" channel. The objective is to design a channel selection policy that maximizes the expected discounted total reward accrued over a finite or infinite horizon. In our previous work we established the optimality of a greedy policy for the special case of k = 1 (i.e., single channel access) under the condition that the channel state transitions are positively correlated over time. In this paper we show under the same condition the greedy policy is optimal for the general case of k ≥ 1; the methodology introduced here is thus more general. This problem may be viewed as a special case of the restless bandit problem, with multiple plays. We discuss connections between the current problem and existing literature on this class of problems.

机译：本文考虑了机会频谱访问中出现的以下随机控制问题：系统由N个通道组成，其中每个通道的状态（“良好”或“坏”）作为独立和相同分布的Markov进程演变。用户可以精确地选择k个通道以在每个时隙中感测和访问（基于感测结果）。每当用户感知并访问“良好”频道时，获得奖励。目的是设计一个渠道选择策略，最大化有限或无限地平线上的预期折扣总奖励。在我们之前的工作中，我们在信道状态转换随时间呈正相关的条件下建立了k = 1（即单通道访问）的特殊情况的贪婪策略的最优性。在本文中，我们在相同的条件下显示贪婪政策对于k≥1的一般情况是最佳的。因此，介绍的方法更加一般。该问题可以被视为一个特殊的案例，具有焦躁的强盗问题，具有多个播放。我们讨论当前问题与现有文学之间的联系。

著录项

来源
《Annual allerton conference on communication control, and computing;Allerton conference on communication control, and computing;Allerton 2009》|2009年|P.1361-1368|共8页
会议地点
作者
Sahand Haji Ali Ahmad; Mingyan Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Opportunistic Scheduling Revisited Using Restless Bandits: Indexability and Index Policy [J] . Wang Kehao, Yu Jihong, Chen Lin, IEEE transactions on wireless communications . 2019,第10期

机译：使用不安的匪徒重新探讨机会调度：可索引性和索引策略
2. Navigation Data-Assisted Opportunistic Spectrum Scheduling for Network-Based UAV Systems: A Parallel Restless Bandits Formulation [J] . Si Pengbo, Yu F. Richard, Yang Ruizhe, Wireless personal communications: An Internaional Journal . 2015,第1期

机译：基于网络的无人机系统的导航数据辅助机会频谱调度：并行的不安定土匪公式
3. Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access [J] . Liu K.Zhao Q. Information Theory, IEEE Transactions on . 2010,第11期

机译：动态多通道访问的不安定匪问题的可索引性和Whittle索引的最优性
4. Multi-channel Opportunistic Access: A Case of Restless Bandits with Multiple Plays [C] . Sahand Haji Ali Ahmad, Mingyan Liu Annual Allerton Conference on Communication, Control, and Computing . 2009

机译：多通道机会访问：一种差别乐队的案例
5. Learning in A Changing World: Restless Multi-Armed Bandit with Unknown Dynamics [D] . Liu, Haoyang 2013

机译：在瞬息万变的世界中学习：具有未知动态的躁动多臂强盗
6. Nash Equilibrium of Social-Learning Agents in a Restless Multiarmed Bandit Game [O] . Kazuaki Nakayama, Masato Hisakado, Shintaro Mori -1

机译：躁动多臂强盗游戏中的社会学习代理人的纳什均衡
7. Multi-channel opportunistic access: a case of restless bandits with multiple plays [O] . Sahand Haji Ali Ahmad, Mingyan Liu 2009

机译：多渠道机会访问：多玩的不安土匪的情况
8. Myopic Policy for a Class of Restless Bandit Problems with Applications in Dynamic Multichannel Access [R] . Liu, K., Zhao, Q. 2009

机译：一类不安全强盗问题的近视策略及其在动态多通道接入中的应用

Multi-channel Opportunistic Access: A Case of Restless Bandits with Multiple Plays

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅