Cost-Aware Learning and Optimization for Opportunistic Spectrum Access

机译：机会成本频谱访问的成本意识学习和优化

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we investigate cost-aware joint learning and optimization for multi-channel opportunistic spectrum access in a cognitive radio system. We investigate a discrete-time model where the time axis is partitioned into frames. Each frame consists of a sensing phase, followed by a transmission phase. During the sensing phase, the user is able to sense a subset of channels sequentially before it decides to use one of them in the following transmission phase. We assume the channel states alternate between busy and idle according to independent Bernoulli random processes from frame to frame. To capture the inherent uncertainty in channel sensing, we assume the reward of each transmission when the channel is idle is a random variable. We also associate random costs with sensing and transmission actions. Our objective is to understand how the costs and reward of the actions would affect the optimal behavior of the user in both offline and online settings, and design the corresponding opportunistic spectrum access strategies to maximize the expected cumulative net reward (i.e., reward-minus-cost).We start with an offline setting where the statistics of the channel status, costs and reward are known beforehand. We show that the the optimal policy exhibits a recursive double-threshold structure, and the user needs to compare the channel statistics with those thresholds sequentially in order to decide its actions. With such insights, we then study the online setting, where the statistical information of the channels, costs and reward are unknown a priori. We judiciously balance exploration and exploitation, and show that the cumulative regret scales in O(log T). We also establish a matched lower bound, which implies that our online algorithm is order-optimal. Simulation results corroborate our theoretical analysis.

机译：在本文中，我们研究了认知无线电系统中用于多信道机会频谱访问的成本感知型联合学习和优化。我们研究了一个离散时间模型，其中时间轴被划分为多个帧。每个帧都包含一个检测阶段，然后是一个传输阶段。在感测阶段，用户可以决定在随后的传输阶段中使用其中一个信道之前，先依次感知信道的一个子集。我们假设信道状态根据帧之间独立的伯努利随机过程在忙碌和空闲之间交替。为了捕获信道感测中固有的不确定性，我们假设信道空闲时每次传输的报酬是随机变量。我们还将随机成本与感知和传输动作相关联。我们的目标是了解操作的成本和奖励将如何影响用户在离线和在线环境中的最佳行为，并设计相应的机会频谱访问策略，以最大化预期的累积净奖励（即，奖励减去-费用）。我们从一个离线设置开始，该渠道设置会预先知道频道状态，费用和奖励的统计信息。我们表明，最佳策略表现出递归双阈值结构，并且用户需要顺序地将信道统计信息与那些阈值进行比较，以便确定其操作。有了这些见解，我们便研究了在线设置，其中先验未知渠道，成本和奖励的统计信息。我们明智地权衡了勘探与开发之间的关系，并表明累积后悔程度以O（log T）表示。我们还建立了一个匹配的下限，这意味着我们的在线算法是顺序最优的。仿真结果证实了我们的理论分析。

著录项

来源
《Information Theory and Applications Workshop》|2020年|1-12|共12页
会议地点
作者
Chao Gan; Ruida Zhou; Jing Yang; Cong Shen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Opportunistic spectrum access; sensing cost; sensing uncertainty; cascading bandits;

机译：机会频谱访问;传感成本;传感不确定性;级联强盗;

相似文献

外文文献
中文文献
专利

1. Cost-Aware Learning and Optimization for Opportunistic Spectrum Access [J] . Chao Gan, Ruida Zhou, Jing Yang, IEEE Transactions on Cognitive Communications and Networking . 2019,第1期

机译：机会成本频谱访问的成本意识学习和优化
2. Machine-Learning-Based Opportunistic Spectrum Access in Cognitive Radio Networks [J] . IEEE Wireless Communications . 2020,第1期

机译：认知无线电网络中基于机器学习的机会频谱访问
3. LA-MHR: Learning Automata Based Multilevel Heterogeneous Routing for Opportunistic Shared Spectrum Access to Enhance Lifetime of WSN [J] . Tanwar Sudeep, Tyagi Sudhanshu, Kumar Neeraj, IEEE systems journal . 2019,第1期

机译：LA-MHR：学习基于自动机的多级异构路由，以获得机会共享频谱，以延长WSN的使用寿命
4. Optimization of Spectrum Sensing for Opportunistic Spectrum Access in Cognitive Radio Networks [C] . Amir Ghasemi, Elvino S. Sousa IEEE Consumer Communications and Networking Conference . 2007

机译：认知无线电网络机遇频谱访问的频谱感应优化
5. Opportunistic Access of Noncontiguous Spectrum [D] . Kumbhkar, Ratnesh. 2018

机译：非连续频谱的机会访问
6. Decentralized Opportunistic Spectrum Resources Access Model and Algorithm toward Cooperative Ad-Hoc Networks [O] . Ming Liu, Yang Xu, Abdul-Wahid Mohammed 2011

机译：协同Ad-Hoc网络的分散机会频谱资源访问模型和算法
7. Cost-Aware Learning and Optimization for Opportunistic Spectrum Access [O] . Chao Gan, Ruida Zhou, Jing Yang, 2020

机译：机会主义频谱访问的成本感知学习和优化

Cost-Aware Learning and Optimization for Opportunistic Spectrum Access

摘要

著录项

相似文献

相关主题

期刊订阅