...
首页> 外文期刊>Selected Areas in Communications, IEEE Journal on >Online Pricing of Secondary Spectrum Access with Unknown Demand Function
【24h】

Online Pricing of Secondary Spectrum Access with Unknown Demand Function

机译:具有未知需求函数的二次频谱访问的在线定价

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We consider a wireless provider who caters to two classes of customers, namely primary users (PUs) and secondary users (SUs). PUs have long term contracts while SUs are admitted and priced according to current availability of excess spectrum. The average rate at which SUs attempt to access the spectrum is a function on the currently advertised price, referred to as the demand function. We analyze the problem of maximizing the average profit gained by admissions of SUs, when the demand function is unknown. We introduce a new on-line algorithm, called Measurement-based Threshold Pricing (MTP), that requires the optimization of only two parameters, a price and a threshold, whereby SU calls are admitted and charged a fixed price when the channel occupancy is lower than the threshold and rejected otherwise. At each iteration, MTP measures the average arrival rate of SUs corresponding to a certain test price. We prove that these measurements of the secondary demand are sufficient for MTP to converge to a local optimal price and corresponding optimal threshold, within a number of measurements that is logarithmic in the total number of possible prices. We further provide an adaptive version of MTP that adjusts to time-varying demand and establish its convergence properties. We conduct numerical studies showing the convergence of MTP to near-optimal online profit and its superior performance over a traditional reinforcement learning approach.
机译:我们考虑一家无线服务提供商,它可以满足两类客户,即主要用户(PU)和次要用户(SU)。 PU具有长期合同,而SU则根据当前剩余频谱的可用性来接受和定价。 SU尝试访问频谱的平均速率是当前广告价格的函数,称为需求函数。当需求函数未知时,我们分析了最大化SU的接纳所获得的平均利润的问题。我们引入了一种新的在线算法,称为基于测量的阈值定价(MTP),该算法仅需要优化两个参数,即价格和阈值,从而当信道占用率较低时,允许SU呼叫被接纳并收取固定价格超过阈值,否则拒绝。在每次迭代中,MTP都会测量与某个测试价格相对应的SU的平均到达率。我们证明,这些次级需求的度量足以使MTP收敛到局部最优价格和相应的最优阈值,而这些度量在所有可能价格的对数中都是对数。我们还提供了MTP的自适应版本,该版本可适应随时间变化的需求并建立其收敛特性。我们进行的数值研究表明,MTP融合到接近最佳的在线利润中,并且其性能优于传统的强化学习方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号