首页> 外文会议>International Conference on Electronics, Communication and Aerospace Technology >Combinatorial Sleeping Bandits with Fairness Constraints and Long-Term Non-Availability of Arms
【24h】

Combinatorial Sleeping Bandits with Fairness Constraints and Long-Term Non-Availability of Arms

机译:组合睡衣带有公平限制和武器的长期不可释放

获取原文

摘要

In this paper, the situation of long term non-availability of arms in combinatorial sleeping bandits problem is analyzed. The multi-arm sleeping bandit’s model along with fairness constraint is very widely used to model real world examples like a network switch. One common occurrence in such a scenario is long term non-availability. In such cases the queue length (in the Queuing techniques) grows rapidly causing system instability. The algorithm proposed in this paper deals with this problem and still maintain the regret bounds along with the queue fairness constraints. A better way of estimating the fairness that takes into account the long term non-availability of arms is also proposed. Extension of the UCB algorithm is used to deal with the exploration versus exploitation dilemma. Mathematical proofs for arriving at the regret bounds and feasibility optimality is given in the end.
机译:本文分析了组合睡眠杆问题中长期非可用性的情况。多臂睡眠强盗的模型以及公平约束非常广泛地用于建模像网络交换机的真实世界示例。这种情况下的一个常见发生是长期非可用性。在这种情况下,队列长度(在排队技术中)迅速增长,导致系统不稳定性。本文提出的算法涉及此问题,仍然将遗憾范围与队列公平限制相同。还提出了一种估计考虑到武器长期不可用的公平性的更好方法。 UCB算法的扩展用于处理勘探与开发困境。最后给出了到达遗憾范围和可行性最优性的数学证据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号