首页> 外文会议>Automata, languages and programming >Multi-armed Bandits with Metric Switching Costs
【24h】

Multi-armed Bandits with Metric Switching Costs

机译:公制转换成本的多臂土匪

获取原文
获取原文并翻译 | 示例

摘要

In this paper we consider the stochastic multi-armed bandit with metric switching costs. Given a set of locations (arms) in a metric space and prior information about the reward available at these locations, cost of getting a sample/play at every location and rules to update the prior based on samples/plays, the task is to maximize a certain objective function constrained to a distance cost of L and cost of plays C. This fundamental and well-studied problem models several optimization problems in robot navigation, sensor networks, labor economics, etc.rnIn this paper we develop a general duality-based framework to provide the first O(1) approximation for metric switching costs; the actual constants being quite small. Since these problems are Max-SNP hard, this result is the best possible. The overall technique and the ensuing structural results are independently of interest in the context of bandit problems with complicated side-constraints. Our techniques also improve the approximation ratio of the budgeted learning problem from 4 to 3 + ε.
机译:在本文中,我们考虑具有度量转换成本的随机多臂匪。给定度量空间中的一组位置(武器)以及有关在这些位置可获得的奖励的先验信息,在每个位置获得样本/比赛的成本以及基于样本/比赛更新先验的规则,任务是最大化一个确定的目标函数,约束到L的距离成本和游戏成本C。这个经过深入研究的基本问题模型对机器人导航,传感器网络,劳动经济学等方面的几个优化问题进行了建模。提供度量转换成本的第一个O(1)近似值的框架;实际常数很小。由于这些问题很难解决Max-SNP问题,因此这种结果是最好的。在具有复杂的侧约束的土匪问题的情况下,整体技术和随后的结构结果不受关注。我们的技术还将预算学习问题的近似率从4提高到3 +ε。

著录项

  • 来源
  • 会议地点 Rhodes(GR);Rhodes(GR);Rhodes(GR);Rhodes(GR);Rhodes(GR);Rhodes(GR);Rhodes(GR);Rhodes(GR);Rhodes(GR);Rhodes(GR)
  • 作者

    Sudipto Guha; Kamesh Munagala;

  • 作者单位

    Department of Computer and Information Sciences, University of Pennsylvania,Philadelphia PA 19104-6389;

    Department of Computer Science, Duke University, Durham NC 27708-0129;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 程序设计、软件工程;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号