首页> 外文会议>Conference on Neural Information Processing Systems >Pure Exploration with Multiple Correct Answers
【24h】

Pure Exploration with Multiple Correct Answers

机译:纯粹的探索,具有多种正确答案

获取原文

摘要

We determine the sample complexity of pure exploration bandit problems with multiple good answers. We derive a lower bound using a new game equilibrium argument. We show how continuity and convexity properties of single-answer problems ensure that the existing Track-and-Stop algorithm has asymptotically optimal sample complexity. However, that convexity is lost when going to the multiple-answer setting. We present a new algorithm which extends Track-and-Stop to the multiple-answer case and has asymptotic sample complexity matching the lower bound.
机译:我们确定纯粹探索强盗问题的样本复杂性,具有多个好答案。 我们使用新的游戏均衡论点获得了较低的界限。 我们展示了单答案问题的连续性和凸性属性如何确保现有的轨道和停止算法具有渐近最佳的样本复杂性。 但是,进入多答案设置时凸起丢失。 我们介绍了一种新的算法,它将跟踪和停止扩展到多答案案例,并具有匹配下限的渐近样本复杂性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号