首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Bridging the gap between regret minimization and best arm identification, with application to A/B tests
【24h】

Bridging the gap between regret minimization and best arm identification, with application to A/B tests

机译:桥接后悔最小化和最佳臂识别之间的差距,应用于A / B测试

获取原文
           

摘要

State of the art online learning procedures focus either on selecting the best alternative (“best arm identification”) or on minimizing the cost (the “regret”). We merge these two objectives by providing the theoretical analysis of cost minimizing algorithms that are also $delta$-PAC (with a proven guaranteed bound on the decision time), hence fulfilling at the same time regret minimization and best arm identification. This analysis sheds light on the common observation that ill-callibrated UCB-algorithms minimize regret while still identifying quickly the best arm. We also extend these results to the non-iid case faced by many practitioners. This provides a technique to make cost versus decision time compromise when doing adaptive tests with applications ranging from website A/B testing to clinical trials.
机译:最先进的在线学习程序重点关注选择最佳替代方案(“最佳臂识别”)或最小化成本(“后悔”)。我们通过提供价格最小化算法的理论分析来合并这两个目标,这些算法也是$ delta $ -pac(在决定时间的经过验证的保证),因此同时致力于最小化和最佳臂识别。这种分析揭示了常见的观察,即僵硬的UCB算法最小化遗憾的同时仍然迅速识别最好的手臂。我们还将这些结果扩展到许多从业者面临的非IID案件。这提供了一种技术,用于在与从网站A / B测试到临床试验的应用范围内的应用程序进行适应性测试时妥协的技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号