Rapidly Finding the Best Arm Using Variance

机译：快速找到使用方差的最佳臂

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We address the problem of identifying the best arm in a pure-exploration multi-armed bandit problem. In this setting, the agent repeatedly pulls arms in order to identify the one associated with the maximum expected reward. We focus on the fixed-budget version of the problem in which the agent tries to find the best arm given a fixed number of arm pulls. We propose a novel sequential elimination method exploiting the empirical variance of the arms. We detail and analyse the overall approach providing theoretical and empirical results. The experimental evaluation shows the advantage of our variance-based rejection method in heterogeneous test settings, considering both identification accuracy and execution time.

机译：我们解决了识别纯粹探索多武装强盗问题中最好的手臂的问题。在该设置中，代理重复拉动臂，以便识别与最大预期奖励相关联的人。我们专注于固定预算版本的代理商试图找到一个固定数量的ARM拉动的最佳手臂。我们提出了一种新的连续消除方法，利用武器的经验方差。我们详细说明并分析了提供理论和经验结果的整体方法。实验评估表明，考虑到识别精度和执行时间，我们在异构测试设置中的差异基抑制方法的优点。

著录项

来源
《European Conference on Artificial Intelligence;Conference on Prestigious Applications of Intelligent Systems》|2020年|2275-3053p|共7页
会议地点
作者
Marco Faella; Alberto Finzi; Luigi Sauro;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Estimating the asymptotic variance matrix of the QMLE of weak multivariate ARMA models [Estimation de la matrice de variance asymptotique des estimateurs du QMV de modèles ARMA faibles multivariés] [J] . Boubacar Ma?nassara Y. Comptes rendus. Mathematique . 2011,第13a14期

机译：估计弱多元ARMA模型的QMLE的渐近方差矩阵[估计多元弱ARMA模型的QMV估计器的渐近方差矩阵]
2. Rapid Administration Technique of Ketamine for Pediatric Forearm Fracture Reduction: A Dose-Finding Study [J] . Chinta Sri S., Schrock Charles R., McAllister John D., Annals of Emergency Medicine: Journal of the American College of Emergency Physicians and the University Association for Emergency Medicine . 2015,第6期

机译：氯胺酮快速给药技术用于减少小儿前臂骨折的剂量发现研究
3. Application of Harmonic Mean of Variances for Testing Ordered Alternative Hypothesis under Variance Heterogeneity [J] . Abidoye A. O., Jolayemi E. T., Sanni O. O. M., International Journal of Statistics and Applications . 2014,第4期

机译：方差谐波均值在方差异质性下测试有序替代假设的应用
4. Rapidly Finding the Best Arm Using Variance [C] . Marco Faella, Alberto Finzi, Luigi Sauro European Conference on Artificial Intelligence;Conference on Prestigious Applications of Intelligent Systems . 2020

机译：快速找到使用方差的最佳臂
5. Rapid Enrollment Design for Finding the Optimal Dose in Immunotherapy Trials with Ordered Groups and Optimal Design of Experiments with Observation Censoring Driven by Random Enrollment [D] . Xue, Xiaoqiang 2019

机译：在有序组的免疫治疗试验中寻找最佳剂量的快速入组设计，以及随机入组驱动的观察检查实验的优化设计
6. Rapid administration technique of ketamine for pediatric forearm fracture reduction- a dose finding study [O] . Sri S Chinta, Charles R Schrock, John D McAllister, -1

机译：氯胺酮快速给药技术用于减少小儿前臂骨折的剂量研究
7. Rapid Administration Technique of Ketamine for Pediatric Forearm Fracture Reduction: A Dose-Finding Study [O] . Sri S. Chinta, Charles R. Schrock, John D. McAllister, 2015

机译：小儿前臂骨折减少氯胺酮的快速管理技术：一种剂量发现研究
8. Finding Optimal Policies for Markov Decision Chains: A Unifying Framework for Mean-Variance-Tradeoffs (Revised) [R] . Huang, Y., Kallenberg, L. C. M. 1993

机译：寻找马尔可夫决策链的最优政策：均值 - 方差 - 权衡的统一框架（修订）

Rapidly Finding the Best Arm Using Variance

摘要

著录项

相似文献

相关主题

期刊订阅