Best Arm Identification in Linear Bandits with Linear Dimension Dependency

Chao Tao; Saúl Blanco; Yuan Zhou

Abstract

We study the best arm identification problem in linear bandits, where the mean reward of each arm depends linearly on an unknown $d$-dimensional parameter vector $\theta$, and the goal is to identify the arm with the largest expected reward. We first design and analyze a novel randomized $\theta$ estimator based on the solution to the convex relaxation of an optimal $G$-allocation experiment design problem. Using this estimator, we describe an algorithm whose sample complexity depends linearly on the dimension $d$, as well as an algorithm with sample complexity dependent on the reward gaps of the best $d$ arms, matching the lower bound arising from the ordinary top-arm identification problem. We finally compare the empirical performance of our algorithms with other state-of-the-art algorithms in terms of both sample complexity and computational time.
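For readers unfamiliar with $G$-allocation, the following is a minimal sketch of the continuous (relaxed) $G$-optimal design problem, assuming it takes the standard form of minimizing $\max_i x_i^\top A(w)^{-1} x_i$ over weights $w$ on the simplex, with $A(w) = \sum_i w_i x_i x_i^\top$. It uses the classical Frank-Wolfe (Fedorov-Wynn) iteration, which is justified by the Kiefer-Wolfowitz equivalence with $D$-optimal design. It is not the randomized estimator from the paper; the function name `g_optimal_design` and its parameters are hypothetical, and NumPy is assumed to be available.

```python
import numpy as np


def g_optimal_design(arms, n_iters=5000, tol=0.01):
    """Frank-Wolfe sketch for the continuous G-optimal design (illustrative only).

    arms: (K, d) array whose rows are arm feature vectors spanning R^d.
    Returns (weights, worst_case_variance), with weights on the simplex.
    """
    arms = np.asarray(arms, dtype=float)
    n_arms, dim = arms.shape
    weights = np.full(n_arms, 1.0 / n_arms)  # start from the uniform design

    def arm_variances(w):
        # A(w) = sum_i w_i x_i x_i^T, then g_i = x_i^T A(w)^{-1} x_i for every arm
        design = arms.T @ (w[:, None] * arms)
        return np.einsum("ij,ij->i", arms, np.linalg.solve(design, arms.T).T)

    for _ in range(n_iters):
        variances = arm_variances(weights)
        worst = int(np.argmax(variances))
        g_max = variances[worst]
        # Kiefer-Wolfowitz: max_i g_i >= d, with equality exactly at the optimum
        if g_max <= dim * (1.0 + tol):
            break
        # Exact line-search step for the log-det objective toward the worst arm
        step = (g_max / dim - 1.0) / (g_max - 1.0)
        weights = (1.0 - step) * weights
        weights[worst] += step

    return weights, float(arm_variances(weights).max())


if __name__ == "__main__":
    example_arms = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [1.0, -1.0]])
    w, worst_var = g_optimal_design(example_arms)
    print(w, worst_var)
```

The returned worst-case variance is always at least $d$, and a value close to $d$ indicates that the weights are near-optimal for the relaxed problem. Turning such fractional weights into an actual sampling scheme is the step that requires rounding or randomization.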

Bibliographic details

Source: JMLR: Workshop and Conference Proceedings, 2018, issue 2009, 10 pages
Authors: Chao Tao; Saúl Blanco; Yuan Zhou
Original format: PDF
Chinese Library Classification: Artificial intelligence theory
