JMLR: Workshop and Conference Proceedings

Lower Bounds on Regret for Noisy Gaussian Process Bandit Optimization



Abstract

In this paper, we consider the problem of sequentially optimizing a black-box function $f$ based on noisy samples and bandit feedback. We assume that $f$ is smooth in the sense of having a bounded norm in some reproducing kernel Hilbert space (RKHS), yielding a commonly-considered non-Bayesian form of Gaussian process bandit optimization. We provide algorithm-independent lower bounds on the simple regret, measuring the suboptimality of a single point reported after $T$ rounds, and on the cumulative regret, measuring the sum of regrets over the $T$ chosen points. For the isotropic squared-exponential kernel in $d$ dimensions, we find that an average simple regret of $\epsilon$ requires $T = \Omega\big(\frac{1}{\epsilon^2}(\log\frac{1}{\epsilon})^{d/2}\big)$, and the average cumulative regret is at least $\Omega\big(\sqrt{T(\log T)^{d/2}}\big)$, thus matching existing upper bounds up to the replacement of $d/2$ by $2d+O(1)$ in both cases. For the Matérn-$\nu$ kernel, we give analogous bounds of the form $\Omega\big((\frac{1}{\epsilon})^{2+d/\nu}\big)$ and $\Omega\big(T^{\frac{\nu+d}{2\nu+d}}\big)$, and discuss the resulting gaps to the existing upper bounds.
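For reference, the two performance measures named in the abstract can be written out explicitly; the notation $x^*$ for a maximizer of $f$, $x^{(T)}$ for the point reported after $T$ rounds, and $x_1,\dots,x_T$ for the queried points is an assumed convention, not taken from this page:

```latex
% Simple regret: suboptimality of the single point x^{(T)} reported after T rounds
r_T = f(x^*) - f\big(x^{(T)}\big), \qquad x^* \in \operatorname*{arg\,max}_x f(x)

% Cumulative regret: summed suboptimality of the T points actually queried
R_T = \sum_{t=1}^{T} \big( f(x^*) - f(x_t) \big)
```

Under this convention, the simple-regret lower bound says that guaranteeing an average $r_T \le \epsilon$ for the squared-exponential kernel already forces the stated sample complexity in $T$, while the cumulative bound constrains $R_T$ over the whole query sequence.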

