Covariance Matrix Adaptation for Multiobjective Multiarmed Bandits

Drugan Madalina M.

Abstract

Upper confidence bound (UCB) is a successful multiarmed bandit algorithm for regret minimization. The covariance matrix adaptation (CMA) for Pareto UCB (CMA-PUCB) algorithm considers stochastic reward vectors with correlated objectives. Using a variant of the Bernstein inequality for matrices, we upper bound the cumulative pseudoregret of pulling suboptimal arms for the CMA-PUCB algorithm by a quantity logarithmic in the number of arms $K$, objectives $D$, and samples $n$: $\mathcal{O}\left(\ln(nDK) \sum_i \|\Sigma_i\|_2 / \Delta_i\right)$, where $\Delta_i$ is the regret of pulling the suboptimal arm $i$. For unknown covariance matrices between objectives $\Sigma_i$, we upper bound the approximation of the covariance matrix using the number of samples to $\mathcal{O}\left(n \ln(nDK) + \ln^2(nDK) \sum_i 1/\Delta_i\right)$. Simulations on a three-objective stochastic environment show the applicability of our method.
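
The abstract names two ingredients that a small sketch can make concrete: choosing among arms whose optimistic index vectors are Pareto optimal, and a regret term of order ln(nDK) times the sum of covariance norm over gap. The Python below is a minimal illustration under stated assumptions, not the CMA-PUCB algorithm from the paper. The function names, the exploration bonus, and the reading of the norm as a spectral norm supplied by the caller are all hypothetical choices made for this example, and constants in the bound are ignored.

```python
import math
import random


def dominates(u, v):
    """True if u Pareto-dominates v: no worse in every objective, better in at least one."""
    return all(a >= b for a, b in zip(u, v)) and any(a > b for a, b in zip(u, v))


def pareto_front(vectors):
    """Indices of the vectors that no other vector dominates."""
    return [
        i for i, u in enumerate(vectors)
        if not any(dominates(v, u) for j, v in enumerate(vectors) if j != i)
    ]


def index_vectors(means, cov_norms, counts, n, D, K):
    """Optimistic index per arm: empirical mean vector plus one bonus added to every objective.

    ASSUMPTION: the bonus sqrt(2 * cov_norm_i * ln(nDK) / n_i) is an illustrative
    variance-aware choice, not the index defined in the paper.
    """
    log_term = math.log(n * D * K)
    out = []
    for mean, norm, count in zip(means, cov_norms, counts):
        bonus = math.sqrt(2.0 * norm * log_term / count)
        out.append([m + bonus for m in mean])
    return out


def select_arm(means, cov_norms, counts, n, rng=random):
    """Pick uniformly at random among arms whose index vector is on the Pareto front."""
    K, D = len(means), len(means[0])
    front = pareto_front(index_vectors(means, cov_norms, counts, n, D, K))
    return rng.choice(front)


def leading_regret_term(n, D, K, cov_norms, gaps):
    """ln(nDK) * sum_i cov_norm_i / gap_i over suboptimal arms (gap > 0), constants omitted."""
    return math.log(n * D * K) * sum(s / g for s, g in zip(cov_norms, gaps) if g > 0)


if __name__ == "__main__":
    # Hypothetical two-objective, three-arm statistics after 30 pulls in total.
    means = [[0.9, 0.1], [0.1, 0.9], [0.05, 0.05]]
    print(select_arm(means, [0.01] * 3, [10] * 3, 30))
    print(leading_regret_term(100, 2, 3, [1.0, 1.0, 0.5], [0.0, 0.0, 0.25]))
```

Passing the covariance norms in as plain numbers keeps the sketch free of linear algebra dependencies; in practice they would be estimated from each arm's observed reward vectors.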

Bibliographic details

Source
Neural Networks and Learning Systems, IEEE Transactions on | 2019, no. 8 | pp. 2493-2502 | 10 pages
Author
Drugan Madalina M.
Affiliation
ITLearns Online, NL-3564 ET Utrecht, Netherlands
Original format: PDF
Language: English
Keywords
Bernstein inequality; covariance matrix adaptation (CMA); multiobjective multiarmed bandits (MABs); regret minimization; stochastic reward vector

Similar literature

1. Electromagnetic Optimization Using Mixed-Parameter and Multiobjective Covariance Matrix Adaptation Evolution Strategy [J]. BouDaher Elie, Hoorfar Ahmad. Antennas and Propagation, IEEE Transactions on. 2015, no. 4
2. Toward a Matrix-Free Covariance Matrix Adaptation Evolution Strategy [J]. Arabas Jarosław, Jagodzinski Dariusz. IEEE Transactions on Evolutionary Computation. 2020, no. 1
3. Optimization of rotation patterns of a mangle-type magnetic field source using covariance matrix adaptation evolution strategy [J]. Hiroshi Sakuma, Takuto Nakagawara. Journal of Magnetism and Magnetic Materials. 2021, issue Juna
4. Gaussian Adaptation Revisited - An Entropic View on Covariance Matrix Adaptation [C]. Christian L. Müller, Ivo F. Sbalzarini. EvoCOMPLEX; EvoCOMNET; EvoCOP 2010; EvoApplications 2010; EvoENVIRONMENT; EuroGP 2010; EvoBIO 2010; EvoFIN; EvoGAMES; EvoIASP; EvoINTELLIGENCE; EvoMUSART; EvoNUM; EvoSTOC; EvoTRANSLOG. 2010
5. Enabling Robust State Estimation through Covariance Adaptation [D] . Watson, Ryan Magnum. 2019
6. Parameter Optimization Using Covariance Matrix Adaptation—Evolutionary Strategy (CMA-ES) an Approach to Investigate Differences in Channel Properties Between Neuron Subtypes [O] . Zbigniew Jȩdrzejewski-Szmek, Karina P. Abrahao, Joanna Jȩdrzejewska-Szmek, 2018
7. Gaussian Adaptation Revisited – An Entropic View on Covariance Matrix Adaptation [O] . Müller, Christian L, Sbalzarini, Ivo F 2010
