Better algorithms for benign bandits

机译：良性土匪的更好算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The online multi-armed bandit problem and its generalizations are repeated decision making problems, where the goal is to select one of several possible decisions in every round, and incur a cost associated with the decision, in such a way that the total cost incurred over all iterations is close to the cost of the best fixed decision in hindsight. The difference in these costs is known as the regret of the algorithm. The term bandit refers to the setting where one only obtains the cost of the decision used in a given iteration and no other information.

机译：在线多武装匪徒问题及其概括是重复的决策问题，其目的是在每一轮中选择几个可能的决策中的一个，并招致与决策相关的成本，以使总成本超过在事后看来，所有迭代都接近最佳固定决策的成本。这些成本之间的差异被称为算法的遗憾。术语“匪徒”是指这样一种设置，其中仅获得给定迭代中使用的决策成本，而没有其他信息。

著录项

来源
《Annual ACM-SIAM Symposium on Discrete Algorithms;ACM-SIAM Symposium on Discrete Algorithms》|2009年|P.38 - 47|共10页
会议地点 New York NY(US);New York NY(US)
作者
Elad Hazan; Satyen Kale;
展开▼
作者单位

IBM Almaden;

Microsoft Research;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Better Algorithms for Benign Bandits [J] . Hazan Elad, Kale Satyen Journal of machine learning research . 2011,第Apr期

机译：良性土匪的更好算法
2. Intelligent and Reconfigurable Architecture for KL Divergence-Based Multi-Armed Bandit Algorithms [J] . Santosh S. V. Sai, Darak Sumit J. IEEE transactions on circuits and systems. II, Express briefs . 2021,第3期

机译：基于KL发散的多武装强盗算法的智能和可重构架构
3. Statistically Efficient, Polynomial-Time Algorithms for Combinatorial Semi-Bandits [J] . Thibaut Cuvelier, Richard Combes, Eric Gourdin Performance evaluation review . 2021,第1期

机译：组合半刺槐的统计有效，多项式时间算法
4. Better Algorithms for Benign Bandits [C] . Elad Hazan, Satyen Kale Annual ACM-SIAM Symposium on Discrete Algorithms . 2009

机译：良性匪徒的更好算法
5. Offline Evaluation of Multi-Armed Bandit Algorithms Using Bootstrapped Replay on Expanded Data [D] . Dai, Jin. 2021

机译：在扩展数据上使用引导重播的多武装强盗算法的离线评估
6. Non Stationary Multi-Armed Bandit: Empirical Evaluation of a New Concept Drift-Aware Algorithm [O] . Emanuele Cavenaghi, Gabriele Sottocornola, Fabio Stella, 2021

机译：非固定多武装强盗：新概念漂移感知算法的实证评估
7. Better Algorithms for Benign Bandits [O] . Elad Hazan, Satyen Kale 2009

机译：良性土匪的更好算法

Better algorithms for benign bandits

摘要

著录项

相似文献

相关主题

期刊订阅