Context Attentive Bandits: Contextual Bandit with Restricted Context

机译：上下文周度匪徒：具有限制背景的上下文匪

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We consider a novel formulation of the multi-armed bandit model, which we call the contextual bandit with restricted context, where only a limited number of features can be accessed by the learner at every iteration. This novel formulation is motivated by different online problems arising in clinical trials, recommender systems and attention modeling. Herein, we adapt the standard multi-armed bandit algorithm known as Thompson Sampling to take advantage of our restricted context setting, and propose two novel algorithms, called the Thompson Sampling with Restricted Context (TSRC) and the Windows Thompson Sampling with Restricted Context (WTSRC), for handling stationary and nonstationary environments, respectively. Our empirical results demonstrate advantages of the proposed approaches on several real-life datasets.

机译：我们考虑了一种新颖的多武装强盗模型的制定，我们呼叫具有限制上下文的上下文匪徒，其中仅在每次迭代时都可以访问有限数量的功能。这种新型制剂通过临床试验，推荐系统和注意力建模产生的不同在线问题。在此，我们将称为汤普森采样的标准多武装强盗算法适用于利用我们限制的上下文设置，并提出两个新颖的算法，称为汤普森采样，其中包含受限制的上下文（TSRC）和具有受限上下文的窗口汤普森采样（WTSRC ），分别处理静止和非间抗环境。我们的经验结果表明了拟议的近几种现实生活数据集的优势。

著录项

来源
《International Joint Conference on Artificial Intelligence》|2019年|1377-2022p|共8页
会议地点
作者
Djallel Bouneffouf; Irina Rish; Guillermo A. Cecchi; Raphael Feraud;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Contextual bandits with hidden contexts: a focused data capture from social media streams [J] . Lamprier Sylvain, Gisselbrecht Thibault, Gallinari Patrick Data mining and knowledge discovery . 2019,第6期

机译：具有隐藏上下文的上下文匪徒：来自社交媒体流的聚焦数据捕获
2. A Context-Aware Multiarmed Bandit Incentive Mechanism for Mobile Crowd Sensing Systems [J] . Wu Yue, Li Fan, Ma Liran, Internet of Things Journal, IEEE . 2019,第5期

机译：移动人群感知系统的情境感知多臂强盗激励机制
3. BANDIT STRATEGIES EVALUATED IN THE CONTEXT OF CLINICAL TRIALS IN RARE LIFE-THREATENING DISEASES [J] . Villar Sofia S. Probability in the Engineering and Informational Sciences . 2018,第2期

机译：在临床试验中评估罕见病危重病的盗贼策略
4. Context Attentive Bandits: Contextual Bandit with Restricted Context [C] . Djallel Bouneffouf, Irina Rish, Guillermo A. Cecchi, International Joint Conference on Artificial Intelligence . 2019

机译：上下文周度匪徒：具有限制背景的上下文匪
5. Using Contextual Bandits to Improve Traffic Performance in Edge Network [D] . Al Zadjali, Aziza Najeeb. 2021

机译：使用上下文匪徒改进边缘网络中的流量性能
6. Basal Ganglia Preferentially Encode Context Dependent Choice in a Two-Armed Bandit Task [O] . André Garenne, Benjamin Pasquereau, Martin Guthrie, 2011

机译：在两臂强盗任务中基础神经节优先编码上下文相关选择
7. Context Attentive Bandits: Contextual Bandit with Restricted Context [O] . Bouneffouf, Djallel, Rish, Irina, Cecchi, Guillermo A., 2017

机译：语境殷勤强盗：具有受限上下文的语境强盗

Context Attentive Bandits: Contextual Bandit with Restricted Context

摘要

著录项

相似文献

相关主题

期刊订阅