International Conference on Machine Learning

Experience-efficient learning in associative bandit problems


Abstract

We formalize the associative bandit problem framework introduced by Kaelbling as a learning-theory problem. The learning environment is modeled as a k-armed bandit where arm payoffs are conditioned on an observable input selected on each trial. We show that, if the payoff functions are constrained to a known hypothesis class, learning can be performed efficiently with respect to the VC dimension of this class. We formally reduce the problem of PAC classification to the associative bandit problem, producing an efficient algorithm for any hypothesis class for which efficient classification algorithms are known. We demonstrate the approach empirically on a scalable concept class.
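The setting the abstract describes can be made concrete with a small sketch. This is not the paper's algorithm; it is a minimal, hypothetical associative-bandit simulation in which a k-armed bandit's payoff distribution depends on an observable input shown on each trial, and an epsilon-greedy learner keeps separate arm statistics per input. All names, payoff probabilities, and parameters below are illustrative assumptions.

```python
import random

# Hypothetical environment: on each trial the learner observes an input x
# (here a bit), pulls one of k arms, and receives a Bernoulli payoff whose
# success probability depends on (x, arm). By construction, arm x is best
# for input x (0.9 vs. 0.2 success probability) -- an assumed payoff rule.
def make_env(k=3, seed=0):
    rng = random.Random(seed)
    def step(x, arm):
        p = 0.9 if arm == x else 0.2
        return 1.0 if rng.random() < p else 0.0
    return step

def run(trials=5000, k=3, inputs=(0, 1), eps=0.1, seed=1):
    rng = random.Random(seed)
    step = make_env(k, seed)
    # Per-input arm statistics: pull counts and reward sums.
    counts = {x: [0] * k for x in inputs}
    sums = {x: [0.0] * k for x in inputs}
    total = 0.0
    for _ in range(trials):
        x = rng.choice(inputs)          # observable input for this trial
        if rng.random() < eps:
            arm = rng.randrange(k)      # explore uniformly
        else:
            # Exploit: pick the arm with the highest empirical mean for
            # this input; untried arms get +inf so each is tried once.
            means = [sums[x][a] / counts[x][a] if counts[x][a] else float("inf")
                     for a in range(k)]
            arm = means.index(max(means))
        r = step(x, arm)
        counts[x][arm] += 1
        sums[x][arm] += r
        total += r
    # Greedy policy recovered per input after learning.
    policy = {x: max(range(k), key=lambda a: sums[x][a] / max(counts[x][a], 1))
              for x in inputs}
    return policy, total / trials

policy, avg_reward = run()
```

With enough trials the learner recovers the input-conditioned best arm for each observable input, which is the core of the associative (contextual) bandit problem; the paper's contribution is doing this with sample efficiency governed by the VC dimension of a known hypothesis class over payoff functions, which this naive per-input tabulation does not exploit.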
