【24h】

Group Online Adaptive Learning

机译:小组在线自适应学习

获取原文

摘要

Sharing information among multiple learning agents can accelerate learning. It could be particularly useful if learners operate in continuously changing environments, because a learner could benefit from previous experience of another learner to adapt to their new environment. Such group-adaptive learning has numerous applications, from predicting financial time-series, through content recommendation systems, to visual understanding for adaptive autonomous agents. Here we address the problem in the context of online adaptive learning. We formally define the learning settings of Group Online Adaptive Learning and derive an algorithm named Shared Online Adaptive Learning (SOAL) to address it. SOAL avoids explicitly modeling changes or their dynamics, and instead shares information continuously. The key idea is that learners share a common small pool of experts, which they can use in a weighted adaptive way. We define group adaptive regret and prove that SOAL maintains known bounds on the adaptive regret obtained for single adaptive learners. Furthermore, it quickly adapts when learning tasks are related to each other. We demonstrate the benefits of the approach for two domains: vision and text. First, in the visual domain, we study a visual navigation task where a robot learns to navigate based on outdoor video scenes. We show how navigation can improve when knowledge from other robots in related scenes is available. Second, in the text domain, we create a new dataset for the task of assigning submitted papers to relevant editors. This is, inherently, an adaptive learning task due to the dynamic nature of research fields evolving in time. We show how learning to assign editors improves when knowledge from other editors is available. Together, these results demonstrate the benefits for sharing information across learners in concurrently changing environments.
机译:在多个学习代理之间共享信息可以加快学习速度。如果学习者在不断变化的环境中工作,这可能会特别有用,因为一个学习者可以从另一个学习者的先前经验中受益,以适应他们的新环境。这样的群体自适应学习具有许多应用,从预测财务时间序列到通过内容推荐系统,再到对自适应自主主体的视觉理解。在这里,我们在在线自适应学习的背景下解决了这个问题。我们正式定义小组在线自适应学习的学习设置,并派生一种名为共享在线自适应学习(SOAL)的算法来解决该问题。 SOAL避免显式地为更改或其动态建模,而是连续地共享信息。关键思想是,学习者共享一个共同的小型专家库,他们可以以加权自适应方式使用这些专家库。我们定义了群体适应性遗憾,并证明了SOAL保持了对单个适应性学习者获得的适应性遗憾的已知界限。此外,当学习任务相互关联时,它可以快速适应。我们在两个领域展示了该方法的好处:视觉和文本。首先,在视觉领域,我们研究视觉导航任务,其中机器人学习基于室外视频场景的导航。我们展示了当相关场景中其他机器人的知识可用时,导航将如何改善。其次,在文本域中,我们创建了一个新的数据集,用于将提交的论文分配给相关编辑者。由于研究领域随时间变化的动态性质,从本质上讲,这是一项自适应学习任务。我们展示了当其他编辑者的知识可用时,如何分配编辑者的学习会得到改善。总之,这些结果证明了在同时变化的环境中跨学习者共享信息的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号