首页> 外文期刊>Neural processing letters >Specialization in Hierarchical Learning Systems: A Unified Information-theoretic Approach for Supervised, Unsupervised and Reinforcement Learning
【24h】

Specialization in Hierarchical Learning Systems: A Unified Information-theoretic Approach for Supervised, Unsupervised and Reinforcement Learning

机译:分层学习系统的专业化:统一的信息 - 监督,无监督和强化学习的理论方法

获取原文
获取原文并翻译 | 示例
       

摘要

Joining multiple decision-makers together is a powerful way to obtain more sophisticated decision-making systems, but requires to address the questions of division of labor and specialization. We investigate in how far information constraints in hierarchies of experts not only provide a principled method for regularization but also to enforce specialization. In particular, we devise an information-theoretically motivated on-line learning rule that allows partitioning of the problem space into multiple sub-problems that can be solved by the individual experts. We demonstrate two different ways to apply our method: (ⅰ) partitioning problems based on individual data samples and (ⅱ) based on sets of data samples representing tasks. Approach (ⅰ) equips the system with the ability to solve complex decision-making problems by finding an optimal combination of local expert decision-makers. Approach (ⅱ) leads to decision-makers specialized in solving families of tasks, which equips the system with the ability to solve meta-learning problems. We show the broad applicability of our approach on a range of problems including classification, regression, density estimation, and reinforcement learning problems, both in the standard machine learning setup and in a meta-learning setting.
机译:加入多个决策者在一起是获得更复杂的决策系统的强大方法,但需要解决劳动和专业化的问题。我们调查了专家层次结构中的信息限制,不仅提供了用于规范化的原则方法,还要强制执行专业化。特别是,我们设计了一个信息理论上动机的在线学习规则,允许将问题空间划分为可以由各个专家解决的多个子问题。我们展示了应用我们的方法的两种不同方式:(Ⅰ)基于代表任务的数据样本组的基于单个数据样本的分区问题和(Ⅱ)。方法(Ⅰ)通过找到当地专家决策者的最佳组合来提供解决复杂决策问题的能力。方法(Ⅱ)导致专门解决任务的家庭的决策者,该机构配备了解决元学问题的能力。我们在标准机器学习设置和元学习环境中显示了我们在包括分类,回归,密度估计和强化学习问题的一系列问题上的广泛适用性,包括分类,回归,密度估计和加强学习问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号