首页> 外文会议>Proceedings of the 26th Annual International Conference on Machine Learning >The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning
【24h】

The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning

机译:自适应k气象学家问题及其在强化学习中的结构学习和特征选择中的应用

获取原文
获取原文并翻译 | 示例

摘要

The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an algorithm, known as the Adaptive k-Meteorologists Algorithm, analyze its sample-complexity upper bound, and give a matching lower bound. Second, this algorithm is used to create a new reinforcement-learning algorithm for factored-state problems that enjoys significant improvement over the previous state-of-the-art algorithm. Finally, we apply the Adaptive k-Meteorologists Algorithm to remove a limiting assumption in an existing reinforcement-learning algorithm. The effectiveness of our approaches is demonstrated empirically in a couple benchmark domains as well as a robotics navigation problem.
机译:本文的目的是三方面的。首先,我们在最近提出的KWIK框架中形式化并研究了学习概率概念的问题。我们给出了称为自适应k气象学家算法的算法的详细信息,分析了其样本复杂度的上限,并给出了匹配的下限。其次,该算法用于为因式状态问题创建新的强化学习算法,该算法比以前的最新算法有了显着改进。最后,我们应用自适应k气象学家算法来消除现有增强学习算法中的限制假设。我们在几个基准域以及机器人导航问题中通过经验证明了我们方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号