首页> 外文会议>IEEE International Conference on System, Man, and Cybernetics >On amount and quality of bias in reinforcement learning
【24h】

On amount and quality of bias in reinforcement learning

机译:加固学习中偏差的数量和质量

获取原文

摘要

Reinforcement learning is widely regarded as elegant in theory but hopelessly slow in practice. This is because it is often studied under the assumption that there is little or no prior information about the task at hand. This assumption, however, is not the defining characteristic of learning. Learning involves the incorporation of prior knowledge or bias that can greatly accelerates or otherwise improves the learning process.In this paper we address the influence of the amount and quality of bias on the speed of reinforcement learning. For a chosen class of learning problem different forms of biases are initially identified. Some of the bias are extracted from the knowledgeof the environment, others from the task, and yet a few from both. Belief matrices, which reset Q-tables before learning commences, encode the biases. The average number of interactions between the agent and the environment is used to quantify the biases. Based on this performance measure, the biases are graded and some new results are reported. In addition, the paper compares continual learning to learning from scratch and presents results that clearly demonstrate the advantages of the former.
机译:强化学习在理论上被广泛认为优雅,但实际上绝望地缓慢。这是因为假设往往是几乎没有关于手头任务的事先信息的研究。然而,这种假设不是学习的定义特征。学习涉及纳入先前的知识或偏见,可以大大加速或以其他方式改善学习过程。在本文中,我们解决了偏倚速度的影响力和质量的影响。对于选择的学习问题,最初识别出不同形式的偏差。一些偏见从环境中的知识中提取,其他人来自任务,而且来自两者的几个。信仰矩阵,在学习开始之前重置Q-Tables,编码偏差。代理和环境之间的平均交互次数用于量化偏差。基于这种性能措施,偏差分级,报告了一些新的结果。此外,本文比较不断学习从划伤中学习,并提出了清楚地证明前者的优势的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号