...
【24h】

Two Steps Reinforcement Learning

机译:两步强化学习

获取原文
           

摘要

When applying reinforcement learning in domains with very large or continuous state spaces, the experience obtained by the learning agent in the interaction with the environment must be generalized. The generalization methods are usually based on the approximation of the value functions used to compute the action policy and tackled in two different ways. On the one hand by using an approximation of the value functions based on a supervized learning method. On the other hand, by discretizing the environment to use a tabular representation of the value functions. In this work, we propose an algorithm that uses both approaches to use the benefits of both mechanisms, allowing a higher performance. The approach is based on two learning phases. In the first one, a learner is used as a supervized function approximator, but using a machine learning technique which also outputs a state space discretization of the environment, such as nearest prototype classifiers or decision trees do. In the second learning phase, the space discretization computed in the first phase is used to obtain a tabular representation of the value function computed in the previous phase, allowing a tuning of such value function approximation. Experiments in different domains show that executing both learning phases improves the results obtained executing only the first one. The results take into account the resources used and the performance of the learned behavior.
机译:在具有非常大或连续状态空间的领域中应用强化学习时,必须概括学习代理在与环境交互中获得的经验。泛化方法通常基于用于计算操作策略并以两种不同方式处理的价值函数的近似值。一方面,通过基于高级学习方法使用值函数的近似值。另一方面,通过离散化环境来使用值函数的表格表示形式。在这项工作中,我们提出了一种使用两种方法来利用两种机制的优点的算法,从而可以实现更高的性能。该方法基于两个学习阶段。在第一个中,学习器用作超函数逼近器,但使用的机器学习技术还输出环境的状态空间离散化,例如最近的原型分类器或决策树。在第二学习阶段中,在第一阶段中计算出的空间离散化用于获得在前一阶段中计算出的值函数的表格表示,从而可以调整这种值函数的近似值。不同领域的实验表明,执行两个学习阶段可改善仅执行第一个学习阶段所获得的结果。结果考虑了使用的资源和学习行为的表现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号