首页> 外文会议>TAROS 2013 >Heuristically-Accelerated Reinforcement Learning: A Comparative Analysis of Performance
【24h】

Heuristically-Accelerated Reinforcement Learning: A Comparative Analysis of Performance

机译:启发式加速钢筋学习:表现的比较分析

获取原文

摘要

This paper presents a comparative analysis of three Reinforcement Learning algorithms (Q-learning, Q(λ)-learning and QS-learning) and their heuristically-accelerated variants (HAQL, HAQ(λ) and HAQS) where heuristics bias action selection, thus speeding up the learning. The experiments were performed in a simulated robot soccer environment which reproduces the conditions of a real competition league environment. The results clearly demonstrate that the use of heuristics substantially improves the performance of the learning algorithms.
机译:本文介绍了三种加强学习算法(Q-Learning,Q(λ) - 学习和QS学习)的比较分析,以及它们的启发式偏置动作选择的启发式加速的变体(HAQL,HAQ(λ)和HAQS),因此加快学习。实验是在模拟机器人足球环境中进行的,该环境再现了真正的竞争联盟环境的条件。结果清楚地表明,启发式的使用大大提高了学习算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号