首页> 外文会议>TAROS 2013 >Heuristically-Accelerated Reinforcement Learning: A Comparative Analysis of Performance

【24h】

Heuristically-Accelerated Reinforcement Learning: A Comparative Analysis of Performance

机译：启发式加速钢筋学习：表现的比较分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a comparative analysis of three Reinforcement Learning algorithms (Q-learning, Q(λ)-learning and QS-learning) and their heuristically-accelerated variants (HAQL, HAQ(λ) and HAQS) where heuristics bias action selection, thus speeding up the learning. The experiments were performed in a simulated robot soccer environment which reproduces the conditions of a real competition league environment. The results clearly demonstrate that the use of heuristics substantially improves the performance of the learning algorithms.

机译：本文介绍了三种加强学习算法（Q-Learning，Q（λ） - 学习和QS学习）的比较分析，以及它们的启发式偏置动作选择的启发式加速的变体（HAQL，HAQ（λ）和HAQS），因此加快学习。实验是在模拟机器人足球环境中进行的，该环境再现了真正的竞争联盟环境的条件。结果清楚地表明，启发式的使用大大提高了学习算法的性能。

著录项

来源
《TAROS 2013》|2014年||共13页
会议地点
作者
Murilo Fernandes Martins; Reinaldo A.C. Bianchi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP24-532;
关键词
Reinforcement learning; Heuristics; Robot soccer;

机译：加固学习;启发式;机器人足球;

相似文献

外文文献
中文文献
专利

1. Heuristically-Accelerated Multiagent Reinforcement Learning [J] . Bianchi R.A.C., Martins M.F., Ribeiro C.H.C., Cybernetics, IEEE Transactions on . 2014,第2期

机译：启发式加速多主体强化学习
2. Comparative analysis of evolving artificial neural network and reinforcement learning in stochastic optimization of multireservoir systems [J] . Dariane Alireza B., Moradi Amir Mohammad Hydrological sciences journal . 2016,第5a8期

机译：演化神经网络与强化学习在多储层系统随机优化中的对比分析
3. How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis [J] . CollinsA.G.E., FrankM.J. The European Journal of Neuroscience . 2012,第7a8期

机译：强化学习中有多少是工作记忆而不是强化学习？行为，计算和神经遗传学分析
4. Heuristically-Accelerated Reinforcement Learning: A Comparative Analysis of Performance [C] . Murilo Fernandes Martins, Reinaldo A.C. Bianchi Annual conference on towards autonomous robotic systems . 2014

机译：启发式加速强化学习：性能比较分析
5. Features and Performance of Sarsa Reinforcement Learning Algorithm with Eligibility Traces and Local Environment Analysis for Bots in First Person Shooter Games [D] . Bundik Bettina Vivien 2020

机译：第一人称射击游戏中具有资格追踪和局部环境分析的Sarsa强化学习算法的功能和性能
6. How much of reinforcement learning is working memory not reinforcement learning? A behavioral computational and neurogenetic analysis [O] . Anne G. E. Collins, Michael J. Frank -1

机译：钢筋学习多少是工作记忆而不是加强学习？行为计算和神经肝分析
7. A Comparative Analysis of Expected and Distributional Reinforcement Learning [O] . Clare Lyle, Marc G. Bellemare, Pablo Samuel Castro 2019

机译：预期和分布增强学习的比较分析

Heuristically-Accelerated Reinforcement Learning: A Comparative Analysis of Performance

摘要

著录项

相似文献

相关主题

期刊订阅