【24h】

Simplified online Q-learning for LEGO EV3 robot

机译:简体在线Q-Learning for Lego EV3机器人

获取原文

摘要

Q-learning is a kind of model-free reinforcement learning algorithm which is effective in Robot's navigation applications. Unfortunately, Lego Mindstorms EV3 robot's file writing speed is sometimes too slow to implement Q-learning algorithm. In this paper, an approach is proposed to simplify Q-learning discrete value table into a new version that stores only one optimum action and its Q-value instead of storing every action's Q-value in each state. Exploration and contrast experiments show that our algorithm learns much faster than the original Q-learning without losing the ability to find a better policy in navigation task.
机译:Q-Learning是一种无模型加强学习算法,它在机器人的导航应用中有效。不幸的是,Lego Mindstorms EV3机器人的文件写入速度有时太慢实现了Q学习算法。在本文中,提出了一种方法来简化Q-Learnal离散值表中的新版本,该新版本仅存储一个最佳动作及其Q值,而不是在每个状态中存储每个动作的Q值。探索和对比实验表明,我们的算法比原来的Q学得更快地学习了更快,而不会失去在导航任务中找到更好的政策的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号