KaBaGe-RL: Kanerva-based generalisation and reinforcement learning for possession football

机译：KaBaGe-RL：基于Kanerva的足球足球泛化和强化学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The complexity of most modem systems prohibits a hand-coded approach to decision making. In addition, many problems have continuous or large discrete state spaces; some have large or continuous action spaces. The problem of learning in large spaces is tackled through generalisation techniques, which allow compact representation of learned information and transfer of knowledge between similar states and actions. In this paper Kanerva. coding and reinforcement learning are combined to produce the KaBaGe-RL decision-making module. The purpose of KaBaGe-RL is twofold. Firstly, Kanerva coding is used as a generalisation method to produce a feature vector from the raw sensory input. Secondly, the reinforcement learning uses this feature vector in order to learn an optimal policy. The efficiency of KaBaGe-RL is tested using the "3 versus 2 possession football" challenge, a subproblem of the RoboCup domain. The results demonstrate that the learning approach outperforms a number of benchmark policies including a hand-coded one.

机译：大多数调制解调器系统的复杂性都禁止采用手动编码的方法进行决策。另外，许多问题具有连续或较大的离散状态空间。有些具有较大或连续的动作空间。大空间中的学习问题通过泛化技术解决，泛化技术可以紧凑地表示所学信息，并在相似状态和动作之间进行知识转移。在本文中，Kanerva。编码和强化学习相结合以产生KaBaGe-RL决策模块。 KaBaGe-RL的目的是双重的。首先，使用Kanerva编码作为一种概括方法，从原始的感觉输入中生成特征向量。其次，强化学习使用此特征向量来学习最佳策略。 KaBaGe-RL的效率是使用“ 3对2拥有足球”挑战赛（RoboCup域的一个子问题）进行测试的。结果表明，学习方法的性能优于许多基准策略，包括手工编码的策略。

著录项

来源
《Intelligent Robots and Systems, 2001. Proceedings. 2001 IEEE/RSJ International Conference on》|2001年|P.292-297|共6页
会议地点
作者
Kostiadis; K.; Huosheng Hu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning [J] . Naoto Horie, Tohgoroh Matsui, Koichi Moriyama, Artificial life and robotics . 2019,第3期

机译：多目标安全强化学习：多目标强化学习与安全强化学习之间的关系
2. The Influence of Match Status on Ball Possession in High Performance Women’s Football [J] . Rubén Maneiro, José L. Losada, Claudio A. Casal, Frontiers in Psychology . 2020,第a期

机译：匹配状况对高性能女性足球球占有的影响
3. The Influence of Match Status on Ball Possession in High Performance Women’s Football [J] . Maneiro Rubn, Losada Jos L., Casal Claudio A., Frontiers in Psychology . 2020,第2期

机译：匹配状况对高性能女性足球中球占有的影响
4. KaBaGe-RL: Kanerva-based generalisation and reinforcement learning for possession football [C] . Kostiadis K., Huosheng Hu, Institute of Electric and Electronic Engineer IEEE/RSJ International Workshop on Intelligent Robots and Systems . 2001

机译：Kabage-RL：基于Kanerva的泛化和占有足球的加强学习
5. Reinforcement Learning and Recurrent Reinforcement Learning for Dynamic Portfolio Optimization [D] . Almahdi, Saud 2019

机译：强化学习和循环强化学习以实现动态资产组合优化
6. The Influence of Match Status on Ball Possession in High Performance Women’s Football [O] . Rubén Maneiro, José L. Losada, Claudio A. Casal, 2020

机译：比赛状态对高性能女子足球持球的影响
7. KaBaGe-RL: Kanerva-based Generalisation and Reinforcement Learning for Possession Football [O] . Kostas Kostiadis, Huosheng Hu 2001

机译：KaBaGe-RL：基于Kanerva的足球足球泛化和强化学习

KaBaGe-RL: Kanerva-based generalisation and reinforcement learning for possession football

摘要

著录项

相似文献

相关主题

期刊订阅