首页> 外文会议>AAAI Conference on Artificial Intelligence >Determinantal Reinforcement Learning

【24h】

Determinantal Reinforcement Learning

机译：决定性加强学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We study reinforcement learning for controlling multiple agents in a collaborative manner. In some of those tasks, it is insufficient for the individual agents to take relevant actions, but those actions should also have diversity. We propose the approach of using the determinant of a positive semidefinite matrix to approximate the action-value function in reinforcement learning, where we learn the matrix in a way that it represents the relevance and diversity of the actions. Experimental results show that the proposed approach allows the agents to learn a nearly optimal policy approximately ten times faster than baseline approaches in benchmark tasks of multi-agent reinforcement learning. The proposed approach is also shown to achieve the performance that cannot be achieved with conventional approaches in partially observable environment with exponentially large action space.

机译：我们以协同方式研究加固学习，用于控制多个代理。在其中一些任务中，个人代理人不足以采取相关行动，但这些行动也应该具有多样性。我们提出了使用正半纤维矩阵的决定因素的方法来近似于加强学习中的动作值函数，在那里我们以代表行动的相关性和多样性的方式学习矩阵。实验结果表明，该拟议的方法允许代理人在多智能体加固学习基准任务中的基准方法速度快大约最佳政策大约最佳的政策。还显示了所提出的方法，以实现具有常规方法的常规方法无法实现的性能，其具有指数大的动作空间。

著录项

来源
《AAAI Conference on Artificial Intelligence 》|2019年|4512-4982p|共8页
会议地点
作者
Takayuki Osogami; Rudy Raymond;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning [J] . Naoto Horie, Tohgoroh Matsui, Koichi Moriyama, Artificial life and robotics . 2019 ,第3期

机译：多目标安全强化学习：多目标强化学习与安全强化学习之间的关系
2. Reinforcement learning in learning automata and cellular learning automata via multiple reinforcement signals [J] . Vafashoar Reza, Meybodi Mohammad Reza Knowledge-Based Systems . 2019 ,第APRa1期

机译：通过多个增强信号学习自动机和细胞学习自动机中的增强学习
3. Workshop on Distributed Reinforcement Learning and Reinforcement-Learning Games [Conference Reports] [J] . Kyriakos G. Vamvoudakis, Yan Wan, Frank L. Lewis Control Systems, IEEE . 2019 ,第6期

机译：分布式强化学习和加固学习游戏研讨会[会议报告]
4. Determinantal Reinforcement Learning [C] . Takayuki Osogami, Rudy Raymond AAAI Conference on Artificial Intelligence . 2019

机译：决定性加强学习
5. Reinforcement Learning and Recurrent Reinforcement Learning for Dynamic Portfolio Optimization [D] . Almahdi, Saud 2019

机译：强化学习和循环强化学习以实现动态资产组合优化
6. Frequency of reinforcement as a determinant of extinction-induced aggression during errorless discrimination learning. [O] . M Rilling, H J Caplan 1975

机译：强化的频率作为无误判别学习过程中灭绝诱发的攻击行为的决定因素。
7. Determinantal Reinforcement Learning [O] . Takayuki Osogami, Rudy Raymond 2019

机译：决定性加强学习

Determinantal Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅