首页> 外文会议>European Workshop on Reinforcement Learning >Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets

【24h】

Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets

机译：用改变作用集求解DEC-MDP的批量模式增强学习方法评价

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

DEC-MDPs with changing action sets and partially ordered transition dependencies have recently been suggested as a sub-class of general DEC-MDPs that features provably lower complexity. In this paper, we investigate the usability of a coordinated batch-mode reinforcement learning algorithm for this class of distributed problems. Our agents acquire their local policies independent of the other agents by repeated interaction with the DEC-MDP and concurrent evolvement of their policies, where the learning approach employed builds upon a specialized variant of a neural fitted Q iteration algorithm, enhanced for use in multi-agent settings. We applied our learning approach to various scheduling benchmark problems and obtained encouraging results that show that problems of current standards of difficulty can very well approximately, and in some cases optimally be solved.

机译：最近提出了具有更改动作集和部分有序转换依赖性的DEC-MDP，作为常规DEC-MDP的子类，其特征在于复杂性更低。在本文中，我们调查了对该类分布式问题的协调批量增强学习算法的可用性。我们的代理商通过重复与DEC-MDP的反复互动以及其政策的并发演变，从其他代理商独立于其他代理商获取当地政策，其中采用学习方法在神经拟合Q迭代算法的专用变体上建立，增强用于多个 - 代理设置。我们将学习方法应用于各种调度基准问题，并获得了令人鼓舞的结果，表明当前难度标准的问题非常好，在某些情况下最佳地解决。

著录项

来源
《European Workshop on Reinforcement Learning》|2008年||共14页
会议地点
作者
Thomas Gabel; Martin Riedmiller;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
入库时间 2022-08-20 21:22:00

相似文献

外文文献
中文文献
专利

1. Reinforcement learning approach to multi-stage decision making problems with changes in action sets [J] . Takuya Etoh, Hirotaka Takano, Junichi Murata Artificial life and robotics . 2012,第2期

机译：强化学习方法，可解决行动集变化带来的多阶段决策问题
2. Batch-Mode Reinforcement Learning for Continuous State Spaces: A Survey [J] . Gerhard Neumann OGAI Journal . 2008,第1期

机译：连续状态空间的批处理模式强化学习：一项调查
3. A multiagent reinforcement learning algorithm to solve the maximum independent set problem [J] . Alipour Mir Mohammad, Abdolhosseinzadeh Mohsen Multiagent and grid systems . 2020,第1期

机译：多钢筋求解学习算法，解决最大独立集合问题
4. Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets [C] . Thomas Gabel, Martin Riedmiller Recent advances in reinforcement learning . 2008

机译：批处理模式强化学习方法的评估，以解决具有变化的动作集的DEC-MDP
5. Improved empirical methods in reinforcement-learning evaluation [D] . Marivate, Vukosi N. 2015

机译：强化学习评估中改进的经验方法
6. Correction: Spike-Based Reinforcement Learning in Continuous State and Action Space: When Policy Gradient Methods Fail [O] . Eleni Vasilaki, Nicolas Frémaux, Robert Urbanczik, 2009

机译：更正：在连续状态和动作空间中基于峰值的强化学习：当策略梯度方法失败时
7. Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets [O] . Thomas Gabel, Martin Riedmiller 2009

机译：批处理模式强化学习方法的评估，以解决具有变化的动作集的DEC-MDP

Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets

摘要

著录项

相似文献

相关主题

期刊订阅