首页> 美国政府科技报告 >CLEANing the Reward: Counterfactual Actions to Remove Exploratory Action Noise in Multiagent Learning (Extended Abstract).

【24h】

CLEANing the Reward: Counterfactual Actions to Remove Exploratory Action Noise in Multiagent Learning (Extended Abstract).

机译：清理奖励：在多智能体学习中消除探索性行为噪声的反事实行动（扩展摘要）。

获取原文

页面导航

著录项
引文网络
相似文献
相关主题

著录项

作者
Parker, C. H.; Taylor, M. E.; Tumer, K.; Agogino, A.;
展开▼
作者单位

展开▼
年度 2014
页码 1-2
总页数 2
原文格式 PDF
正文语种 eng
中图分类
关键词
Environmental dynamics; Algorithms; Experimentation; Performance; Coordination; Scalability; Exploration; Exploitation;

机译：环境动力学;算法;实验;性能;协调;可扩展性;探索;开发;

相似文献

外文文献
中文文献
专利

1. An abstract model of the basal ganglia, reward learning and action selection [J] . Pierre Berthet, Anders Lansner BMC Neuroscience . 2011,第SUPPLEMENTa1期

机译：基底神经节，奖励学习和动作选择的抽象模型
2. An abstract model of the basal ganglia, reward learning and action selection [J] . Pierre Berthet, Anders Lansner BMC Neuroscience . 2011,第SUPPLEMENTa1期

机译：基底神经节，奖励学习和动作选择的抽象模型
3. Consensus Control of General Linear Multiagent Systems With Antagonistic Interactions and Communication Noises [J] . Hu Jiangping, Wu Yanzhi, Li Tao, IEEE Transactions on Automatic Control . 2019,第5期

机译：具有拮抗作用和通信噪声的通用线性多主体系统的共识控制
4. CLEANing the Reward: Counterfactual Actions to Remove Exploratory Action Noise in Multiagent Learning [C] . Chris HolmesParker, Mathew E. Taylor, Adrian Agogino, International Conference on Autonomous Agents and Multiagent Systems . 2014

机译：清洁奖励：反事实行动，以消除多读学习中的探索性噪声
5. Learning partially observable Markov decision processes using abstract actions. [D] . Janzadeh, Hamed. 2012

机译：使用抽象动作学习部分可观察的马尔可夫决策过程。
6. An abstract model of the basal ganglia reward learning and action selection [O] . Pierre Berthet, Anders Lansner 2011

机译：基底神经节奖励学习和动作选择的抽象模型
7. An abstract model of the basal ganglia, reward learning and action selection [O] . Pierre Berthet, Anders Lansner 2011

机译：基底神经节，奖励学习和动作选择的抽象模型

CLEANing the Reward: Counterfactual Actions to Remove Exploratory Action Noise in Multiagent Learning (Extended Abstract).

著录项

引文网络

相似文献

相关主题

期刊订阅