首页> 外文学位 >Learning Policies for Model-Based Reinforcement Learning Using Distributed Reward Formulation

【24h】

Learning Policies for Model-Based Reinforcement Learning Using Distributed Reward Formulation

机译：使用分布式奖励制定学习基于模型的强化学习的政策

获取原文

获取原文并翻译 | 示例

页面导航

著录项
相似文献
相关主题

著录项

作者
Agarwal, Nikhil.;
展开▼
作者单位

Arizona State University.;

展开▼
授予单位 Arizona State University.;
学科
学位 M.S.
年度 2021
页码 39 p.
总页数 39
原文格式 PDF
正文语种 eng
中图分类
关键词
入库时间 2022-08-17 12:03:41

相似文献

外文文献
中文文献
专利

1. States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. [J] . Glascher J, Daw N, Dayan P, Neuron . 2010,第4期

机译：状态与回报：基于模型和无模型的强化学习背后的可分离的神经预测错误信号。
2. Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards [J] . Rituraj Kaushik, Konstantinos Chatzilygeroudis, Jean-Baptiste Mouret JMLR: Workshop and Conference Proceedings . 2018,第1期

机译：基于多目标模型的策略搜索以稀疏奖励实现数据有效学习
3. The "Proactive" Model of Learning: Integrative Framework for Model-Free and Model-Based Reinforcement Learning Utilizing the Associative Learning-Based Proactive Brain Concept [J] . Zsuga Judit, Biro Klara, Papp Csaba, Behavioral neuroscience . 2016,第1期

机译：“主动”学习模型：利用基于联合学习的主动脑概念进行无模型和基于模型的强化学习的集成框架
4. OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning [C] . Peter Henderson, Wei-Di Chang, Pierre-Luc Bacon, AAAI Conference on Artificial Intelligence;Innovative Applications of Artificial Intelligence Conference;Symposium on Educational Advances in Artificial Intelligence . 2018

机译：申请：使用生成的对抗性反增强学习学习联合奖励政策选择
5. Understanding Model-Based Reinforcement Learning and its Application in Safe Reinforcement Learning [D] . Hu, Dingcheng . 2019

机译：了解基于模型的强化学习及其在安全强化学习中的应用
6. States versus Rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning [O] . Jan Gläscher, Nathaniel Daw, Peter Dayan, -1

机译：各种与奖励：可解离的神经预测误差信号底层模型和无模型加强学习
7. States versus Rewards: Dissociable Neural Prediction Error Signals Underlying Model-Based and Model-Free Reinforcement Learning [O] . Gläscher Jan, Daw Nathaniel, Dayan Peter, 2010

机译：状态与奖励：基于模型和免费模型的强化学习背后的可分离神经预测误差信号

Learning Policies for Model-Based Reinforcement Learning Using Distributed Reward Formulation

著录项

相似文献

相关主题

期刊订阅