Comparing Reward Shaping, Visual Hints, and Curriculum Learning

机译：比较奖励塑造，视觉提示和课程学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

When considering how to reduce the learning effort required for Reinforcement Learning (RL) agents on complex tasks, designers can apply several common approaches. Reward shaping boosts the immediate reward provided by the environment, effectively encouraging (or discouraging) specific actions. Curriculum learning (Bengio et al. 2009) aims to help an agent learn a complex task by learning a sequence of simpler tasks. Hints may also be provided (e.g., a yellow brick road), which fall outside the notion of shaping or a curricula. Despite the prevalence of these approaches, few studies examine how they compare to (or complement) each other or when an approach is better.

机译：在考虑如何降低复杂任务中加强学习（RL）代理所需的学习努力，设计人员可以应用几种常见方法。奖励塑造提高了环境提供的直接奖励，有效地鼓励（或劝阻）具体行动。课程学习（Bengio等，2009）旨在帮助代理通过学习一系列更简单的任务来学习复杂任务。还可以提供提示（例如，黄砖路），其落在塑造或课程的概念之外。尽管这些方法存在普遍性，但很少有研究审查他们如何相互比较（或补充）或者当一种方法更好时。

著录项

来源
《AAAI Conference on Artificial Intelligence;Innovative Applications of Artificial Intelligence Conference;Symposium on Educational Advances in Artificial Intelligence》|2018年|7656-8223p|共2页
会议地点
作者
Rey Pocius; David Isele; Mark Roberts; David W. Aha;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Hierarchical automatic curriculum learning: Converting a sparse reward navigation task into dense reward [J] . Jiang Nan, Jin Sheng, Zhang Changshui Neurocomputing . 2019,第Sepa30期

机译：分层自动课程学习：将稀疏奖励导航任务转换为密集奖励
2. Hierarchical automatic curriculum learning: Converting a sparse reward navigation task into dense reward [J] . Jiang Nan, Jin Sheng, Zhang Changshui Neurocomputing . 2019,第SEPa30期

机译：分层自动课程学习：将稀疏奖励导航任务转换为密集奖励
3. Impact of an engineering design-based curriculum compared to an inquiry-based curriculum on fifth graders' content learning of simple machines [J] . Marulcu Ismail, Barnett Michael Research in science & technological education . 2016,第1期

机译：与基于查询的课程相比，基于工程设计的课程对五年级学生对简单机器的内容学习的影响
4. Comparing Reward Shaping, Visual Hints, and Curriculum Learning [C] . Rey Pocius, David Isele, Mark Roberts, AAAI Conference on Artificial Intelligence;Innovative Applications of Artificial Intelligence Conference;Symposium on Educational Advances in Artificial Intelligence . 2018

机译：比较奖励塑造，视觉提示和课程学习
5. The Effect of Delayed Feedback and Visual Hints Within a Gaming Environment to Facilitate Achievement of Different Learning Objectives [D] . Zeglen, Eric 2015

机译：延迟反馈和视觉提示在游戏环境中的影响，促进实现不同学习目标的影响
6. Visual-visual associative learning and reward-association learning in monkeys: the role of the amygdala [O] . D Gaffan, EA Gaffan, S Harrison 1989

机译：猴子的视觉-视觉联想学习和奖励联想学习：杏仁核的作用
7. Curriculum Learning Based on Reward Sparseness for Deep Reinforcement Learning of Task Completion Dialogue Management [O] . Atsushi Saito 2018

机译：基于奖励稀疏的课程学习，以对对话管理的深度加固学习

Comparing Reward Shaping, Visual Hints, and Curriculum Learning

摘要

著录项

相似文献

相关主题

期刊订阅