Curriculum Learning Based on Reward Sparseness for Deep Reinforcement Learning of Task Completion Dialogue Management

Abstract

Learning from sparse and delayed rewards is a central issue in reinforcement learning. In this paper, to tackle the reward sparseness problem in task-oriented dialogue management, we propose a curriculum based on the number of slots in user goals. This curriculum makes it possible to learn dialogue management for sets of user goals with a large number of slots. We also propose a dialogue policy based on progressive neural networks, in which new modules with their own parameters are appended as the curriculum proceeds while the previously learned parameters are kept fixed; this policy improves performance over one with a single set of parameters.
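The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation: the goal format (a dict from slot name to value), the names `build_curriculum`, `Column`, and `ProgressivePolicy`, and the example goals are all assumptions made for illustration. A `Column` is only a placeholder for a network module, and the lateral connections of a real progressive neural network are omitted. The sketch shows only the staging of user goals by slot count and the freezing of earlier parameter sets when a new one is appended.

```python
from collections import defaultdict
from dataclasses import dataclass, field


def build_curriculum(user_goals):
    """Group user goals into stages ordered by slot count, fewest slots first."""
    stages = defaultdict(list)
    for goal in user_goals:
        stages[len(goal)].append(goal)
    return [stages[n] for n in sorted(stages)]


@dataclass
class Column:
    """One set of policy parameters (hypothetical stand-in for a network module)."""
    stage: int
    trainable: bool = True


@dataclass
class ProgressivePolicy:
    columns: list = field(default_factory=list)

    def add_column(self):
        # Freeze every existing column, then append a fresh trainable one.
        for col in self.columns:
            col.trainable = False
        self.columns.append(Column(stage=len(self.columns)))
        return self.columns[-1]


# Hypothetical user goals: each maps slot names to requested values.
goals = [
    {"date": "friday", "city": "tokyo", "tickets": "2"},
    {"city": "kyoto"},
    {"date": "monday", "city": "osaka"},
    {"date": "sunday"},
]

curriculum = build_curriculum(goals)
policy = ProgressivePolicy()
for stage_goals in curriculum:
    policy.add_column()
    # A reinforcement learning loop over stage_goals would run here.

stage_sizes = [len(stage) for stage in curriculum]
trainable_flags = [col.trainable for col in policy.columns]
```

Under these assumptions, goals with fewer slots are trained on first, where a successful dialogue (and hence a reward) is easier to reach, and only the most recently added column remains trainable at each stage.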

Bibliographic details

Source: 2018 EMNLP Workshop SCAI: 2nd International Workshop on Search-Oriented Conversational AI, 2018, pp. 46-51 (6 pages)
Conference location: Brussels (BE)
Author: Atsushi Saito
Affiliation: Nextremer Co., Ltd., Tokyo, Japan
Original format: PDF
Language: English
