首页> 外文会议>Conference on empirical methods in natural language processing >Curriculum Learning Based on Reward Sparseness for Deep Reinforcement Learning of Task Completion Dialogue Management
【24h】

Curriculum Learning Based on Reward Sparseness for Deep Reinforcement Learning of Task Completion Dialogue Management

机译:基于奖励稀疏的课程学习,以对对话管理的深度加固学习

获取原文

摘要

Learning from sparse and delayed reward is a central issue in reinforcement learning. In this paper, to tackle reward sparseness problem of task oriented dialogue management, we propose a curriculum based approach on the number of slots of user goals. This curriculum makes it possible to learn dialogue management for sets of user goals with large number of slots. We also propose a dialogue policy based on progressive neural networks whose modules with parameters are appended with previous parameters fixed as the curriculum proceeds, and this policy improves performances over the one with single set of parameters.
机译:从稀疏和延迟奖励中学习是强化学习的核心问题。在本文中,为了解决任务导向的对话管理的奖励稀疏问题,我们提出了一种基于课程的方法,用于用户目标的插槽数量。此课程可以为具有大量插槽的用户目标组学习对话管理。我们还提出了一种基于渐进神经网络的对话策略,其模块与课程所得级的先前参数附加有前一个参数,而本策略可提高单一参数的单一参数的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号