首页> 外文会议>European Workshop on Natural Language Generation >Learning Adaptive Referring Expression Generation Policies for Spoken Dialogue Systems
【24h】

Learning Adaptive Referring Expression Generation Policies for Spoken Dialogue Systems

机译:学习用于口头对话系统的自适应表达式生成策略

获取原文

摘要

We address the problem that different users have different lexical knowledge about problem domains, so that automated dialogue systems need to adapt their generation choices online to the users' domain knowledge as it encounters them. We approach this problem using Reinforcement Learning in Markov Decision Processes (MDP). We present a reinforcement learning framework to learn adaptive referring expression generation (REG) policies that can adapt dynamically to users with different domain knowledge levels. In contrast to related work we also propose a new statistical user model which incorporates the lexical knowledge of different users. We evaluate this framework by showing that it allows us to learn dialogue policies that automatically adapt their choice of referring expressions online to different users, and that these policies are significantly better than hand-coded adaptive policies for this problem. The learned policies are consistently between 2 and 8 turns shorter than a range of different hand-coded but adaptive baseline REG policies.
机译:我们解决了不同用户对问题域具有不同词汇知识的问题,因此自动对话系统需要在遇到它们的用户的域知识上对用户的域知识进行调整。我们在马尔可夫决策过程(MDP)中使用强化学习来解决这个问题。我们提出了一种加强学习框架,用于学习可以使用不同域知识级别的用户动态调整的自适应引用的表达式(REG)策略。与相关工作相比,我们还提出了一种新的统计用户模型,它包含不同用户的词汇知识。我们通过表明它允许我们学习自动调整其在线选择的对话策略,以便在线调整对不同的用户,并且这些策略显着优于此问题的手工编码的自适应策略。学习的策略始终如一的2和8匝数短于一系列不同的手工编码但自适应基线reg策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号