首页> 外文会议>SIAM International Conference on Data Mining >Investigating Deep Reinforcement Learning Techniques in Personalized Dialogue Generation
【24h】

Investigating Deep Reinforcement Learning Techniques in Personalized Dialogue Generation

机译:调查个性化对话生成中的深度加强学习技巧

获取原文

摘要

In this paper, we propose a personalized dialogue generation system, which combines reinforcement learning techniques with an attention-based hierarchical recurrent encoder-decoder model. Firstly, we incorporate user-specific information into the decoder to capture user's background information and speaking style. Secondly, we employ reinforcement learning techniques to maximize future reward in dialogue, which enables our system to generate topic-coherent, informative and grammatical responses. Moreover, we propose three types of rewards to characterize good conversations. Finally, we compare the performance of the following reinforcement learning methods in dialogue generation: policy gradient, Q-learning, and actor-critic algorithms. We conduct experiments to verify the effectiveness of the proposed model on two dialogue datasets. Experimental results demonstrate that our model can generate better personalized dialogues for different users. Quantitatively, our method achieves better performance than the state-of-the-art dialogue systems in terms of BLEU score, perplexity, and human evaluation.
机译:在本文中,我们提出了一个个性化的对话生成系统,该系统将增强学习技术与基于注意的分层复制编码器 - 解码器模型相结合。首先,我们将特定于用户的信息纳入解码器中以捕获用户的背景信息和说话方式。其次,我们采用了加强学习技术来最大限度地提高对话中的未来奖励,这使我们的系统能够生成主题连贯,信息和语法响应。此外,我们提出了三种类型的奖励来表征良好的谈话。最后,我们比较对话生成中以下强化学习方法的表现:政策梯度,Q学习和演员 - 批评算法。我们进行实验以验证所提出模型对两个对话数据集的有效性。实验结果表明,我们的模型可以为不同的用户产生更好的个性化对话。定量地,我们的方法在Bleu得分,困惑和人类评估方面实现了比最先进的对话系统更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号