Investigating Deep Reinforcement Learning Techniques in Personalized Dialogue Generation

Abstract

In this paper, we propose a personalized dialogue generation system that combines reinforcement learning techniques with an attention-based hierarchical recurrent encoder-decoder model. First, we incorporate user-specific information into the decoder to capture the user's background information and speaking style. Second, we employ reinforcement learning techniques to maximize future reward in dialogue, which enables our system to generate topic-coherent, informative, and grammatical responses. Moreover, we propose three types of rewards to characterize good conversations. Finally, we compare the performance of the following reinforcement learning methods in dialogue generation: policy gradient, Q-learning, and actor-critic algorithms. We conduct experiments on two dialogue datasets to verify the effectiveness of the proposed model. Experimental results demonstrate that our model can generate better personalized dialogues for different users. Quantitatively, our method outperforms state-of-the-art dialogue systems in terms of BLEU score, perplexity, and human evaluation.
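
As a rough illustration of what "maximizing future reward" with a policy gradient method can look like, the sketch below combines several per-turn reward components into one scalar, computes discounted future returns, and forms a REINFORCE-style surrogate loss. The abstract does not give the reward definitions, their weighting, or a discount factor, so the function names, the weights, and `gamma` here are all hypothetical and are not taken from the paper.

```python
import math
from typing import List, Sequence


def combine_rewards(components: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted sum of per-turn reward components.

    The abstract mentions three reward types; the weights are an assumption.
    """
    if len(components) != len(weights):
        raise ValueError("components and weights must have the same length")
    return sum(w * r for w, r in zip(weights, components))


def discounted_returns(rewards: Sequence[float], gamma: float = 0.9) -> List[float]:
    """Compute G_t = r_t + gamma * G_{t+1} for every dialogue turn t."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each turn can reuse the return of the turn after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def reinforce_loss(log_probs: Sequence[float], returns: Sequence[float]) -> float:
    """REINFORCE surrogate loss: -sum_t log pi(a_t | s_t) * G_t."""
    if len(log_probs) != len(returns):
        raise ValueError("log_probs and returns must have the same length")
    return -sum(lp * g for lp, g in zip(log_probs, returns))


if __name__ == "__main__":
    # Three turns, each scored by three hypothetical reward components.
    per_turn_components = [(1.0, 0.0, 0.5), (0.2, 0.4, 0.0), (0.0, 1.0, 1.0)]
    weights = (0.5, 0.25, 0.25)  # assumed weighting, not from the paper
    rewards = [combine_rewards(c, weights) for c in per_turn_components]
    returns = discounted_returns(rewards, gamma=0.9)
    log_probs = [math.log(0.5)] * len(returns)  # placeholder action log-probabilities
    print(rewards, returns, reinforce_loss(log_probs, returns))
```

In a real system the log-probabilities would come from the decoder and the loss would be minimized by gradient descent; Q-learning and actor-critic methods, which the abstract also compares, replace the sampled return with learned value estimates.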

Bibliographic details

Source
SIAM International Conference on Data Mining | 2018 | 764p | 9 pages
Authors
Min Yang; Qiang Qu; Kai Lei; Jia Zhu; Zhou Zhao; Xiaojun Chen; Joshua Z. Huang
Original format: PDF
Chinese Library Classification: TP274-53
Keywords
Dialogue generation; Reinforcement learning; Personalized system; Deep learning
