Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems

Abstract

In this work, we present a hybrid learning method for training task-oriented dialogue systems through online user interactions. Popular methods for learning task-oriented dialogues include applying reinforcement learning with user feedback on top of supervised pre-trained models. The efficiency of such learning methods may suffer from the mismatch in dialogue state distribution between the offline training and online interactive learning stages. To address this challenge, we propose a hybrid imitation and reinforcement learning method, with which a dialogue agent can effectively learn from its interactions with users by learning from human teaching and feedback. We design a neural network based task-oriented dialogue agent that can be optimized end-to-end with the proposed learning method. Experimental results show that our end-to-end dialogue agent can learn effectively from the mistakes it makes via imitation learning from user teaching. Applying reinforcement learning with user feedback after the imitation learning stage further improves the agent's capability in successfully completing a task.
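
The three training stages named in the abstract (supervised pre-training, imitation learning from human teaching, reinforcement learning from user feedback) can be illustrated with a toy sketch. This is not the authors' neural architecture: the tabular softmax policy, the four hypothetical dialogue states, the `correct` teacher labels, the REINFORCE-style update, and the binary end-of-dialogue reward are all assumptions made for illustration only.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


class ToyPolicy:
    """Tabular softmax policy: one logit vector per (hypothetical) dialogue state."""

    def __init__(self, n_states, n_actions):
        self.logits = [[0.0] * n_actions for _ in range(n_states)]

    def probs(self, state):
        return softmax(self.logits[state])

    def sample(self, state, rng):
        actions = list(range(len(self.logits[state])))
        return rng.choices(actions, weights=self.probs(state))[0]

    def _step(self, state, action, scale):
        # Gradient of log pi(action | state) w.r.t. the logits is onehot - probs.
        p = self.probs(state)
        for a in range(len(p)):
            grad = (1.0 if a == action else 0.0) - p[a]
            self.logits[state][a] += scale * grad

    def imitation_step(self, state, teacher_action, lr=0.5):
        # Supervised / imitation update: raise the likelihood of the teacher's action.
        self._step(state, teacher_action, lr)

    def reinforce_step(self, trajectory, reward, lr=0.5):
        # REINFORCE-style update: weight each taken action by the dialogue-level reward.
        for state, action in trajectory:
            self._step(state, action, lr * reward)


def train(seed=0):
    rng = random.Random(seed)
    n_states, n_actions = 4, 3
    correct = [0, 2, 1, 2]  # assumed teacher labels, one per state
    policy = ToyPolicy(n_states, n_actions)

    def snapshot():
        return [policy.probs(s)[correct[s]] for s in range(n_states)]

    # Stage 1: supervised pre-training on an offline corpus that only covers states 0 and 1.
    for _ in range(20):
        for s in (0, 1):
            policy.imitation_step(s, correct[s])
    after_pretraining = snapshot()

    # Stage 2: imitation learning. The agent acts in every state it visits online,
    # and the (simulated) human teacher corrects it whenever it makes a mistake.
    for _ in range(20):
        for s in range(n_states):
            if policy.sample(s, rng) != correct[s]:
                policy.imitation_step(s, correct[s])
    after_imitation = snapshot()

    # Stage 3: reinforcement learning. The user only gives one binary signal per
    # dialogue: 1 if every turn was handled correctly, 0 otherwise.
    for _ in range(50):
        trajectory = [(s, policy.sample(s, rng)) for s in range(n_states)]
        reward = 1.0 if all(a == correct[s] for s, a in trajectory) else 0.0
        policy.reinforce_step(trajectory, reward)
    after_rl = snapshot()

    return after_pretraining, after_imitation, after_rl
```

Calling `train()` returns, for each stage, the probability the policy assigns to the teacher's action in every state. In this toy setup, states 2 and 3 are never seen during pre-training, which mimics the offline/online state-distribution mismatch the abstract describes; the imitation stage is where the policy first receives a learning signal for them.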

Bibliographic record

Source
Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies | 2018 | pp. 2060-2069 | 10 pages
Authors
Bing Liu; Gokhan Tür; Dilek Hakkani-Tür; Pararth Shah; Larry Heck
Original format: PDF
Indexed: 2022-08-26 13:51:18

Similar literature

1. End-to-End latent-variable task-oriented dialogue system with exact log-likelihood optimization [J]. Haotian Xu, Haiyun Peng, Haoran Xie. World Wide Web. 2020, No. 3
2. Memory-Augmented Dialogue Management for Task-Oriented Dialogue Systems [J]. Zhang Zheng, Huang Minlie, Zhao Zhongzhou. ACM Transactions on Information Systems. 2019, No. 3
3. A memory network based end-to-end personalized task-oriented dialogue generation [J]. Zhang Bowen, Xu Xiaofei, Li Xutao. Knowledge-Based Systems. 2020, No. Nova5
4. Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems [C]. Bing Liu, Gokhan Tür, Dilek Hakkani-Tür. Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies. 2018
5. Transfer Reinforcement Learning for Task-Oriented Dialogue Systems [D]. Mo, Kaixiang. 2018
6. General Didactics and Instructional Design: eyes like twins A transatlantic dialogue about similarities and differences about the past and the future of two sciences of learning and teaching [O]. Klaus Zierer, Norbert M Seel
7. Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems [O]. Bing Liu, Gokhan Tür, Dilek Hakkani-Tür. 2018
