Annual Meeting of the Association for Computational Linguistics; International Joint Conference on Natural Language Processing

Learning from Perturbations: Diverse and Informative Dialogue Generation with Inverse Adversarial Training



Abstract

In this paper, we propose the Inverse Adversarial Training (IAT) algorithm for training neural dialogue systems to avoid generic responses and to model dialogue history better. In contrast to standard adversarial training algorithms, IAT encourages the model to be sensitive to perturbations in the dialogue history and thereby learn from them. By giving higher rewards to responses whose output probability decreases more significantly when the dialogue history is perturbed, the model is encouraged to generate more diverse and consistent responses. By penalizing the model when it generates the same response given a perturbed dialogue history, the model is forced to better capture the dialogue history and generate more informative responses. Experimental results on two benchmark datasets show that our approach can better model dialogue history and generate more diverse and consistent responses. In addition, we point out a problem with the widely used maximum mutual information (MMI)-based methods for improving the diversity of dialogue response generation models, and demonstrate it empirically.
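The abstract describes the core mechanism of IAT: reward responses whose likelihood drops sharply when the dialogue history is perturbed, and penalize responses that remain equally likely regardless of the history. The following is a minimal sketch of how such a reward signal could be computed, assuming a HuggingFace-style causal language model; the helper names (inverse_adversarial_reward, sequence_log_prob) and the exact form of the perturbation are illustrative assumptions, not taken from the paper.

import torch
import torch.nn.functional as F

def sequence_log_prob(model, context, response):
    """Sum of token-level log-probabilities of `response` given `context`
    under a causal LM with a HuggingFace-style forward API (returns .logits)."""
    inputs = torch.cat([context, response], dim=-1)          # [B, ctx_len + resp_len]
    logits = model(inputs).logits                             # [B, ctx_len + resp_len, V]
    # Logits at positions ctx_len-1 .. end-1 predict the response tokens.
    resp_logits = logits[:, context.size(-1) - 1:-1, :]
    log_probs = F.log_softmax(resp_logits, dim=-1)
    token_logp = log_probs.gather(-1, response.unsqueeze(-1)).squeeze(-1)
    return token_logp.sum(dim=-1)                             # [B]

def inverse_adversarial_reward(model, history, perturbed_history, response):
    """Illustrative IAT-style reward: how much does the response's likelihood
    drop when the dialogue history is perturbed (e.g., shuffled or masked)?
    A large drop means the response actually depends on the history (reward it);
    little or no drop means a generic, history-agnostic response (penalize it)."""
    logp_orig = sequence_log_prob(model, history, response)
    logp_pert = sequence_log_prob(model, perturbed_history, response)
    return logp_orig - logp_pert                               # [B], higher is better

In a full training loop, a scalar of this kind would typically serve as the reward in a policy-gradient (REINFORCE-style) update, which is one plausible way the abstract's "higher rewards" and penalties could be applied; the paper's exact training objective may differ.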
