首页> 外文会议>International conference on computational linguistics >Sequence-to-Sequence Data Augmentation for Dialogue Language Understanding
【24h】

Sequence-to-Sequence Data Augmentation for Dialogue Language Understanding

机译:对话语言理解的序列到序列数据增强

获取原文

摘要

In this paper, we study the problem of data augmentation for language understanding in task-oriented dialogue system. In contrast to previous work which augments an utterance without considering its relation with other utterances, we propose a scquence-to-scqucncc generation based data augmentation framework that leverages one utterance's same semantic alternatives in the training data. A novel diversity rank is incorporated into the utterance representation to make the model produce diverse utterances and these diversely augmented utterances help to improve the language understanding module. Experimental results on the Airline Travel Information System dataset and a newly created semantic frame annotation on Stanford Multi-turn, Multi-domain Dialogue Dataset show that our framework achieves significant improvements of 6.38 and 10.04 F-scores respectively when only a training set of hundreds utterances is represented. Case studies also confirm that our method generates diverse utterances.
机译:在本文中,我们研究了任务导向对话系统中语言理解的数据增强问题。与之前的工作相比,在不考虑其与其他话语关系的情况下增强了一个话语,我们提出了一种基于练习的数据增强框架,它利用了一个话语在训练数据中相同的语义替代品。新颖的多样性等级被纳入话语代表,以使模型产生不同的话语,这些多样化的话语有助于改善语言理解模块。在航空公司旅行信息系统数据集和新创建的语义帧注释对斯坦福多匝,多域对话数据集的实验结果表明,当只有数百个话语训练套时,我们的框架分别实现了6.38和10.04 F分别的显着改进代表。案例研究还证实我们的方法产生不同的话语。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号