首页> 外文会议>Conference of the European Chapter of the Association for Computational Linguistics >Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs
【24h】

Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs

机译:在不使用带注释的平行对的情况下生成语法控制的释义

获取原文

摘要

Paraphrase generation plays an essential role in natural language process (NLP), and it has many downstream applications. However, training supervised paraphrase models requires many annotated paraphrase pairs, which are usually costly to obtain. On the other hand, the paraphrases generated by existing unsupervised approaches are usually syntactically similar to the source sentences and are limited in diversity. In this paper, we demonstrate that it is possible to generate syntactically various paraphrases without the need for annotated paraphrase pairs. We propose Syntactically controlled Paraphrase Generator (SynPG), an encoder-decoder based model that learns to disentangle the semantics and the syntax of a sentence from a collection of unan-notated texts. The disentanglement enables SynPG to control the syntax of output paraphrases by manipulating the embedding in the syntactic space. Extensive experiments using automatic metrics and human evaluation show that SynPG performs better syntactic control than unsupervised baselines, while the quality of the generated paraphrases is competitive. We also demonstrate that the performance of SynPG is competitive or even better than supervised models when the unannotated data is large. Finally, we show that the syntactically controlled paraphrases generated by SynPG can be utilized for data augmentation to improve the robustness of NLP models.
机译:释义生成在自然语言过程(NLP)中起着重要作用,并且它具有许多下游应用。然而,培训监督的解释模型需要许多注释的释义对,这通常是获得的。另一方面,现有无监督方法产生的释义通常与源句要同意地类似,并且在多样性中受到限制。在本文中,我们证明可以在没有需要注释的释义对的情况下产生句法各种释义。我们提出了语法控制的释义生成器(Synpg),一个基于编码器解码器的模型,它学会解除语义中的语义和句子的语法,从一系列UNAN记录的文本中。 DisonDangement使Synpg能够通过操纵句法空间中的嵌入来控制输出释义的语法。使用自动度量和人类评估的广泛实验表明,Synpg执行比无监督的基线更好的句法控制,而生成的释义的质量具有竞争力。我们还证明,当未定位的数据很大时,Synpg的性能竞争甚至更好。最后,我们表明Synpg生成的语法控制释义可以用于数据增强以提高NLP模型的鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号