Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs

Abstract

Paraphrase generation plays an essential role in natural language processing (NLP) and has many downstream applications. However, training supervised paraphrase models requires many annotated paraphrase pairs, which are usually costly to obtain. On the other hand, the paraphrases generated by existing unsupervised approaches are usually syntactically similar to the source sentences and are limited in diversity. In this paper, we demonstrate that it is possible to generate syntactically diverse paraphrases without the need for annotated paraphrase pairs. We propose the Syntactically controlled Paraphrase Generator (SynPG), an encoder-decoder based model that learns to disentangle the semantics and the syntax of a sentence from a collection of unannotated texts. The disentanglement enables SynPG to control the syntax of output paraphrases by manipulating the embedding in the syntactic space. Extensive experiments using automatic metrics and human evaluation show that SynPG performs better syntactic control than unsupervised baselines, while the quality of the generated paraphrases is competitive. We also demonstrate that the performance of SynPG is competitive with, or even better than, that of supervised models when the unannotated data is large. Finally, we show that the syntactically controlled paraphrases generated by SynPG can be utilized for data augmentation to improve the robustness of NLP models.

Bibliographic details

Source
Conference of the European Chapter of the Association for Computational Linguistics, 2021, pp. 1022-1033 (12 pages)
Authors
Kuan-Hao Huang; Kai-Wei Chang
Original format: PDF
