首页> 外文期刊>Royal Society Open Science >Evaluating prose style transfer with the Bible
【24h】

Evaluating prose style transfer with the Bible

机译:用圣经评估散文风格的转移

获取原文
           

摘要

In the prose style transfer task a system, provided with text input and a target prose style, produces output which preserves the meaning of the input text but alters the style. These systems require parallel data for evaluation of results and usually make use of parallel data for training. Currently, there are few publicly available corpora for this task. In this work, we identify a high-quality source of aligned, stylistically distinct text in different versions of the Bible. We provide a standardized split, into training, development and testing data, of the public domain versions in our corpus. This corpus is highly parallel since many Bible versions are included. Sentences are aligned due to the presence of chapter and verse numbers within all versions of the text. In addition to the corpus, we present the results, as measured by the BLEU and PINC metrics, of several models trained on our data which can serve as baselines for future research. While we present these data as a style transfer corpus, we believe that it is of unmatched quality and may be useful for other natural language tasks as well.
机译:在散文风格转换任务中,一个提供文本输入和目标散文风格的系统会产生输出,该输出保留输入文本的含义,但会更改样式。这些系统需要并行数据来评估结果,并且通常将并行数据用于训练。当前,很少有公开可用的语料库来完成此任务。在这项工作中,我们在不同版本的圣经中找到了高质量的对齐,风格独特的文本来源。我们提供了我们语料库中公共领域版本的标准化划分,包括培训,开发和测试数据。由于包括了许多圣经版本,因此该语料库是高度并行的。由于文本的所有版本中都存在章节和经文编号,因此使句子对齐。除了语料库,我们还提供了根据BLEU和PINC度量标准测得的几种基于我们的数据训练的模型的结果,这些模型可以用作将来研究的基准。尽管我们将这些数据作为样式传递语料库提供,但我们认为它的质量无与伦比,并且对于其他自然语言任务也可能有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号