【24h】

Sentence-Level Content Planning and Style Specification for Neural Text Generation

机译:神经文本生成的句子级内容规划和样式规范

获取原文

摘要

Building effective text generation systems requires three critical components: content selection, text planning, and surface realization, and traditionally they are tackled as separate problems. Recent all-in-one style neural generation models have made impressive progress, yet they often produce outputs that are incoherent and unfaithful to the input. To address these issues, we present an end-to-end trained two-step generation model, where a sentence-level content planner first decides on the keyphrases to cover as well as a desired language style, followed by a surface realization decoder that generates relevant and coherent text. For experiments, we consider three tasks from domains with diverse topics and varying language styles: persuasive argument construction from Reddit, paragraph generation for normal and simple versions of Wikipedia, and abstract generation for scientific articles. Automatic evaluation shows that our system can significantly outperform competitive comparisons. Human judges further rate our system generated text as more fluent and correct, compared to the generations by its variants that do not consider language style.
机译:构建有效的文本生成系统需要三个关键组件:内容选择,文本计划和表面实现,并且传统上将它们作为单独的问题解决。最近的多合一样式神经生成模型取得了令人瞩目的进展,但它们通常会产生不连贯且忠实于输入的输出。为了解决这些问题,我们提出了一个端到端训练有素的两步生成模型,其中句子级别的内容计划者首先确定要覆盖的关键字以及所需的语言样式,然后由表面实现解码器生成相关和连贯的文本。对于实验,我们考虑了来自具有不同主题和不同语言风格的领域的三个任务:Reddit的说服力论点构建,普通和简单版本的Wikipedia的段落生成以及科学文章的抽象生成。自动评估表明,我们的系统可以大大胜过竞争比较。与不考虑语言风格的变体相比,人类法官进一步评价了我们系统生成的文本更加流畅和正确。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号