首页> 外文期刊>ACM transactions on Asian language information processing >Regularizing Output Distribution of Abstractive Chinese Social Media Text Summarization for Improved Semantic Consistency
【24h】

Regularizing Output Distribution of Abstractive Chinese Social Media Text Summarization for Improved Semantic Consistency

机译:规范化中文抽象社交媒体文本摘要的输出分布,以改善语义一致性

获取原文
获取原文并翻译 | 示例
       

摘要

The Abstractive text summarization is a highly difficult problem, and the sequence-to-sequence model has shown success in improving the performance on the task. However, the generated summaries are often inconsistent with the source content in semantics. In such cases, when generating summaries, the model selects semantically unrelated words with respect to the source content as the most probable output. The problem can be attributed to heuristically constructed training data, where summaries can be unrelated to the source content, thus containing semantically unrelated words and spurious word correspondence. In this article, we propose a regularization approach for the sequence-to-sequence model and make use of what the model has learned to regularize the learning objective to alleviate the effect of the problem. In addition, we propose a practical human evaluation method to address the problem that the existing automatic evaluation method does not evaluate the semantic consistency with the source content properly. Experimental results demonstrate the effectiveness of the proposed approach, which outperforms almost all the existing models. Especially, the proposed approach improves the semantic consistency by 4% in terms of human evaluation.
机译:摘要文本摘要是一个非常困难的问题,并且序列到序列模型已显示出成功提高任务性能的成功。但是,生成的摘要在语义上通常与源内容不一致。在这种情况下,当生成摘要时,模型会选择相对于源内容在语义上不相关的单词作为最可能的输出。该问题可归因于启发式构造的训练数据,其中摘要可能与源内容无关,因此包含语义上不相关的单词和虚假单词对应关系。在本文中,我们提出了一种序列到序列模型的正则化方法,并利用该模型学到的知识来正则化学习目标,以减轻问题的影响。此外,我们提出了一种实用的人工评估方法,以解决现有的自动评估方法无法正确评估与源内容的语义一致性的问题。实验结果证明了该方法的有效性,该方法优于几乎所有现有模型。特别是,该方法在人工评估方面将语义一致性提高了4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号