【24h】

Cross-Lingual Abstractive Summarization with Limited Parallel Resources

机译:具有有限并行资源的交叉语言抽象总结

获取原文

摘要

Parallel cross-lingual summarization data is scarce, requiring models to better use the limited available cross-lingual resources. Existing methods to do so often adopt sequence-to-sequence networks with multi-task frameworks. Such approaches apply multiple decoders, each of which is utilized for a specific task. However, these independent decoders share no parameters, hence fail to capture the relationships between the discrete phrases of summaries in different languages, breaking the connections in order to transfer the knowledge of the high-resource languages to low-resource languages. To bridge these connections, we propose a novel Multi-Task framework for Cross-Lingual Abstractive Summarization (MCLAS) in a low-resource setting. Employing one unified decoder to generate the sequential concatenation of monolingual and cross-lingual summaries, MCLAS makes the monolingual summarization task a prerequisite of the cross-lingual summarization (CLS) task. In this way, the shared decoder learns interactions involving alignments and summary patterns across languages, which encourages attaining knowledge transfer. Experiments on two CLS datasets demonstrate that our model significantly outperforms three baseline models in both low-resource and full-dataset scenarios. Moreover, in-depth analysis on the generated summaries and attention heads verifies that interactions are learned well using MCLAS, which benefits the CLS task under limited parallel resources.
机译:并行交叉术概要数据是稀缺的,需要模型更好地使用有限的可用交叉资源。现有方法如此通常采用具有多任务框架的序列到序列网络。这种方法应用多个解码器,每个解码器用于特定任务。然而,这些独立解码器不共享任何参数,因此无法捕获不同语言的摘要的离散短语之间的关系,打破连接,以便将高资源语言的知识转移到低资源语言。要桥接这些连接,我们提出了一种用于低资源设置中的跨语言抽象摘要(MCLA)的新型多任务框架。采用一个统一的解码器来生成单晶和交叉概要的顺序连接,MCLA使单声道摘要任务成为交叉逻辑摘要(CLS)任务的先决条件。通过这种方式,共享解码器学习涉及跨语言的对齐和摘要模式的交互,这鼓励实现知识转移。两个CLS数据集上的实验表明,我们的模型在低资源和全部数据集方案中显着优于三种基线模型。此外,对所产生的摘要和注意力头的深入分析验证了使用MCLA学习良好的交互,这使得CLS任务在有限的并行资源下。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号