【24h】

Summarising Historical Text in Modern Languages

机译:在现代语言中概述历史文本

获取原文

摘要

We introduce the task of historical text summarisation, where documents in historical forms of a language are summarised in the corresponding modern language. This is a fundamentally important routine to historians and digital humanities researchers but has never been automated. We compile a high-quality gold-standard text summarisation dataset, which consists of historical German and Chinese news from hundreds of years ago summarised in modern German or Chinese. Based on cross-lingual transfer learning techniques, we propose a summarisation model that can be trained even with no cross-lingual (historical to modern) parallel data, and further benchmark it against state-of-the-art algorithms. We report automatic and human evaluations that distinguish the historic to modern language summarisation task from standard cross-lingual summarisation (i.e., modern to modern language), highlight the distinctness and value of our dataset. and demonstrate that our transfer learning approach outperforms standard cross-lingual benchmarks on this task.
机译:我们介绍了历史文本汇总的任务,其中语言的历史形式的文件总结在相应的现代语言中。这是历史学家和数字人文研究人员的根本重要的例程,但从未自动化。我们编制了一个高质量的金标准文本汇总数据集,其中包括来自数百年前的历史德国和中国新闻,总结在现代德国或中国人。基于交叉传输学习技术,我们提出了一个可以训练的概要模型,即使没有跨语言(历史到现代)并行数据,还可以进一步基准,而不是最先进的算法。我们报告了自动和人性化评估,将历史性与现代语言汇总任务区分开于标准的交叉术语(即现代语言),突出了我们数据集的明显和价值。并证明我们的转移学习方法在这项任务上占此标准的交叉基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号