Long-Span Summarization via Local Attention and Content Selection

Abstract

Transformer-based models have achieved state-of-the-art results in a wide range of natural language processing (NLP) tasks, including document summarization. Typically these systems are trained by fine-tuning a large pre-trained model to the target task. One issue with these transformer-based models is that they do not scale well in terms of memory and compute requirements as the input length grows. Thus, for long document summarization, it can be challenging to train or fine-tune these models. In this work, we exploit large pre-trained transformer-based models and address long-span dependencies in abstractive summarization using two methods: local self-attention and explicit content selection. These approaches are compared on a range of network configurations. Experiments are carried out on standard long-span summarization tasks, including the Spotify Podcast, arXiv, and PubMed datasets. We demonstrate that by combining these methods, we can achieve state-of-the-art ROUGE results on all three tasks. Moreover, without a large-scale GPU card, our approach can achieve comparable or better results than existing approaches.
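To make the local self-attention idea mentioned in the abstract concrete, here is a minimal NumPy sketch (not the authors' implementation) in which each query position attends only to keys within a fixed window around it, so the number of attended positions per token stays constant as the input grows. The function name local_self_attention and the window parameter are illustrative assumptions; for clarity this sketch still materializes the full score matrix, whereas an efficient implementation would compute only the banded scores.

import numpy as np

def local_self_attention(Q, K, V, window):
    """Single-head scaled dot-product attention restricted to a local window.

    Each query position i attends only to key positions j with |i - j| <= window.
    Q, K, V: arrays of shape (seq_len, d_model).
    Illustrative sketch: builds the full (seq_len, seq_len) score matrix and masks it,
    rather than computing only the band as an efficient implementation would.
    """
    seq_len, d = Q.shape
    scores = Q @ K.T / np.sqrt(d)                     # (seq_len, seq_len) raw scores

    # Band mask: block positions outside the local window before the softmax.
    idx = np.arange(seq_len)
    outside = np.abs(idx[:, None] - idx[None, :]) > window
    scores = np.where(outside, -1e9, scores)

    # Numerically stable softmax over the allowed positions only.
    scores -= scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

# Example: 8 tokens, 4-dim embeddings, window of 2 tokens on each side.
rng = np.random.default_rng(0)
x = rng.normal(size=(8, 4))
out = local_self_attention(x, x, x, window=2)
print(out.shape)  # (8, 4)

The explicit content-selection step described in the paper would sit in front of such a model, choosing the most salient input segments so that the sequence passed to the summarizer fits within the attention span; that selection logic is not shown here.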
