Annual Meeting of the Association for Computational Linguistics

Sentence Centrality Revisited for Unsupervised Summarization

Abstract

Single-document summarization has enjoyed renewed interest in recent years thanks to the popularity of neural network models and the availability of large-scale datasets. In this paper we develop an unsupervised approach, arguing that it is unrealistic to expect large-scale, high-quality training data to be available or created for different types of summaries, domains, or languages. We revisit a popular graph-based ranking algorithm and modify how node (i.e., sentence) centrality is computed in two ways: (a) we employ BERT, a state-of-the-art neural representation learning model, to better capture sentential meaning, and (b) we build graphs with directed edges, arguing that the contribution of any two nodes to their respective centrality is influenced by their relative position in a document. Experimental results on three news summarization datasets representative of different languages and writing styles show that our approach outperforms strong baselines by a wide margin.
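
The abstract describes the method only at a high level. Below is a minimal sketch of the two modifications it names, BERT-based sentence representations and direction-sensitive centrality, assuming mean-pooled BERT token embeddings from Hugging Face transformers and hypothetical forward/backward weights lambda_fwd and lambda_bwd; the paper's actual sentence encoding, similarity function, and weighting scheme may differ.

```python
# Sketch of directed, position-aware sentence centrality for
# unsupervised extractive summarization. Assumptions (not from the
# paper): bert-base-uncased with mean pooling as the sentence encoder,
# cosine similarity for edge weights, and illustrative lambda values.
import numpy as np
import torch
from transformers import AutoModel, AutoTokenizer


def embed_sentences(sentences, model_name="bert-base-uncased"):
    """Mean-pool BERT token embeddings into one vector per sentence."""
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()
    vecs = []
    with torch.no_grad():
        for s in sentences:
            inputs = tokenizer(s, return_tensors="pt", truncation=True)
            hidden = model(**inputs).last_hidden_state  # (1, T, H)
            vecs.append(hidden.mean(dim=1).squeeze(0).numpy())
    emb = np.stack(vecs)
    # Normalize so dot products below are cosine similarities.
    return emb / np.linalg.norm(emb, axis=1, keepdims=True)


def directed_centrality(emb, lambda_fwd=1.0, lambda_bwd=0.3):
    """Score each sentence by its similarity mass to later sentences
    (forward edges) and earlier sentences (backward edges), weighted
    separately so that relative position in the document matters."""
    sim = emb @ emb.T
    n = len(emb)
    scores = np.zeros(n)
    for i in range(n):
        fwd = sim[i, i + 1:].sum()  # edges to sentences after i
        bwd = sim[i, :i].sum()      # edges to sentences before i
        scores[i] = lambda_fwd * fwd + lambda_bwd * bwd
    return scores


def summarize(sentences, k=3):
    """Extract the k highest-scoring sentences, kept in document order."""
    scores = directed_centrality(embed_sentences(sentences))
    top = sorted(np.argsort(scores)[::-1][:k])
    return [sentences[i] for i in top]
```

In this sketch, setting lambda_fwd above lambda_bwd favors sentences that the rest of the document elaborates on, which is one way to read the abstract's claim that the contribution of an edge to centrality should depend on the relative position of its endpoints; the specific values here are illustrative placeholders, not tuned parameters from the paper.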