首页> 外文会议>International Florida Aritificial Intelligence Research Society Conference >Multi-Document Summarization Using Graph-Based Iterative Ranking Algorithms and Information Theoretical Distortion Measures
【24h】

Multi-Document Summarization Using Graph-Based Iterative Ranking Algorithms and Information Theoretical Distortion Measures

机译:基于图的迭代排名算法和信息理论失真措施的多文件概述

获取原文

摘要

Text summarization is an important field in the area of natural language processing and text mining. This paper proposes an extraction-based model which uses graph-based and information theoretic concepts for multi-document summarization. Our method constructs a directed weighted graph from the original text by adding a vertex for each sentence, and compute a weighted edge between sentences which is based on distortion measures. In this paper we proposed a combination of these two models by representing the input as a graph, using distortion measures as the weight function and a ranking algorithm. Finally, a ranking algorithm is applied to identify the most important sentences to be included in the summary. By defining a proper distortion measure and ranking algorithm, this model gains promising results on the DUC2002 which is a well known real world data set. The results and ROUGE-1 scores of our model is fairly close to other successful models.
机译:文本摘要是自然语言处理和文本挖掘领域的重要领域。本文提出了一种基于提取的模型,它使用基于图的和信息理论概念进行多文件摘要。我们的方法通过为每个句子添加顶点来构造来自原始文本的定向加权图,并计算基于失真措施的句子之间的加权边缘。在本文中,我们通过将输入作为曲线图表示为曲线图,提出了这两个模型的组合,其使用失真措施作为权重函数和排名算法。最后,应用排名算法来标识要包含在摘要中的最重要的句子。通过定义适当的失真测量和排序算法,该模型在DUC2002上获得了有希望的结果,这是一个众所周知的真实世界数据集。我们模型的结果和Rouge-1分数相当接近其他成功的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号