【24h】

Generating Text Summaries through the Relative Importance of Topics

机译:通过主题的相对重要性生成文本摘要

获取原文
获取原文并翻译 | 示例

摘要

This work proposes a new extractive text-summarization algorithm based on the importance of the topics contained in a document. The basic ideas of the proposed algorithm are as follows. At first the document is partitioned by using the TextTiling algorithm, which identifies topics (coherent segments of text) based on the TF-IDF metric. Then for each topic the algorithm computes a measure of its relative relevance in the document. This measure is computed by using the notion of TF-ISF (Term Frequency - Inverse Sentence Frequency), which is our adaptation of the well-known TF-IDF (Term Frequency - Inverse Document Frequency) measure in information retrieval. Finally, the summary is generated by selecting from each topic a number of sentences proportional to the importance of that topic.
机译:这项工作基于文档中主题的重要性,提出了一种新的提取文本摘要算法。提出的算法的基本思想如下。首先,使用TextTiling算法对文档进行分区,该算法基于TF-IDF指标识别主题(文本的连贯段)。然后,对于每个主题,算法都会计算出其在文档中的相对相关性。该度量是通过使用TF-ISF(术语频率-逆句子频率)的概念来计算的,这是我们对信息检索中众所周知的TF-IDF(术语频率-逆文档频率)度量的改编。最后,通过从每个主题中选择与该主题的重要性成比例的多个句子来生成摘要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号