首页> 外文会议>International conference on neural information processing >Obtaining Single Document Summaries Using Latent Dirichlet Allocation
【24h】

Obtaining Single Document Summaries Using Latent Dirichlet Allocation

机译:使用潜在Dirichlet分配获取单个文档摘要

获取原文

摘要

In this paper, we present a novel approach that makes use of topic models based on Latent Dirichlet allocation(LDA) for generating single document summaries. Our approach is distinguished from other LDA based approaches in that we identify the summary topics which best describe a given document and only extract sentences from those paragraphs within the document which are highly correlated given the summary topics. This ensures that our summaries always highlight the crux of the document without paying any attention to the grammar and the structure of the documents. Finally, we evaluate our summaries on the DUC 2002 Single document summarization data corpus using ROUGE measures. Our summaries had higher ROUGE values and better semantic similarity with the documents than the DUC summaries.
机译:在本文中,我们提出了一种新颖的方法,该方法利用基于潜在Dirichlet分配(LDA)的主题模型来生成单个文档摘要。我们的方法与其他基于LDA的方法的区别在于,我们确定最能描述给定文档的摘要主题,并且仅从文档中与摘要主题高度相关的那些段落中提取句子。这样可以确保我们的摘要始终突出显示文档的症结,而无需关注文档的语法和结构。最后,我们使用ROUGE度量对DUC 2002单文档摘要数据语料库的摘要进行评估。与DUC摘要相比,我们的摘要具有更高的ROUGE值和与文档更好的语义相似性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号