首页> 外文期刊>Journal of information and computational science >Sentence Scoring in Multi-document Summarizing under Topic Model LDA
【24h】

Sentence Scoring in Multi-document Summarizing under Topic Model LDA

机译:主题模型LDA下多文档摘要中的句子评分

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper automatic multi-document summarizing in a greedy framework is studied, where sentences are selected based on their contribution for the theme construction of the summary. The scores of sentences are evaluated based on their topic representations obtained from LDA (Latent Dirichlet Allocation), which is a probabilistic topic model. Consistent probabilistic representations of the relations between texts and topics are first proposed, and then two scoring methods are developed based on these representations. In addition the sentence length as an important factor in document summarizing is also studied. Experimental results show the pertinence of these probabilities, and the effectiveness of our scoring methods.
机译:本文研究贪婪框架中的自动多文档摘要,其中基于句子的贡献选择句子,以实现摘要的主题构建。基于从概率主题模型LDA(潜在Dirichlet分配)获得的主题表示,评估句子的分数。首先提出了文本和主题之间关系的一致概率表示,然后基于这些表示开发了两种评分方法。另外,还研究了句子长度作为文档摘要中的重要因素。实验结果表明了这些概率的相关性,以及我们的评分方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号