首页> 外文会议>IEEE International Workshop on Semantic Computing and Systems >Personalized PageRank based Multi-document Summarization
【24h】

Personalized PageRank based Multi-document Summarization

机译:基于个性化PageRank的多文件摘要

获取原文

摘要

This paper presents a novel multi-document summarization approach based on Personalized PageRank (PPRSum). In this algorithm, we uniformly integrate various kinds of information in the corpus. At first, we train a salience model of sentence global features based on Naive Bayes Model. Secondly, we generate a relevance model for each corpus utilizing the query of it. Then, we compute the personalized prior probability for each sentence in the corpus utilizing the salience model and the relevance model both. With the help of personalized prior probability, a Personalized PageRank ranking process is performed depending on the relationships among all sentences in the corpus. Additionally, the redundancy penalty is imposed on each sentence. The summary is produced by choosing the sentences with both high query-focused information richness and high information novelty. Experiments on DUC2007 are performed and the ROUGE evaluation results show that PPRSum ranks between the 1st and the 2nd systems on DUC2007 main task.
机译:本文提出了一种基于个性化PageRank(PPRSUM)的新型多文件摘要方法。在该算法中,我们在语料库中统一地集成了各种信息。首先,我们培养了基于天真贝叶斯模型的句子全球特征的蓬勃发展。其次,我们为利用它的查询生成每个语料库的相关模型。然后,我们在利用Parience模型和相关模型中计算语料库中的每个句子的个性化的先前概率。借助个性化的先前概率,根据语料库中所有句子之间的关系来执行个性化PageRank排名进程。此外,每个句子都强加了冗余惩罚。通过选择具有高查询集中信息丰富和高信息新颖性的句子来制作摘要。执行关于DUC2007的实验,Rouge评估结果表明,PPRSUM在DUC2007主任务上的第1和第2系统之间排名。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号