首页> 外文会议>Pacific Rim International Conference on Artificial Intelligence(PRICAI 2006); 20060807-11; Guilin(CN) >Chinese Multi-document Summarization Using Adaptive Clustering and Global Search Strategy
【24h】

Chinese Multi-document Summarization Using Adaptive Clustering and Global Search Strategy

机译:自适应聚类和全局搜索策略的中文多文档摘要

获取原文
获取原文并翻译 | 示例

摘要

Multi-document summarization has become a key technology in natural language processing. This paper proposes a strategy for Chinese multi-document summarization based on clustering and sentence extraction. As for clustering, we propose two heuristics to automatically detect the proper number of clusters: the first one makes full use of the summary length fixed by the user; the second is a stability method, which has been applied to other unsupervised learning problems. We also discuss a global searching method for sentence selection from the clusters. To evaluate our summarization strategy, an extrinsic evaluation method based on classification task is adopted. Experimental results on news document set show that the new strategy can significantly enhance the performance of Chinese multi-document summarization.
机译:多文档摘要已成为自然语言处理中的关键技术。本文提出了一种基于聚类和句子提取的中文多文档摘要策略。对于聚类,我们提出了两种启发式方法来自动检测适当数量的聚类:第一个充分利用用户确定的汇总长度;第二种是稳定性方法,已应用于其他无监督学习问题。我们还讨论了用于从聚类中选择句子的全局搜索方法。为了评估我们的总结策略,采用了基于分类任务的外部评价方法。在新闻文档集上的实验结果表明,该新策略可以显着提高中文多文档摘要的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号