首页> 外文OA文献 >Enhancing diversity and coverage of document summaries through subspace clustering and clustering-based optimization
【2h】

Enhancing diversity and coverage of document summaries through subspace clustering and clustering-based optimization

机译:通过子空间聚类和基于聚类的优化来增强文档摘要的多样性和覆盖范围

摘要

Sentence clustering has been successfully applied in document summarization to discover the topics conveyed in a collection of documents. However, existing clustering-based summarization approaches are seldom targeted for both diversity and coverage of summaries, which are believed to be the two key issues to determine the quality of summaries. The focus of this work is to explore a systematic approach that allows diversity and coverage to be tackled within an integrated clustering-based summarization framework. Given the fact that normally each topic can be described by a set of keywords and the choice of the keywords among the topics is topic-dependent, we take the advantage of the newly emerged subspace clustering to enable the flexibility of keyword selection and the improved quality of sentence clustering. On this basis, we develop two clustering-based optimization strategies, namely local optimization and global optimization to pursue our targets. Experimental results on the DUC datasets demonstrate effectiveness and robustness of the proposed approach.
机译:句子聚类已成功应用于文档摘要中,以发现文档集合中传达的主题。但是,现有的基于聚类的摘要方法很少同时针对摘要的多样性和覆盖范围,这被认为是确定摘要质量的两个关键问题。这项工作的重点是探索一种系统的方法,该方法允许在基于聚类的集成摘要框架内解决多样性和覆盖范围。鉴于通常每个主题都可以由一组关键字来描述并且主题之间的关键字选择取决于主题这一事实,我们利用新出现的子空间聚类的优势来实现关键字选择的灵活性和改进的质量句子聚类。在此基础上,我们开发了两种基于聚类的优化策略,即局部优化和全局优化以追求目标。 DUC数据集上的实验结果证明了该方法的有效性和鲁棒性。

著录项

  • 作者

    Cai XY; Li WJ; Zhang RX;

  • 作者单位
  • 年度 2014
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号