【24h】

Fast Concept Mention Grouping for Concept Map-based Multi-Document Summarization

机译:基于概念图的多文档摘要的快速概念提及分组

获取原文

摘要

Concept map-based multi-document summarization has recently been proposed as a variant of the traditional summarization task with graph-structured summaries. As shown by previous work, the grouping of coreferent concept mentions across documents is a crucial subtask of it. However, while the current state-of-the-art method suggested a new grouping method that was shown to improve the summary quality, its use of pairwise comparisons leads to polynomial runtime complexity that prohibits the application to large document collections. In this paper, we propose two alternative grouping techniques based on locality sensitive hashing, approximate nearest neighbor search and a fast clustering algorithm. They exhibit linear and log-linear runtime complexity, making them much more scalable. We report experimental results that confirm the improved runtime behavior while also showing that the quality of the summary concept maps remains comparable.'
机译:最近,基于概念图的多文档摘要已提出,是具有图结构摘要的传统摘要任务的一种变体。如先前的工作所示,跨文档的核心概念提及的分组是其关键的子任务。但是,尽管当前的最新方法建议使用一种新的分组方法来提高汇总质量,但其成对比较的使用会导致多项式运行时复杂性,从而使该方法无法应用于大型文档集合。在本文中,我们提出了两种基于局部敏感哈希的替代分组技术:近似最近邻搜索和快速聚类算法。它们表现出线性和对数线性的运行时复杂性,使其更具可伸缩性。我们报告的实验结果证实了改进的运行时行为,同时还表明摘要概念图的质量仍然可比。”

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号