首页> 外文期刊>Expert Systems with Application >Experimental analysis of multiple criteria for extractive multi-document text summarization
【24h】

Experimental analysis of multiple criteria for extractive multi-document text summarization

机译:提取多文档文本摘要的多个条件的实验分析

获取原文
获取原文并翻译 | 示例
       

摘要

Automatic text summarization methods are increasingly needed in different fields of knowledge. In the scientific literature, generic extractive multi-document text summarization can be formulated as an optimization problem which involves several criteria. Only two criteria have been considered simultaneously, i.e., content coverage and redundancy reduction, whereas the other ones, relevance and coherence have been considered separately. Therefore, there is a lack of studies comparing the performance of different criteria. For this reason, a comparative study of the different criteria suitable for generic extractive multi document text summarization is performed here. All possible combinations of two, three, and four criteria have been considered within a multi-objective optimization context. Experiments have been carried out based on datasets from Document Understanding Conferences (DUC), and the combinations of objective functions have been compared and evaluated with Recall-Oriented Understudy for Gisting Evaluation (ROUGE) metrics. Redundancy reduction has been demonstrated as an indispensable criterion, being the coherence the least significant and efficient criterion. The combination that includes content coverage, redundancy reduction, and relevance obtains the most balanced results in terms of average ROUGE and execution time. (C) 2019 Elsevier Ltd. All rights reserved.
机译:在不同的知识领域,越来越需要自动文本摘要方法。在科学文献中,通用的提取性多文档文本摘要可以表述为涉及几个标准的优化问题。仅同时考虑了两个标准,即内容覆盖率和冗余减少,而其他标准则分别考虑了相关性和一致性。因此,缺乏比较不同标准的性能的研究。由于这个原因,这里对适用于通用提取性多文档文本摘要的不同标准进行了比较研究。在多目标优化上下文中考虑了两个,三个和四个条件的所有可能组合。实验是基于文档理解会议(DUC)的数据集进行的,并且已将目标函数的组合与面向召回评估的迷信评估(ROUGE)指标进行了比较和评估。减少冗余已被证明是必不可少的标准,因为一致性是最不重要和最有效的标准。就平均ROUGE和执行时间而言,内容覆盖范围,冗余减少和相关性的组合获得了最平衡的结果。 (C)2019 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号