首页> 外文期刊>Journal of Parallel and Distributed Computing >Parallelizing a multi-objective optimization approach for extractive multi-document text summarization
【24h】

Parallelizing a multi-objective optimization approach for extractive multi-document text summarization

机译:用于提取多文档文本摘要的并行多目标优化方法

获取原文
获取原文并翻译 | 示例

摘要

Currently, automatic multi-document text summarization is an important task in many fields of knowledge, due to the continuous exponential growth of information on the Internet. Nevertheless, this task is computationally demanding. In the last years, automatic text summarization has been addressed by using multi-objective optimization approaches. In particular, recently, the Multi-Objective Artificial Bee Colony (MOABC) algorithm has obtained very good results. This work focuses on the parallelization of this approach. Several steps have been carried out for this goal. After a time profiling of the algorithm, a runtime comparison has been performed between the use of different random number generators within the algorithm. Then, a parallel implementation of the MOABC algorithm has been designed following its original scheme, in which the main steps are parallelized, and different parallel schedules have been studied and compared. Finally, a second design based on the asynchronous behavior of the bee colony in nature has been implemented and compared. Experiments have been carried out with datasets from Document Understanding Conference (DUC). The results show that the asynchronous design improves greatly the parallel design, being more than 55 times faster with 64 threads than the standard design. An efficiency of 86.72% has been reported for 64 threads. (C) 2019 Elsevier Inc. All rights reserved.
机译:当前,由于互联网上信息的持续指数增长,自动多文档文本摘要是许多知识领域的重要任务。然而,该任务在计算上要求很高。在过去的几年中,通过使用多目标优化方法解决了自动文本摘要问题。特别是最近,多目标人工蜂群算法(MOABC)取得了很好的效果。这项工作专注于这种方法的并行化。为此目的已执行了几个步骤。在对算法进行时间分析后,已在算法内使用不同随机数生成器之间执行了运行时比较。然后,按照其原始方案设计了MOABC算法的并行实现,其中主要步骤并行化,并且研究和比较了不同的并行调度。最后,已经实施并比较了基于自然界中蜂群异步行为的第二种设计。已经使用文档理解会议(DUC)的数据集进行了实验。结果表明,异步设计大大改善了并行设计,使用64个线程的速度比标准设计快55倍以上。已报告64个线程的效率为86.72%。 (C)2019 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号