Optimizing Data-Driven Models for Summarization as Parallel Tasks

Zamuda Ales; Lloret Elena


Abstract

This paper tackles a hard optimization problem in computational linguistics, automatic multi-document text summarization, using grid computing. The main challenge of multi-document summarization is to extract the most relevant and unique information effectively and efficiently from a set of topic-related documents, constrained to a specified length. In the Big Data/Text era, where the amount of information increases exponentially, optimization becomes essential in selecting the most representative sentences for generating the best summaries. Therefore, a data-driven summarization model is proposed and optimized during a run of Differential Evolution (DE). Different DE runs are distributed to a grid in parallel as optimization tasks, seeking high processing throughput despite the demanding complexity of the linguistic model, especially on longer multi-documents, where DE improves results given more iterations. Namely, parallelization and the grid enable running several independent DE runs at the same time within a fixed real-time budget. This approach improves a Document Understanding Conference (DUC) benchmark recall metric over a previous setting. (C) 2020 Elsevier B.V. All rights reserved.
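
The idea of running several independent DE runs at the same time and keeping the best result can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the toy sentence data (`SENTENCES`), the length limit (`MAX_WORDS`), the greedy decoding, the plain DE/rand/1/bin variant with its parameter values, and the use of local worker processes as a stand-in for a grid.

```python
import random
from concurrent.futures import ProcessPoolExecutor

# Hypothetical toy data: (relevance score, length in words) per sentence.
SENTENCES = [(0.9, 25), (0.4, 10), (0.7, 18), (0.2, 8),
             (0.8, 30), (0.5, 12), (0.6, 20), (0.3, 9)]
MAX_WORDS = 60  # assumed summary length limit


def decode(genes):
    """Visit sentences in descending gene order, adding each one that still fits."""
    chosen, used = [], 0
    for i in sorted(range(len(genes)), key=lambda i: -genes[i]):
        length = SENTENCES[i][1]
        if used + length <= MAX_WORDS:
            chosen.append(i)
            used += length
    return sorted(chosen)


def fitness(genes):
    """Toy objective: total relevance of the selected sentences."""
    return sum(SENTENCES[i][0] for i in decode(genes))


def de_run(seed, pop_size=20, generations=50, f=0.5, cr=0.9):
    """One independent DE/rand/1/bin run; returns (best fitness, sentence indices)."""
    rng = random.Random(seed)  # each run has its own seeded generator
    dim = len(SENTENCES)
    pop = [[rng.random() for _ in range(dim)] for _ in range(pop_size)]
    fit = [fitness(x) for x in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Three distinct population members, all different from i.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            j_rand = rng.randrange(dim)  # guarantees at least one mutated gene
            trial = [
                pop[a][j] + f * (pop[b][j] - pop[c][j])
                if (rng.random() < cr or j == j_rand) else pop[i][j]
                for j in range(dim)
            ]
            trial_fit = fitness(trial)
            if trial_fit >= fit[i]:  # greedy one-to-one selection
                pop[i], fit[i] = trial, trial_fit
    best = max(range(pop_size), key=lambda i: fit[i])
    return fit[best], decode(pop[best])


def best_of_runs(seeds, map_fn=map):
    """Run one DE per seed through map_fn and keep the best result."""
    return max(map_fn(de_run, seeds))


if __name__ == "__main__":
    # Local processes stand in for grid nodes; the runs share no state.
    with ProcessPoolExecutor() as pool:
        print(best_of_runs(range(8), map_fn=pool.map))
```

Because the runs do not communicate, distributing them only requires a map-like dispatcher: with the default `map_fn=map` the runs execute one after another, and passing `pool.map` spreads them over worker processes. On a real grid, a job scheduler would play that role.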

Bibliographic details

Source
Journal of computational science | 2020, Issue 4 | 101101.1-101101.16 | 16 pages
Authors
Zamuda Ales; Lloret Elena
Author affiliations

Univ Maribor Fac Elect Engn & Comp Sci Koroska Cesta 46 Maribor 2000 Slovenia;

Univ Alicante Dept Software & Comp Syst Apdo Correos 99 E-03080 Alicante Spain;

Original format: PDF
Language: eng
Keywords
Text Summarization; Discrete Optimization; Distributed Computing; Data-Driven Model; Differential Evolution;

Date added to database: 2022-08-18 21:31:47

Similar literature

1. Parallelizing a multi-objective optimization approach for extractive multi-document text summarization [J] . Sanchez-Gomez Jesus M., Vega-Rodriguez Miguel A., Perez Carlos J. Journal of Parallel and Distributed Computing . 2019, Issue Deca
2. Parallelization of the solve phase in a task-based Cholesky solver using a sequential task flow model [J] . Florent Lopez International Journal of High Performance Computing Applications . 2020, Issue 3
3. On-Edge Multi-Task Transfer Learning: Model and Practice With Data-Driven Task Allocation [J] . Qiong Chen, Zimu Zheng, Chuang Hu, Parallel and Distributed Systems, IEEE Transactions on . 2020, Issue 6
4. Efficiency Metrics for Data-Driven Models: A Text Summarization Case Study [C] . Erion Cano, Ondrej Bojar International natural language generation conference . 2019
5. American Sign Language recognition: Reducing the complexity of the task with phoneme-based modeling and parallel hidden Markov models. [D] . Vogler, Christian Philipp. 2003
6. Parallel Workflows for Data-Driven Structural Equation Modeling in Functional Neuroimaging [O] . Sarah Kenny, Michael Andric, Steven M. Boker, 2009
7. Executing Optimized Irregular Applications Using Task Graphs Within Existing Parallel Models [O] . Christopher D. Krieger, Michelle Mills Strout, Jonathan Roelofs, 2013
