
Complexity-Weighted Loss and Diverse Reranking for Sentence Simplification

Abstract

Sentence simplification is the task of rewriting texts so they are easier to understand. Recent research has applied sequence-to-sequence (Seq2Seq) models to this task, focusing largely on training-time improvements via reinforcement learning and memory augmentation. One of the main problems with applying generic Seq2Seq models to simplification is that these models tend to copy directly from the original sentence, resulting in outputs that are relatively long and complex. We aim to alleviate this issue through two main techniques. First, we incorporate content word complexities, as predicted with a leveled word complexity model, into our loss function during training. Second, we generate a large set of diverse candidate simplifications at test time, and rerank these to promote fluency, adequacy, and simplicity. Here, we measure simplicity through a novel sentence complexity model. These extensions allow our models to perform competitively with state-of-the-art systems while generating simpler sentences. We report standard automatic and human evaluation metrics.
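The complexity-weighted loss can be illustrated with a minimal sketch: weight each target token's negative log-likelihood by its predicted simplicity, so that simpler vocabulary contributes more to the training signal. The weighting function and the `alpha` scaling factor here are illustrative assumptions, not the paper's exact formulation.

```python
import math

def complexity_weighted_loss(token_probs, complexities, alpha=2.0):
    """Sketch of a complexity-weighted negative log-likelihood.

    token_probs  : model probability assigned to each gold target token
    complexities : per-token complexity score in [0, 1], as would come
                   from a leveled word complexity model (0 = simple)
    alpha        : hypothetical scaling factor (an assumption, not from
                   the paper)

    Simpler target words receive larger weights, nudging the model
    toward simple vocabulary during training.
    """
    weights = [1.0 + alpha * (1.0 - c) for c in complexities]
    nll = [-w * math.log(p) for w, p in zip(weights, token_probs)]
    return sum(nll) / len(nll)
```

For the same token probability, a simple word (complexity 0) is penalized more heavily than a complex one (complexity 1), so the optimizer gains more by getting simple words right.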

