Shape of Synth to Come: Why We Should Use Synthetic Data for English Surface Realization

机译：Synth的未来形态：为什么我们应该使用合成数据来实现英语表面

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The Surface Realization Shared Tasks of 2018 and 2019 were Natural Language Generation shared tasks with the goal of exploring approaches to surface realization from Universal-Dependency-like trees to surface strings for several languages. In the 2018 shared task there was very little difference in the absolute performance of systems trained with and without additional, synthetically created data, and a new rule prohibiting the use of synthetic data was introduced for the 2019 shared task. Contrary to the findings of the 2018 shared task, we show, in experiments on the English 2018 dataset, that the use of synthetic data can have a substantial positive effect - an improvement of almost 8 BLEU points for a previously state-of-the-art system. We analyse the effects of synthetic data, and we argue that its use should be encouraged rather than prohibited so that future research efforts continue to explore systems that can take advantage of such data.

机译：2018年和2019年的表面实现共享任务是自然语言生成共享任务，目标是探索从通用依赖（如树）到几种语言的表面字符串的表面实现方法。在2018年的共享任务中，使用和不使用额外的合成数据训练的系统的绝对性能几乎没有差异，2019年的共享任务引入了一项新规则，禁止使用合成数据。与2018年共享任务的研究结果相反，我们在2018年英语数据集的实验中表明，使用合成数据可以产生显著的积极影响——对于以前最先进的系统来说，几乎提高了8个BLEU点。我们分析了合成数据的影响，认为应该鼓励而不是禁止使用合成数据，以便未来的研究工作继续探索能够利用此类数据的系统。

著录项

来源
《Annual Meeting of the Association for Computational Linguistics》|2020年|7465-7471|共7页
会议地点
作者
Henry Elder; Robert Burke; Alexander OConnor; Jennifer Foster;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Coarticulatory vowel nasalization in American English: Data of individual differences in acoustic realization of vowel nasalization as a function of prosodic prominence and boundary [J] . Daejin Kim, Sahyang Kim Data in Brief . 2019,第2期

机译：美国英语的独木舟性元音鼻化：作为韵律突出和边界的函数的元音鼻化声学实现的个体差异数据
2. The Analysis and Realization of Motion Compensation for Circular Synthetic Aperture Radar Data [J] . Gaowei Jia, Wenge Chang, Qilei Zhang, Selected Topics in Applied Earth Observations and Remote Sensing, IEEE Journal of . 2016,第7期

机译：圆形合成孔径雷达数据运动补偿的分析与实现
3. Meteorological and oceanographic surface roughness phenomena in the English Channel investigated using ERS synthetic aperture radar and an empirical model of backscatter [J] . Scoon A., Robinson I. Journal of Geophysical Research. Biogeosciences . 2000,第C3期

机译：使用ERS合成孔径雷达和反向散射经验模型研究英吉利海峡的气象和海洋表面粗糙度现象
4. Synthetic Data for English Lexical Normalization: How Close Can We Get to Manually Annotated Data? [C] . Kelly Dekker, Rob van der Goot International Conference on Language Resources and Evaluation . 2020

机译：英语词汇标准化的合成数据：我们可以接近手动注释数据吗？
5. Synthetic approaches to nanoscale shape memory alloys (SMAs) and adhesion properties of composites derived from surface-modified SMAs. [D] . Smith, Nickolaus A. 2005

机译：纳米级形状记忆合金（SMAs）的合成方法以及表面改性SMAs的复合材料的粘合性能。
6. Coarticulatory vowel nasalization in American English: Data of individual differences in acoustic realization of vowel nasalization as a function of prosodic prominence and boundary [O] . Daejin Kim, Sahyang Kim 2019

机译：美国英语中的发音元音鼻音化：元音鼻音化在声学实现中的个体差异作为韵律突触和边界的函数的数据
7. Shape of Synth to Come: Why We Should Use Synthetic Data for English Surface Realization [O] . Henry Elder, Robert Burke, Alexander O’Connor, 2020

机译：合成的形状：为什么我们应该使用合成数据进行英语表面实现
8. Evaluation of Ocean Models Using Observed and Simulated Drifter Trajectories: Impact of Sea Surface Height on Synthetic Profiles for Data Assimilation [R] . Barron, C. N. , Smedstad, L. F. , Dastugue, J. M. , 2007

机译：使用观测和模拟流浪者轨迹评估海洋模型：海面高度对数据同化的合成剖面的影响

Shape of Synth to Come: Why We Should Use Synthetic Data for English Surface Realization

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅