Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low-Resource Conditions

Abstract

Following the WMT 2018 Shared Task on Parallel Corpus Filtering (Koehn et al., 2018), we posed the challenge of assigning sentence-level quality scores for very noisy corpora of sentence pairs crawled from the web, with the goal of sub-selecting 2% and 10% of the highest-quality data to be used to train machine translation systems. This year, the task tackled the low-resource conditions of Nepali-English and Sinhala-English. Eleven participants from companies, national research labs, and universities took part in this task.

Bibliographic details

Source
Conference on Machine Translation; Annual Meeting of the Association for Computational Linguistics | 2019 | pp. 54-72 | 19 pages
Authors
Philipp Koehn; Francisco Guzman; Vishrav Chaudhary; Juan Pino
Original format: PDF

Similar literature

1. Filtered Pseudo-parallel Corpus Improves Low-resource Neural Machine Translation [J]. Imankulova Aizhan, Sato Takayuki, Komachi Mamoru. ACM Transactions on Asian and Low-Resource Language Information Processing, 2020, No. 2.
2. Pseudotext Injection and Advance Filtering of Low-Resource Corpus for Neural Machine Translation [J]. Michael Adjeisah, Guohua Liu, Douglas Omwenga Nyabuga. Computational Intelligence and Neuroscience, 2021, No. a.
3. SUPERGLUE: A SHARED MEMORY FRAMEWORK USING DATA VERSIONING FOR DEPENDENCY-AWARE TASK-BASED PARALLELIZATION [J]. Tillenius Martin. SIAM Journal on Scientific Computing, 2015, No. 6.
4. Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low-Resource Conditions [C]. Philipp Koehn, Francisco Guzman, Vishrav Chaudhary. Conference on Machine Translation, 2019.
5. Enhancing team performance in adverse conditions: The role of shared team mental models and team training on an interdependent task [D]. Minionis, David Philip. 1995.
6. Pseudotext Injection and Advance Filtering of Low-Resource Corpus for Neural Machine Translation [O]. Michael Adjeisah, Guohua Liu, Douglas Omwenga Nyabuga. 2021.
7. Webinterpret Submission to the WMT2019 Shared Task on Parallel Corpus Filtering [O]. Jesús González-Rubio. 2019.
