首页> 外文会议>Second conference on machine translation >Automatic Threshold Detection for Data Selection in Machine Translation

【24h】

Automatic Threshold Detection for Data Selection in Machine Translation

机译：机器翻译中数据选择的自动阈值检测

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present in this paper the participation of the University of Hamburg in the Biomedical Translation Task of the Second Conference on Machine Translation (WMT 2017). Our contribution lies in adopting a new direction for performing data selection for Machine Translation via Paragraph Vector and a Feed Forward Neural Network Classifier. Continuous distributed vector representations of the sentences are used as features for the binary classifier. Most approaches in data selection rely on scoring and ranking general domain sentences with respect to their similarity to the in-domam and setting a range of thresholds for selecting a percentage of them for training various MT systems. The novelty of our method consists in developing an automatic threshold detection paradigm for data selection which provides an efficient and simple way for selecting the most similar sentences to the m-domain. Encouraging results are obtained using this approach for seven language pairs and four data sets.

机译：我们在本文中介绍汉堡大学参加第二届机器翻译会议（WMT 2017）的生物医学翻译任务。我们的贡献在于采用新的方向来执行通过段落矢量和前馈神经网络分类器进行机器翻译的数据选择。句子的连续分布矢量表示用作二进制分类器的功能。数据选择中的大多数方法都依赖于对通用域句子与内部相似度的评分和排名，并设置阈值范围以选择百分比以训练各种MT系统。我们方法的新颖性在于开发一种用于数据选择的自动阈值检测范例，该范例提供了一种有效且简单的方法来选择与m域最相似的句子。使用这种方法获得的令人鼓舞的结果是七个语言对和四个数据集。

著录项

来源
《Second conference on machine translation 》|2017年|483-488|共6页
会议地点 Copenhagen(DK)
作者
Mirela-Stefania Duma; Wolfgang Menzel;
展开▼
作者单位

University of Hamburg Natural Language Systems Division;

University of Hamburg Natural Language Systems Division;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Anomaly detection in earth dam and levee passive seismic data using support vector machines and automatic feature selection [J] . Fisher Wendy D., Camp Tracy K., Krzhizhanovskaya Valeria V. Journal of computational science . 2017 ,第May期

机译：使用支持向量机和自动特征选择对大坝和堤坝被动地震数据进行异常检测
2. Patent Issued for Model Weighting, Selection and Hypotheses Combination for Automatic Speech Recognition and Machine Translation [J] . Robotics and Machine Learning . 2012 ,第41期

机译：为自动语音识别和机器翻译提供模型加权，选择和假设组合的专利
3. Data analysis with Shapley values for automatic subject selection in Alzheimer’s disease data sets using interpretable machine learning [J] . Bloch Louise, Friedrich Christoph M. Alzheimer s Research & Therapy . 2021 ,第1期

机译：使用可解释机学习，在阿尔茨海默病的疾病数据集中自动主题选择的数据分析
4. Automatic Threshold Detection for Data Selection in Machine Translation [C] . Mirela-Stefania Duma, Wolfgang Menzel Conference on machine translation . 2017

机译：机器翻译中数据选择的自动阈值检测
5. Data analysis and selection for statistical machine translation. [D] . Eetemadi, Sauleh. 2016

机译：用于统计机器翻译的数据分析和选择。
6. Semi-automatic carotid intraplaque hemorrhage detection and quantification on Magnetization-Prepared Rapid Acquisition Gradient-Echo (MP-RAGE) with optimized threshold selection [O] . Jin Liu, Niranjan Balu, Daniel S. Hippe, 2016

机译：半自动颈动脉斑块内出血的检测和定量优化磁化的快速采集梯度回波（MP-RAGE）的阈值选择
7. Automatic Threshold Detection for Data Selection in Machine Translation [O] . Mirela-Stefania Duma, Wolfgang Menzel 2017

机译：机器翻译中数据选择的自动阈值检测

Automatic Threshold Detection for Data Selection in Machine Translation

摘要

著录项

相似文献

相关主题

期刊订阅