Domain Adaptation for SMT Using Sentence Weight

机译：使用句子权重的SMT领域自适应

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe a sentence-level domain adaptation translation system, which trained with the sentence-weight model. Our system can take advantage of the domain information in each sentence rather than in the corpus. It is a fine-grained method for domain adaptation. By adding weights which reflect the preference of target domain to the sentences in the training set, we can improve the domain adaptation ability of a translation system. We set up the sentence-weight model depending on the similarity between sentences in the training set and the target domain text. In our method, the similarity is measured by the word frequency distribution. Our experiments on a large-scale Chinese-to-English translation task in news domain validate the effectiveness of our sentence-weight-based adaptation approach, with gains of up to 0.75 BLEU over a non-adapted baseline system.

机译：我们描述了一个句子级的领域适应翻译系统，该系统使用句子权重模型进行训练。我们的系统可以利用每个句子而不是语料库中的域信息。这是一种用于域自适应的细粒度方法。通过将反映目标域偏好的权重添加到训练集中的句子中，我们可以提高翻译系统的域适应能力。我们根据训练集中的句子与目标域文本之间的相似性来建立句子权重模型。在我们的方法中，相似度是通过单词频率分布来衡量的。我们在新闻领域进行的大规模汉英翻译任务的实验验证了我们基于句子权重的适应方法的有效性，与不适应的基准系统相比，该方法的收益高达0.75 BLEU。

著录项

来源
《China national conference on computational linguistics;International symposium on natural language processing based on naturally annotated big data 》|2015年|153-163|共11页
会议地点
作者
Xinpeng Zhou; Hailong Cao; Tiejun Zhao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Domain adaptation; Sentence weight; Statistical machine translation;

机译：领域适应;句子重;统计机器翻译;

相似文献

外文文献
中文文献
专利

1. A Systematic Comparison of Data Selection Criteria for SMT Domain Adaptation [J] . LongyueWang, Derek F.Wong, Lidia S.Chao, ScientificWorldJournal . 2014 ,第3期

机译：SMT域适应数据选择标准的系统比较
2. Neural sentence embedding using only in-domain sentences for out-of-domain sentence detection in dialog systems [J] . Ryu Seonghan, Kim Seokhwan, Choi Junhwi, Pattern recognition letters . 2017 ,第Mara1期

机译：在对话系统中仅使用域内语句进行神经语句嵌入以进行域外语句检测
3. Individual and Domain Adaptation in Sentence Planning for Dialogue [J] . Mairesse F., Prasad R., Stent A., The Journal of Artificial Intelligence Research . 2007 ,第5期

机译：对话句子规划中的个人和领域适应
4. Domain Adaptation for SMT Using Sentence Weight [C] . Xinpeng Zhou, Hailong Cao, Tiejun Zhao China National Conference on Computational Linguistics . 2015

机译：使用句子重量的SMT域改编
5. SMT-Based and Disjunctive Relational Abstract Domains for Static Analysis [D] . Chen, Junjie 2015

机译：基于SMT的析取关系抽象域用于静态分析
6. A Systematic Comparison of Data Selection Criteria for SMT Domain Adaptation [O] . Longyue Wang, Derek F. Wong, Lidia S. Chao, -1

机译：SMT域适配的数据选择标准的系统比较
7. Structured and Unstructured Cache Models for SMT Domain Adaptation [O] . Annie Louis, Bonnie Webber 2015

机译：用于smT域自适应的结构化和非结构化缓存模型

Domain Adaptation for SMT Using Sentence Weight

摘要

著录项

相似文献

相关主题

期刊订阅