Conference of the Association for Machine Translation in the Americas

Context Models for OOV Word Translation in Low-Resource Languages



Abstract

Out-of-vocabulary word translation is a major problem for the translation of low-resource languages that suffer from a lack of parallel training data. This paper evaluates the contributions of target-language context models towards the translation of OOV words, specifically in those cases where OOV translations are derived from external knowledge sources, such as dictionaries. We develop both neural and non-neural context models and evaluate them within both phrase-based and self-attention based neural machine translation systems. Our results show that neural language models that integrate additional context beyond the current sentence are the most effective in disambiguating possible OOV word translations. We present an efficient second-pass lattice-rescoring method for wide-context neural language models and demonstrate performance improvements over state-of-the-art self-attention based neural MT systems in five out of six low-resource language pairs.
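The abstract's core idea, choosing among dictionary-derived OOV translations with a target-language context model in a second rescoring pass, can be illustrated with a small sketch. Here the lattice is reduced to independent OOV slots with candidate fillers, and `context_lm_logprob` stands in for any language model that scores a target sentence given preceding target-side context; the function, the placeholder template, and the data are illustrative assumptions, not the authors' implementation.

```python
from itertools import product
from typing import Callable, Dict, List, Sequence, Tuple

def rescore_oov_candidates(
    template: str,
    slots: Dict[str, List[str]],
    context: Sequence[str],
    context_lm_logprob: Callable[[Sequence[str], str], float],
) -> Tuple[str, float]:
    """Second-pass rescoring sketch: fill every OOV slot in `template`
    with one dictionary candidate, score the completed sentence with a
    context-aware LM, and keep the best-scoring combination.

    template           -- first-pass MT output with placeholders, e.g.
                          "the {oov1} signed the treaty"
    slots              -- placeholder name -> dictionary translation candidates
    context            -- previously translated target sentences (wide context)
    context_lm_logprob -- log P(sentence | context); any such scorer works
    """
    slot_names = list(slots)
    best_sentence, best_score = "", float("-inf")
    # Enumerate the (small) cross-product of candidates; a real lattice
    # rescorer would walk lattice paths and prune with a beam instead.
    for combo in product(*(slots[name] for name in slot_names)):
        sentence = template.format(**dict(zip(slot_names, combo)))
        score = context_lm_logprob(context, sentence)
        if score > best_score:
            best_sentence, best_score = sentence, score
    return best_sentence, best_score

# Toy scorer standing in for a wide-context neural LM: it simply rewards
# candidates whose words also appear in the preceding context.
def toy_scorer(context: Sequence[str], sentence: str) -> float:
    ctx_words = set(" ".join(context).lower().split())
    return float(sum(w.lower() in ctx_words for w in sentence.split()))

if __name__ == "__main__":
    best, score = rescore_oov_candidates(
        "the {oov1} signed the treaty",
        {"oov1": ["president", "presidency"]},
        context=["The president arrived yesterday."],
        context_lm_logprob=toy_scorer,
    )
    print(best, score)
```

Swapping `toy_scorer` for a neural language model that conditions on previous sentences is what turns this into the wide-context rescoring the abstract describes; the enumeration itself stays the same.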
