Measuring and Increasing Context Usage in Context-Aware Machine Translation

Abstract

Recent work in neural machine translation has demonstrated both the necessity and feasibility of using inter-sentential context, that is, context from sentences other than those currently being translated. However, while many current methods present model architectures that can theoretically use this extra context, it is often unclear how much they actually utilize it at translation time. In this paper, we introduce a new metric, conditional cross-mutual information, to quantify the usage of context by these models. Using this metric, we measure how much document-level machine translation systems use particular varieties of context. We find that target context is referenced more than source context, and that conditioning on a longer context has diminishing returns. We then introduce a new, simple training method, context-aware word dropout, to increase the usage of context by context-aware models. Experiments show that our method increases context usage, and that this is reflected in translation quality according to metrics such as BLEU and COMET, as well as in performance on contrastive datasets for anaphoric pronoun resolution and lexical cohesion.
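As a rough illustration of the two ideas named in the abstract (a sketch, not the authors' implementation), conditional cross-mutual information can be estimated as the average difference in per-token log-probability between a context-aware model and a context-agnostic one scored on the same references, and context-aware word dropout amounts to randomly masking tokens of the current sentence during training so the model must lean on surrounding context. All names, the dropout rate, the mask symbol, and the log-probability values below are hypothetical.

```python
import random

def cxmi(logp_context_aware, logp_context_agnostic):
    """Estimate conditional cross-mutual information (CXMI) as the mean
    per-token difference in log-probability between a context-aware model
    and a context-agnostic one on the same reference tokens.
    A larger value suggests heavier reliance on context."""
    assert len(logp_context_aware) == len(logp_context_agnostic)
    diffs = [a - b for a, b in zip(logp_context_aware, logp_context_agnostic)]
    return sum(diffs) / len(diffs)

def context_aware_word_dropout(tokens, p=0.1, mask="<mask>"):
    """Training-time sketch: randomly replace tokens of the current source
    sentence with a mask symbol, pushing the model to use inter-sentential
    context instead. The rate `p` and mask token are illustrative."""
    return [mask if random.random() < p else tok for tok in tokens]

# Hypothetical per-token log-probs for one target sentence under each model:
lp_aware = [-1.2, -0.8, -0.5]
lp_agnostic = [-1.5, -1.0, -0.9]
print(round(cxmi(lp_aware, lp_agnostic), 3))  # positive when context helps

print(context_aware_word_dropout(["the", "cat", "sat"], p=1.0))
```

In practice the log-probabilities would come from two trained translation models evaluated on held-out documents; the point of the sketch is only the arithmetic of the estimator and the shape of the dropout operation.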