首页> 外文会议>9th Workshop on statistical machine translation >Dynamic Topic Adaptation for SMT using Distributional Profiles
【24h】

Dynamic Topic Adaptation for SMT using Distributional Profiles

机译:使用分发配置文件的SMT动态主题适应

获取原文
获取原文并翻译 | 示例

摘要

Despite its potential to improve lexical selection, most state-of-the-art machine translation systems take only minimal contextual information into account. We capture context with a topic model over distributional profiles built from the context words of each translation unit. Topic distributions are inferred for each translation unit and used to adapt the translation model dynamically to a given test context by measuring their similarity. We show that combining information from both local and global test contexts helps to improve lexical selection and outperforms a baseline system by up to 1.15 Bleu. We test our topic-adapted model on a diverse data set containing documents from tfiree different domains and achieve competitive performance in comparison with two supervised domain-adapted systems.
机译:尽管它具有改善词汇选择的潜力,但大多数最新的机器翻译系统仅考虑了最少的上下文信息。我们使用主题模型来捕获上下文,该主题模型是根据每个翻译单元的上下文词构建的分布概要文件。为每个翻译单元推断主题分布,并通过测量它们的相似性来将其动态地适应给定的测试上下文。我们显示,结合来自本地和全局测试上下文的信息有助于改善词汇选择,并且比基准系统的性能最高可达1.15 Bleu。我们在包含来自两个不同领域的文档的多样化数据集上测试了我们的主题自适应模型,并且与两个受监督的领域自适应系统相比,我们获得了竞争优势。

著录项

  • 来源
  • 会议地点 Baltimore MA(US)
  • 作者单位

    School of Informatics, University of Edinburgh;

    School of Informatics, University of Edinburgh;

    School of Informatics, University of Edinburgh,Center for Language and Speech Processing, Johns Hopkins University;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号