首页> 外文会议>International Conference on Natural Language Processing and Chinese Computing >Optimizing Topic Distributions of Descriptions for Image Description Translation
【24h】

Optimizing Topic Distributions of Descriptions for Image Description Translation

机译:图像描述翻译描述描述的优化主题分布

获取原文

摘要

Image Description Translation (IDT) is a task to automatically translate the image captions (i.e., image descriptions) into the target language. Current statistical machine translation (SMT) cannot perform as well as usual in this task because there is lack of topic information provided for translation model generation. In this paper, we focus on acquiring the possible contexts of the captions so as to generate topic models with rich and reliable information. The image matching technique is utilized in acquiring the relevant Wikipedia texts to the captions, including the captions of similar Wikipedia images, the full articles that involve the images and the paragraphs that semantically correspond to the images. On the basis, we go further to approach topic modelling using the obtained contexts. Our experimental results show that the obtained topic information enhances the SMT of image caption, yielding a performance gain of no less than 1% BLUE score.
机译:图像描述转换(IDT)是一种自动将图像标题(即,图像描述)转换为目标语言的任务。当前的统计机器翻译(SMT)不能在此任务中执行和常用,因为缺乏为翻译模型生成提供的主题信息。在本文中,我们专注于获取标题的可能背景,以便生成具有丰富和可靠的信息的主题模型。图像匹配技术用于获取相关的维基百科文本到标题,包括类似维基百科图像的标题,涉及图像的完整文章和语义对应于图像的段落。在此基础上,我们进一步使用所获得的上下文进一步接近主题建模。我们的实验结果表明,获得的主题信息增强了图像标题的SMT,产生了不低于1%的蓝色分数的性能增益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号