首页> 外文会议>Workshop on Asian translation >Integrating empty category detection into Preordering Machine Translation
【24h】

Integrating empty category detection into Preordering Machine Translation

机译:将空类别检测集成到预先定位的机器翻译中

获取原文

摘要

We propose a method for integrating Japanese empty category detection into the preordering process of Japanese-to-English statistical machine translation. First, we apply machine-leaming-based empty category detection to estimate the position and the type of empty categories in the constituent tree of the source sentence. Then, we apply discriminative preordering to the augmented constituent tree in which empty categories are treated as if they are normal lexical symbols. We find that it is effective to filter empty categories based on the confidence of estimation. Our experiments show that, for the IWSLT dataset consisting of short travel conversations, the insertion of empty categories alone improves the BLEU score from 33.2 to 34.3 and the RIBES score from 76.3 to 78.7, which imply that reordering has improved For the KFTT dataset consisting of Wikipedia sentences, the proposed preordering method considering empty categories improves the BLEU score from 19.9 to 20.2 and the RIBES score from 66.2 to 66.3, which shows both translation and reordering have improved slightly.
机译:我们提出了一种将日本空类别检测集成到日本对英语统计机器翻译的预期过程中的方法。首先,我们应用基于机器LEAM的空类别检测来估计源句的组成树中的空型的位置和类型。然后,我们申请判别预审到增强的组成树,其中空的类别被视为正常的词汇符号。我们发现,根据估计的置信度过滤空型是有效的。我们的实验表明,对于由短途旅行谈话组成的IWSLT数据集,单独插入空型从33.2到34.3的BLEU分数提高了76.3到78.7的RIBES得分,这意味着重新排序已经改进了由此组成的KFTT数据集维基百科句子,考虑空的类别的拟议预审方法将从19.9到20.2的BLEU分数提高,9.2至66.3的RIBES得分,这表明平移和重新排序略微改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号