首页> 外文期刊>Computational linguistics >Hierarchical Phrase-Based Translation with Weighted Finite-State Transducers and Shallow-n Grammars
【24h】

Hierarchical Phrase-Based Translation with Weighted Finite-State Transducers and Shallow-n Grammars

机译:具有加权有限状态换能器和浅n语法的基于短语的分层翻译

获取原文
       

摘要

In this article we describe HiFST, a lattice-based decoder for hierarchical phrase-based translation and alignment. The decoder is implemented with standard Weighted Finite-State Transducer (WFST) operations as an alternative to the well-known cube pruning procedure. We find that the use of WFSTs rather than k-best lists requires less pruning in translation search, resulting in fewer search errors, better parameter optimization, and improved translation performance. The direct generation of translation lattices in the target language can improve subsequent rescoring procedures, yielding further gains when applying long-span language models and Minimum Bayes Risk decoding. We also provide insights as to how to control the size of the search space defined by hierarchical rules. We show that shallow-n grammars, low-level rule catenation, and other search constraints can help to match the power of the translation system to specific language pairs.
机译:在本文中,我们描述了HiFST,这是一种基于格的解码器,用于基于层次短语的翻译和对齐。解码器通过标准加权有限状态换能器(WFST)操作实现,可以替代众所周知的立方体修剪过程。我们发现使用WFST而不是k-best列表在翻译搜索中需要较少的修剪,从而导致更少的搜索错误,更好的参数优化和改进的翻译性能。以目标语言直接生成翻译晶格可以改善后续的评分过程,在应用大跨度语言模型和最小贝叶斯风险解码时会进一步受益。我们还提供有关如何控制分层规则定义的搜索空间大小的见解。我们表明,浅n语法,低级规则分类和其他搜索约束条件可以帮助将翻译系统的功能与特定语言对进行匹配。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号