首页> 外文期刊>Machine translation >Exploiting syntactic relationships in a phrase-based decoder: an exploration
【24h】

Exploiting syntactic relationships in a phrase-based decoder: an exploration

机译:在基于短语的解码器中利用句法关系:探索

获取原文
获取原文并翻译 | 示例
       

摘要

Phrase-based decoding is conceptually simple and straightforward to implement, at the cost of drastically oversimplified reordering models. Syntactically aware models make it possible to capture linguistically relevant relationships in order to improve word order, but they can be more complex to implement and optimise. In this paper, we explore a new middle ground between phrase-based and syntactically informed statistical MT, in the form of a model that supplements conventional, non-hierarchical phrase-based techniques with linguistically informed reordering based on syntactic dependency trees. The key idea is to exploit linguistically-informed hierchi-cal structures only for those dependencies that cannot be captured within a single flat phrase. For very local dependencies we leverage the success of conventional phrase-based approaches, which provide a sequence of target-language words appropriately ordered and ready-made with any agreement morphology. Working with dependency trees rather than constituency trees allows us to take advantage of the flexibility of phrase-based systems to treat non-constituent fragments as phrases. We do impose a requirement-that the fragment be a novel sort of "dependency constituent"-on what can be translated as a phrase, but this is much weaker than the requirement that phrases be traditional linguistic constituents, which has often proven too restrictive in MT systems.
机译:基于短语的解码从概念上讲简单易行,但代价是大大简化了重新排序模型。具有句法意识的模型可以捕获与语言相关的关系,以改善单词顺序,但是实现和优化它们可能更加复杂。在本文中,我们以模型的形式探索了基于短语的和句法已知的统计MT之间的新中间地带,该模型以基于语法依赖树的语言学的重新排序补充了传统的,非分层的基于短语的技术。关键思想是仅针对无法在单个平面词组中捕获的依存关系开发语言通知的层级结构。对于非常局部的依存关系,我们利用了传统的基于短语的方法的成功,该方法提供了一系列目标语言单词,这些目标语言单词经过适当排序并可以与任何协议形态一起使用。使用依赖树而不是构成树使我们能够利用基于短语的系统的灵活性,将非构成片段视为短语。我们确实对可翻译为短语的条件强加了一个要求,即该片段是一种新型的“依赖成分”,但这远不如要求短语是传统语言成分的要求弱得多,后者通常被证明过于严格。 MT系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号