首页> 外文会议>Insternational Joint Conference on Natural Language Processing >Automatic Learning of Parallel Dependency Treelet Pairs
【24h】

Automatic Learning of Parallel Dependency Treelet Pairs

机译:自动学习并行依赖性三圈对

获取原文

摘要

Induction of synchronous grammars from empirical data has long been a unsolved problem; despite generative synchronous grammar theoretically suits the machine translation task very well. This fact is mainly due to pervasive structural divergences between languages. This paper presents a statistical approach that learns dependency structure mappings from parallel corpora. The algorithm introduced in this paper extends the dependency tree word alignment algorithm in (Ding et al., 2003). The new algorithm automatically learns parallel dependency treelet pairs from loosely matched non-isomorphic dependency trees while keeping computational complexity polynomial in the length of the sentences. A set of heuristics is introduced and specifically optimized for parallel treelet learning purposes using Minimum Error Rate training. As learning parallel syntactic structures is the key step in the automatic learning of a synchronous .grammar, the learnt parallel dependency treelet pairs in our approach serve as an important first step of any lexicalized synchronous grammar induction.
机译:诱导经验数据的同步语法长期以来一直是未解决的问题;尽管生成的同步语法理论上非常适合机器翻译任务。这一事实主要是由于语言之间的普及结构分歧。本文提出了一种统计方法,从并行基层中学习依赖结构映射。本文介绍的算法扩展了依赖树字对齐算法(Ding等,2003)。新算法自动从松散匹配的非同义阶依赖树上自动学习并行依赖性三圈对,同时保持句子的长度的计算复杂性多项式。使用最小错误率训练介绍了一组启发式,并专门针对并行三座学习目的进行了优化。作为学习并行句法结构是自动学习同步.crammar的关键步骤,我们的方法中学习的并行依赖性三齿对成对作为任何词汇化同步语法诱导的重要第一步。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号