
EFFICIENT PHRASE PAIR EXTRACTION FROM BILINGUAL WORD ALIGNMENTS


Abstract

A method is provided for identifying phrase alignment pairs between a source sentence and a target sentence. Boundaries for a phrase in the source sentence are identified by requiring that a source word be aligned with at least one target word in a target sentence in order to form a boundary for the source phrase. Boundaries for a phrase in the target sentence are identified based on alignments between words in the source phrase and words in the target sentence. The words in the target phrase are examined to determine if any of the words are aligned with source words outside of the source phrase. If they are not aligned with source words outside of the source phrase, the source phrase and target phrase are determined to form an alignment pair and are stored as a phrase alignment pair.
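The steps in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function name `extract_phrase_pairs`, the representation of the alignment as a set of `(source_index, target_index)` links, and the `max_len` cap on source phrase length are all hypothetical choices made for the example.

```python
from typing import List, Set, Tuple


def extract_phrase_pairs(
    source: List[str],
    target: List[str],
    alignment: Set[Tuple[int, int]],  # assumed format: (source_index, target_index)
    max_len: int = 4,                 # hypothetical cap on source phrase length
) -> List[Tuple[str, str]]:
    # Index the links in both directions.
    src_to_tgt = [set() for _ in source]
    tgt_to_src = [set() for _ in target]
    for s, t in alignment:
        src_to_tgt[s].add(t)
        tgt_to_src[t].add(s)

    pairs = []
    for start in range(len(source)):
        # A source word can only be a boundary if it is aligned
        # with at least one target word.
        if not src_to_tgt[start]:
            continue
        for end in range(start, min(start + max_len, len(source))):
            if not src_to_tgt[end]:
                continue
            # Target boundaries: the extreme target positions linked
            # to any word inside the source phrase.
            linked = set().union(*src_to_tgt[start:end + 1])
            t_lo, t_hi = min(linked), max(linked)
            # Reject the candidate if any target word in the span is
            # aligned with a source word outside the source phrase.
            consistent = all(
                start <= s <= end
                for t in range(t_lo, t_hi + 1)
                for s in tgt_to_src[t]
            )
            if consistent:
                pairs.append((
                    " ".join(source[start:end + 1]),
                    " ".join(target[t_lo:t_hi + 1]),
                ))
    return pairs


if __name__ == "__main__":
    src = ["la", "maison", "bleue"]
    tgt = ["the", "blue", "house"]
    links = {(0, 0), (1, 2), (2, 1)}
    for pair in extract_phrase_pairs(src, tgt, links):
        print(pair)
```

In this toy example the candidate "la maison" is rejected, because its target span "the blue house" contains "blue", which is aligned with "bleue" outside the source phrase. Requiring aligned boundary words means source spans that begin or end on an unaligned word are skipped before any target-side check is made.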

Bibliographic details

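  • Publication number: IL195093B

  • Patent type: (not listed)

  • Publication date: 2012-06-28

  • Original format: PDF

  • Applicant/assignee: MICROSOFT CORPORATION

  • Application number: IL195093

  • Inventor(s): (not listed)

  • Filing date: 2008-11-04

  • Classification: (not listed)

  • Country: IL

  • Indexed: 2022-08-21 17:24:44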
