首页> 外国专利> induction of grammatical rules

induction of grammatical rules

机译:语法规则的归纳

摘要

A method of grammar rule induction comprises obtaining a monolingual set of phrases from a bilingual corpus of translation pairs. For each of the monolingual phrases in turn, initialising, with inactive edges formed from headwords identified in the phrase, the agenda of a dependency grammar chart parser arranged to form packed edges in the chart. Running the chart parser and adding to the agenda, for each inactive edge removed from the agenda, one or more active edges created as if all possible grammar rules existed. When the agenda is empty, ascertaining the alternations of each edge in the packed edge corresponding to the complete phrase, and finding their respective highest frequencies. For the set of phrases, summing, for each alternation, its respective highest frequencies, and ranking the sums. Then, selecting alternations in rank order to form the required set of grammar rules until the required set has become sufficient such that for each monolingual phrase there exists at least one analysis corresponding to the required set of grammar rules.
机译:语法规则归纳的方法包括从翻译对的双语语料库获得单语短语组。依次对每个单语短语进行初始化,并使用从短语中标识的headwords形成的非活动边缘,将依存语法图表解析器的议程安排为在图表中形成打包的边缘。运行图表解析器并将其添加到议程中,对于从议程中删除的每个非活动边,将创建一个或多个活动边,就好像存在所有可能的语法规则一样。当议程为空时,确定对应于完整短语的打包边中每个边的交替,并找到它们各自的最高频率。对于这组短语,对于每个交替,求和其各自的最高频率,并对总和进行排名。然后,按等级顺序选择交替以形成所需的语法规则集合,直到所需的集合已经足够,从而对于每个单语短语,存在至少一个与所需的语法规则集合相对应的分析。

著录项

  • 公开/公告号DE602005015561D1

    专利类型

  • 公开/公告日2009-09-03

    原文格式PDF

  • 申请/专利权人 BRITISH TELECOMMUNICATIONS P.L.C.;

    申请/专利号DE20056015561T

  • 发明设计人 APPLEBY STEPHEN CLIFFORD;

    申请日2005-03-17

  • 分类号G06F17/28;G06F17/27;

  • 国家 DE

  • 入库时间 2022-08-21 19:08:07

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号