首页> 外国专利> Automatic clustering of tokens from a corpus for grammar acquisition

Automatic clustering of tokens from a corpus for grammar acquisition

机译:来自语料库的令牌自动聚类以获取语法

摘要

A method of grammar learning from a corpus comprises, for the other non-context words, generating frequency vectors for each non-context token in a corpus based upon counted occurrences of a predetermined relationship of the non-context tokens to identified context tokens. Clusters are grown from the frequency vectors according to a lexical correlation among the non-context tokens.
机译:从语料库学习语法的方法包括:对于其他非上下文词,基于非上下文标记与所识别的上下文标记的预定关系的已出现次数,为语料库中的每个非上下文标记生成频率向量。根据非上下文标记之间的词汇相关性,从频率向量中增长聚类。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号