首页> 外国专利> Automatic clustering of tokens from a corpus for grammar acquisition

Automatic clustering of tokens from a corpus for grammar acquisition

机译：来自语料库的令牌自动聚类以获取语法

页面导航

摘要
著录项
相似文献

摘要

A method of grammar learning from a corpus comprises, for the other non-context words, generating frequency vectors for each non-context token in a corpus based upon counted occurrences of a predetermined relationship of the non-context tokens to identified context tokens. Clusters are grown from the frequency vectors according to a lexical correlation among the non-context tokens.

机译：从语料库学习语法的方法包括：对于其他非上下文词，基于非上下文标记与所识别的上下文标记的预定关系的已出现次数，为语料库中的每个非上下文标记生成频率向量。根据非上下文标记之间的词汇相关性，从频率向量中增长聚类。

著录项

公开/公告号US7356462B2

专利类型
公开/公告日2008-04-08

原文格式PDF
申请/专利权人 SRINIVAS BANGALORE;GIUSEPPE RICCARDI;
展开▼

申请/专利号US20030662730
发明设计人 SRINIVAS BANGALORE;GIUSEPPE RICCARDI;
展开▼

申请日2003-09-15
分类号G06F17/27;
国家 US
入库时间 2022-08-21 20:09:06

相似文献

专利
外文文献
中文文献