首页> 外国专利> System, methods, and data structure for quantitative assessment of symbolic associations in natural language

System, methods, and data structure for quantitative assessment of symbolic associations in natural language

机译:用于自然语言中符号关联的定量评估的系统,方法和数据结构

摘要

A method for quantitatively assessing associations between different terms in a natural language includes obtaining a first group of sentences, at least some of which comprise an object name, parsing each of the first group of sentences to identify a subject and a predicate, assigning a sentence type to each of the first group of sentences according to the location of the object name in the sentence, tokenizing each of the first group of sentences to produce a plurality of tokens that includes a jth token, and adding a weighting coefficient to a parameter token_j_count for each sentence that includes the jth token, wherein the weighting coefficient is dependent on the sentence type. A cumulative value of the parameter token_j_count obtained from the first group of sentences is divided by the total number of sentences in the first group to produce an internal association strength for the jth token.
机译:一种用于定量评估自然语言中不同术语之间的关联的方法,包括获取第一组句子,其中至少一部分包含对象名称,解析第一组句子中的每一个以识别主语和谓语,分配一个句子根据句子中对象名称的位置,对第一组句子中的每一个进行键入,对第一组句子中的每一个进行标记,以生成包括第j个标记的多个标记,并将加权系数添加到参数token_j_count对于包括第j个令牌的每个句子,其中加权系数取决于句子类型。从第一组句子中获得的参数token_j_count的累加值除以第一组中句子的总数,以产生第j个令牌的内部关联强度。

著录项

  • 公开/公告号US8380489B1

    专利类型

  • 公开/公告日2013-02-19

    原文格式PDF

  • 申请/专利权人 GUANGSHENG ZHANG;

    申请/专利号US20090631829

  • 发明设计人 GUANGSHENG ZHANG;

    申请日2009-12-05

  • 分类号G06F17/27;

  • 国家 US

  • 入库时间 2022-08-21 16:44:15

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号