Method and system for automatic creation of a thesaurus

Abstract

A method of automatic generation of a digital thesaurus, the method comprising: parsing a digital text and determining a first lexical unit and a second lexical unit; for each entry of the first lexical unit: selecting n-number of sequential units adjacent to the first lexical unit; generating a first context parameter for the first lexical unit, the first context parameter comprising an indication of each unit of the n-number of sequential units and a frequency of co-occurrence of each unit with the first lexical unit in the digital text; for each entry of the second lexical unit: selecting n-number of sequential units adjacent to the second lexical unit; generating a second context parameter for the second lexical unit; determining a lexical unit relation parameter for the first lexical unit and the second lexical unit by an interrelation analysis and an analysis of entry co-occurrence.
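The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and several choices are assumptions the abstract does not specify: lexical units are treated as lowercase word tokens, the n adjacent units are taken on each side of an entry, cosine similarity of the two context parameters stands in for the "interrelation analysis", and the "analysis of entry co-occurrence" is reduced to counting how often the second unit falls inside the first unit's window. The function names `tokenize`, `context_parameter`, `cosine_similarity`, and `relation_parameter` are hypothetical.

```python
from collections import Counter
import math
import re


def tokenize(text):
    """Split text into lowercase word tokens (assumed stand-in for lexical units)."""
    return re.findall(r"[a-z']+", text.lower())


def context_parameter(tokens, unit, n=2):
    """For each entry of `unit`, count the units within n positions on either side.

    The result maps each neighbouring unit to its frequency of co-occurrence
    with `unit` in the text.
    """
    context = Counter()
    for i, tok in enumerate(tokens):
        if tok != unit:
            continue
        left = tokens[max(0, i - n):i]
        right = tokens[i + 1:i + 1 + n]
        context.update(left + right)
    return context


def cosine_similarity(a, b):
    """Cosine similarity of two count vectors (assumed interrelation measure)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def relation_parameter(text, first, second, n=2):
    """Combine context similarity with a direct co-occurrence count."""
    tokens = tokenize(text)
    ctx_first = context_parameter(tokens, first, n)
    ctx_second = context_parameter(tokens, second, n)
    return {
        # How alike the two units' contexts are.
        "context_similarity": cosine_similarity(ctx_first, ctx_second),
        # How often `second` appears within n units of an entry of `first`.
        "entry_cooccurrence": ctx_first[second],
    }
```

Under these assumptions, two units with high context similarity but few direct co-occurrences would be candidates for a synonym-like relation, while units that frequently co-occur would more likely be associated terms. How the two analyses are actually weighted or thresholded is not stated in the abstract.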