首页> 外国专利> WORD RELATION DATABASE CONSTRUCTING METHOD AND DEVICE, WORD/DOCUMENT PROCESSING METHOD AND DEVICE USING WORD RELATION DATABASE, EXPLANATION EXPRESSION ADEQUACY VERIFYING METHOD, PROGRAMS FOR THESE, STORAGE MEDIUM STORING THEM, WORD SIMILARITY COMPUTING METHOD, WORD GROUPING METHOD, REPRESENTIVE WORD EXTRACTING METHOD, AND WORD CONCEPT HIERARCHIAL METHOD

WORD RELATION DATABASE CONSTRUCTING METHOD AND DEVICE, WORD/DOCUMENT PROCESSING METHOD AND DEVICE USING WORD RELATION DATABASE, EXPLANATION EXPRESSION ADEQUACY VERIFYING METHOD, PROGRAMS FOR THESE, STORAGE MEDIUM STORING THEM, WORD SIMILARITY COMPUTING METHOD, WORD GROUPING METHOD, REPRESENTIVE WORD EXTRACTING METHOD, AND WORD CONCEPT HIERARCHIAL METHOD

机译:单词关系数据库的构建方法和装置,使用单词关系数据库的单词/文档处理方法和装置,说明表达适当性验证方法,用于这些的程序,存储介质存储它们,单词相似性计算方法,单词组方法,单词组方法词概念层次方法

摘要

PPROBLEM TO BE SOLVED: To provide a determination factor for determining propriety of adoption of word information, which is given to an application program by vector notation of a word. PSOLUTION: In the first place, all the lemmas explained by an explanatory sentence are taken out from a dictionary DB 10, and a frequency of the lemmas appearing in each explanatory sentence is computed by an appearance frequency summarization part 11. Then, by means of a direct appearance probability computing part 12, an appearance probability of words in the explanatory sentence for the lemmas is computed from the frequency. By means of an indirect appearance probability computing part 13, it is assumed that the words in the explanatory sentence are indirectly explained by the explanatory sentence, and an appearance probability of the words in an indirect explanation sentence for the lemmas is computed. Then, by a direct/indirect appearance probability computing part 14, the direct and indirect appearance probabilities of the words in the explanatory sentence are added together for finding an appearance probability for all the explanatory words. Finally, by a vector notation storage part 15, the appearance probabilities of all the explanation words are taken out for each lemmas and stored as the vector notation in the word relation database 16. PCOPYRIGHT: (C)2004,JPO
机译:

要解决的问题:提供一个确定因素来确定采用单词信息的适当性,该信息是通过单词的矢量符号给予应用程序的。

解决方案:首先,从词典DB 10中取出由解释性句子解释的所有词条,然后由出现频率汇总部分11计算出现在每个解释性句子中的词条的频率。借助于直接出现概率计算部分12,从频率计算出解释词中解释词中单词出现的概率。借助于间接出现概率计算部分13,假设解释语句间接解释了解释语句中的单词,并且计算了引理的间接解释语句中单词的出现概率。然后,通过直接/间接出现概率计算部分14,将说明句子中单词的直接和间接出现概率相加在一起,以找到所有说明词的出现概率。最后,通过矢量记法存储部分15,针对每个引理取出所有解释词的出现概率,并将其作为矢量记数存储在单词关系数据库16中。

COPYRIGHT:(C)2004,JPO

著录项

  • 公开/公告号JP2004005337A

    专利类型

  • 公开/公告日2004-01-08

    原文格式PDF

  • 申请/专利权人 NIPPON TELEGR & TELEPH CORP NTT;

    申请/专利号JP20020211621

  • 发明设计人 SUZUKI SATOSHI;

    申请日2002-07-19

  • 分类号G06F17/28;G06F17/30;

  • 国家 JP

  • 入库时间 2022-08-21 23:24:01

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号