首页> 外国专利> METHOD AND DEVICE FOR NORMALIZING AND DEVELOPING DIFFERENT NOTATION, METHOD AND DEVICE FOR RETRIEVING DOCUMENT BY USING THE METHOD, AND PROGRAM RECORDING MEDIUM

METHOD AND DEVICE FOR NORMALIZING AND DEVELOPING DIFFERENT NOTATION, METHOD AND DEVICE FOR RETRIEVING DOCUMENT BY USING THE METHOD, AND PROGRAM RECORDING MEDIUM

机译:规范化和开发不同符号的方法和装置,使用该方法检索文档的方法和装置以及程序记录介质

摘要

PROBLEM TO BE SOLVED: To realize different notation development capable of preventing the occurrence of wasteful different notation development and retrieval leakage by performing the most proper different notation normalizing processing and different notation developing processing to the KATAKANA (Japanese square syllabary) string of different notation having variability in notation. SOLUTION: This method has a normalization part 122 for extracting the continuous KATAKANA string from a text to obtain normalized notation based on a normalization rule 123a and a different normalization developing part 126 for extracting the continuous KATAKANA string to develop it to the different notation of the normalized notation based on a different notation developing dictionary 127a, and has an entry word cost to which the rule 123a having conversion rules to the normalized notation registered therein or/and a different notation developing dictionary 127a having the different notation to be developed registered therein can be set optionally by each entry word constituting a dividing unit. Thus, the part 122 or/and the part 126 perform normalization processing or/and different notation developing processing to which morpheme analysis of a cost minimizing methods is applied.
机译:要解决的问题:通过对具有不同符号的片假名串执行最适当的不同符号规范化处理和不同符号开发处理,以实现能够防止浪费的不同符号开发和检索泄漏的不同符号开发。符号的可变性。解决方案:该方法具有标准化部分122,用于从文本中提取连续的片假名字符串,以基于标准化规则123a获得标准化的符号;以及不同的标准化开发部分126,用于提取连续的片假名字符串,以将其展开为不同的符号。归一化符号基于不同的符号开发词典127a,并且具有输入词成本,其中具有转换规则注册到其中的归一化符号的规则123a或/和具有要在其中注册的具有不同的符号开发的不同符号发展词典127a由构成除法单元的每个输入字可选地设置。因此,部分122或/和部分126执行归一化处理或/和对其应用了成本最小化方法的词素分析的不同符号表示处理。

著录项

  • 公开/公告号JP2002073656A

    专利类型

  • 公开/公告日2002-03-12

    原文格式PDF

  • 申请/专利权人 RICOH CO LTD;

    申请/专利号JP20000265266

  • 发明设计人 HAYASHI HIROKO;

    申请日2000-09-01

  • 分类号G06F17/30;G06F17/21;

  • 国家 JP

  • 入库时间 2022-08-22 00:59:10

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号