
FUZZY SEARCH METHOD OF SEARCH CHARACTER STRING INCLUDING A PLURALITY OF WORDS


Abstract

PROBLEM TO BE SOLVED: When an electronic document written in English is searched for a character string, exact search, prefix-matching search, and suffix-matching search cannot return an appropriate extraction result if the input character string or the searched character string, each comprising a plurality of words and space characters, includes spelling mistakes such as wrong characters or numerous omissions.

SOLUTION: A part consisting of several consecutive words is cut out of an input character string that includes a plurality of words and space characters, and one or more substitute characters (wildcards) are assigned to two or more consecutive characters, excluding space characters and tab characters, to create a search key character string. Fuzzy search is then performed by applying regular-expression pattern matching to the wildcard parts and the other parts of the search key character string, so that likely matches are extracted even when the searched character string contains a plurality of spelling mistakes.

COPYRIGHT: (C)2010, JPO&INPIT
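The steps in the SOLUTION can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say which characters receive wildcards or how many words are cut out, so the rule used here (keep the first two characters of each word, replace the rest with a wildcard when at least two characters would be replaced) and the two-word window are assumptions. The names `word_to_pattern`, `make_search_keys`, and `fuzzy_search` are hypothetical.

```python
import re

# Wildcard standing in for a run of characters other than space and tab.
WILDCARD = r"[^ \t]*"


def word_to_pattern(word, keep=2):
    """Keep the first `keep` characters and replace the rest with a wildcard.

    Assumption: the wildcard is used only when it replaces two or more
    consecutive characters; shorter words are matched literally.
    """
    if len(word) - keep < 2:
        return re.escape(word)
    return re.escape(word[:keep]) + WILDCARD


def make_search_keys(query, window=2, keep=2):
    """Cut out each run of `window` consecutive words and build a regex key."""
    words = query.split()
    if not words:
        return []
    keys = []
    for i in range(max(1, len(words) - window + 1)):
        part = words[i:i + window]
        # Words are rejoined with a pattern for one or more spaces or tabs.
        keys.append(r"[ \t]+".join(word_to_pattern(w, keep) for w in part))
    return keys


def fuzzy_search(query, text, window=2, keep=2):
    """Return every substring of `text` matched by any search key."""
    hits = []
    for key in make_search_keys(query, window, keep):
        for match in re.finditer(key, text, flags=re.IGNORECASE):
            hits.append(match.group(0))
    return hits


if __name__ == "__main__":
    document = "This uses regular expression matching for search."
    misspelled = "regluar expresion matching"
    print(misspelled in document)              # exact search fails
    print(fuzzy_search(misspelled, document))  # wildcard keys still match
```

Keeping only a short literal prefix makes each key tolerant of errors later in the word, at the cost of more false positives; a stricter variant could also keep the last character of each word.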
