首页> 外国专利> System and method for searching and matching data having ideographic content

System and method for searching and matching data having ideographic content

机译:搜索和匹配具有表意内容的数据的系统和方法

摘要

A method of searching and matching non-phonetic or ideogrammatic input data to stored data, including the steps of receiving input data comprising a search string having a plurality of elements, converting a subset of the elements into a set of terms, generating an optimized plurality of keys from the set of terms, retrieving stored data based on the optimized keys corresponding to most likely candidates for match, and selecting a best match from the plurality of candidates. At least some of the ideogrammatic elements form part of an ideogrammatic writing system. The method may also include dividing the search string into a plurality of overlapping sub-segments and identifying sub-segments having inferred semantic meaning as well as sub-segments having no semantic meaning in the ideogrammatic writing system, and using the various sub-segments to generate the optimized keys.
机译:一种将非语音或表意输入数据与存储的数据进行搜索和匹配的方法,包括以下步骤:接收包括具有多个元素的搜索字符串的输入数据,将这些元素的子集转换为一组术语,生成优化的多个从一组术语中选择一组密钥,基于与最可能匹配候选者相对应的优化密钥检索存储的数据,并从多个候选者中选择最佳匹配。表意文字元素中的至少一些形成表意文字书写系统的一部分。该方法还可以包括:在表意书写系统中,将搜索字符串划分为多个重叠的子段,并识别具有推断的语义的子段以及不具有语义的子段,并使用各种子段来生成优化的密钥。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号