首页> 外国专利> SCALABLE STRING MATCHING AS A COMPONENT FOR UNSUPERVISED LEARNING IN SEMANTIC META-MODEL DEVELOPMENT

SCALABLE STRING MATCHING AS A COMPONENT FOR UNSUPERVISED LEARNING IN SEMANTIC META-MODEL DEVELOPMENT

机译:可伸缩的字符串匹配是语义元模型开发中无监督学习的组件

摘要

A string analysis tool for calculating a similarity metric between a source string and a plurality of target strings. The string analysis tool may include optimizations that may reduce the number of calculations to be carried out when calculating the similarity metric for large volumes of data. In this regard, the string analysis tool may represent strings as features. As such, analysis may be performed relative to features (e.g., of either the source string or plurality of target strings) such that features from the strings may be eliminated from consideration when identifying target strings for which a similarity metric is to be calculated. The elimination of features may be based on a minimum similarity metric threshold, wherein features that are incapable of contributing to a similarity metric above the minimum similarity metric threshold are eliminated from consideration.
机译:字符串分析工具,用于计算源字符串和多个目标字符串之间的相似性度量。字符串分析工具可以包括优化,该优化可以减少在为大量数据计算相似性度量时要执行的计算数量。在这方面,字符串分析工具可以将字符串表示为特征。这样,可以相对于特征(例如,源字符串或多个目标字符串中的任一个)执行分析,以使得当识别要为其计算相似性度量的目标字符串时,可以不考虑来自字符串的特征。特征的消除可以基于最小相似性度量阈值,其中,不能考虑不能有助于高于最小相似性度量阈值的相似性度量的特征。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号