首页> 外国专利> Comparison of documents based on similarity measures

Comparison of documents based on similarity measures

机译:基于相似性度量的文档比较

摘要

Techniques for comparing a first document to one or more second documents are provided. At least one weight is assigned to one or more elements in the first document. A weighted document is generated in accordance with the at least one assigned weight. One or more comparison scores are computed by comparing each of the one or more elements in the first document to each of one or more elements in a given second document in accordance with one or more comparison rules. The one or more comparison rules determine if a given element in the first document and a given element in the given second document are compared using one or more language hierarchies and/or one or more similarity ranges. A similarity score is generated in accordance with the generated weighted document and the one or more computed comparison scores. The one or more second documents are retrieved in accordance with the generated similarity score.
机译:提供了用于将第一文档与一个或多个第二文档进行比较的技术。至少一个权重被分配给第一文档中的一个或多个元素。根据至少一个分配的权重生成加权文件。通过根据一个或多个比较规则将第一文档中的一个或多个元素中的每个元素与给定第二文档中的一个或多个元素中的每个元素进行比较来计算一个或多个比较分数。一个或多个比较规则确定是否使用一个或多个语言层次结构和/或一个或多个相似性范围来比较第一文档中的给定元素和给定第二文档中的给定元素。根据生成的加权文档和一个或多个计算出的比较分数来生成相似度分数。根据所生成的相似性分数来检索一个或多个第二文档。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号