Method and apparatus for constructing a compact similarity structure, and method of using the compact similarity structure to analyze document relevance

Abstract

A computer-readable medium comprises a data structure that provides information about levels of similarity between pairs of N documents. The data structure comprises a plurality of similarity-value entries, each representing the level of similarity of one document of a given pair relative to the other document of that pair. The similarity value of each entry is greater than a threshold similarity value, which is itself greater than zero. The entries number fewer than $N^2 - N$ if the similarity values are asymmetric with regard to document pairing, and fewer than $(N^2 - N)/2$ if the similarity values are symmetric. A method and apparatus for generating the data structure are also described.
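As a rough illustration of the thresholded structure the abstract describes, the sketch below stores only those document pairs whose similarity exceeds a positive threshold. It is not the patented method: the function name `build_compact_similarity`, the dictionary layout, the word-overlap (Jaccard) measure, and the sample documents are all assumptions chosen for the example. The stored entry count is at most $(N^2 - N)/2$ in the symmetric case (or $N^2 - N$ in the asymmetric case) and falls strictly below that bound as soon as any pair scores at or below the threshold.

```python
from typing import Callable, Dict, Sequence, Tuple


def jaccard(a: str, b: str) -> float:
    """Symmetric word-overlap similarity (an assumed stand-in measure)."""
    sa, sb = set(a.split()), set(b.split())
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


def build_compact_similarity(
    docs: Sequence[str],
    similarity: Callable[[str, str], float],
    threshold: float,
    symmetric: bool = True,
) -> Dict[Tuple[int, int], float]:
    """Keep only pairs (i, j), i != j, whose similarity exceeds the threshold.

    Symmetric measures store each unordered pair once, as (i, j) with i < j;
    asymmetric measures store ordered pairs.
    """
    if threshold <= 0:
        raise ValueError("threshold must be greater than zero")
    entries: Dict[Tuple[int, int], float] = {}
    n = len(docs)
    for i in range(n):
        # For a symmetric measure, visiting j > i covers every unordered pair.
        for j in range(i + 1 if symmetric else 0, n):
            if i == j:
                continue
            value = similarity(docs[i], docs[j])
            if value > threshold:
                entries[(i, j)] = value
    return entries


# Hypothetical sample documents, for illustration only.
docs = [
    "compact similarity structure",
    "similarity structure for documents",
    "weather forecast for tomorrow",
    "compact document index",
]
entries = build_compact_similarity(docs, jaccard, threshold=0.15)
n = len(docs)
print(len(entries), "entries stored; symmetric bound is", (n * n - n) // 2)
```

Keying a dictionary by index pairs keeps the sketch short; a real system holding many documents would more likely use per-document adjacency lists or a sparse matrix, which is a design choice outside what the abstract specifies.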
