N2-N2 ]]> in number if the similarity values are symmetric with regard to document pairing. A method and apparatus for generating the data structure are described."/> Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance
首页> 外国专利> Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance

Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance

机译:用于构造紧凑的相似性结构并将其用于分析文档相关性的方法和设备

摘要

A computer-readable medium comprises data structure for providing information about levels of similarity between pairs of N documents. The data structure comprises a plurality of entries of similarity values representing levels of similarity for a plurality of pairs of the documents. Each of the similarity values represents a level of similarity of one document of a given pair relative to the other document of the given pair. The similarity value of each entry is greater than a threshold similarity value that is greater than zero. The plurality of similarity-value entries are fewer than N2−N in number if the similarity values are asymmetric with regard to document pairing, and the plurality of similarity-value entries are fewer than; <math overflow="scroll"><mfrac><mrow><msup><mi>N</mi><mn>2</mn></msup><mo>-</mo><mi>N</mi></mrow><mn>2</mn></mfrac></math> in number if the similarity values are symmetric with regard to document pairing. A method and apparatus for generating the data structure are described.
机译:一种计算机可读介质,包括用于提供关于N个文档对之间的相似度的信息的数据结构。数据结构包括多个相似度值条目,代表多个文档对的相似度。每个相似度值表示给定对的一个文档相对于给定对的另一文档的相似度。每个条目的相似度值大于阈值相似度值,该阈值相似度值大于零。如果相似度值关于文档配对是不对称的,则多个相似度值条目的数量少于N 2- -N,并且多个相似度值条目的数量少于; <![CDATA [<数学溢出=“ scroll”> N 2 -< / mo> N 2 ]]> 如果相似性值关于文档配对是对称的,则为数字。描述了一种用于生成数据结构的方法和设备。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号