首页> 外国专利> System for finding and identifying documents with similar content, especially for use with a web search system, identifies cyclical reference paths formed by links between documents originating from a reference document

System for finding and identifying documents with similar content, especially for use with a web search system, identifies cyclical reference paths formed by links between documents originating from a reference document

机译:用于查找和识别具有相似内容的文档的系统,尤其是与Web搜索系统一起使用的系统,用于识别由参考文档来源的文档之间的链接形成的周期性参考路径

摘要

System for finding and identifying documents with similar content that are connected by links, especially hypertext links, with at least an evaluation unit (41) that automatically traces cyclical, i.e. circular closed reference paths formed from links between a number of documents, reference paths originating from the reference document in a rule-based manner. The system then classifies such traced documents with cyclical reference paths as having similar content. An independent claim is made for a method for finding and identifying documents with similar content.
机译:用于查找和识别具有相似内容的文档的系统,该文档通过链接(尤其是超文本链接)与至少一个评估单元(41)相连,该评估单元自动跟踪循环(即由多个文档之间的链接形成的圆形封闭参考路径),这些参考路径源自以基于规则的方式从参考文档中删除。然后,系统将具有周期性参考路径的此类跟踪文档分类为具有相似内容。对用于查找和识别具有相似内容的文档的方法提出了独立的主张。

著录项

相似文献

  • 专利
  • 外文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号