首页> 外国专利> System for predicting documents relevant to focus documents by spreading activation through network representations of a linked collection of documents

System for predicting documents relevant to focus documents by spreading activation through network representations of a linked collection of documents

机译:通过链接文档集合的网络表示扩展激活来预测与焦点文档相关的文档的系统

摘要

A system for extracting and analyzing information from a collection of linked documents at a locality to enable categorization of documents and prediction of documents relevant to a focus document. The system obtains and analyzes topology, usage and path information from for a collection at a locality, e.g. a web locality on the world wide web. For categorization, document meta information is represented as document vectors. Predefined criteria is applied to the document vectors to create lists of "similar" types of documents. For relevance prediction, networks representing topology, usage path and text similarity amongst the documents in the collection are created. A spreading activation technique is applied to the networks starting at a focus document to predict the documents relevant to the focus document. Using category and relevance prediction information, tools can be built to enable a user to more efficiently traverse through the collection of linked documents.
机译:一种用于从本地链接文档集合中提取和分析信息以实现文档分类和与焦点文档相关的文档预测的系统。该系统从本地获取并分析拓扑,用途和路径信息,以用于本地的集合。万维网上的网络位置。为了分类,文档元信息被表示为文档向量。将预定义的标准应用于文档向量,以创建“相似”类型文档的列表。为了进行相关性预测,将创建表示拓扑,使用路径和文本在文档集中的相似性的网络。从焦点文档开始,将扩展激活技术应用于网络,以预测与焦点文档相关的文档。使用类别和相关性预测信息,可以构建工具以使用户能够更有效地遍历链接文档的集合。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号