首页> 外国专利> System and method of finding documents related to other documents and of finding related words in response to a query to refine a search

System and method of finding documents related to other documents and of finding related words in response to a query to refine a search

机译:查找与其他文档有关的文档并响应于查询来查找相关词以细化搜索的系统和方法

摘要

A computer-implemented system and method is disclosed for retrieving documents using context-dependant probabilistic modeling of words and documents. The present invention uses multiple overlapping vectors to represent each document. Each vector is centered on each of the words in the document and includes the local environment. The vectors are used to build probability models that are used for predictions of related documents and related keywords. The results of the statistical analysis are used for retrieving an indexed document, for extracting features from a document, or for finding a word within a document. The statistical evaluation is also used to evaluate the probability of relation between the key words appearing in the document and building a vocabulary of key words that are generally found together. The results of the analysis are stored in a repository. Searches of the data repository produce a list of related documents and a list of related terms. The user may select from the list of documents and/or from the list of related terms to refine the search and retrieve those documents which meet the search goal of the user with a minimum of extraneous data.
机译:公开了一种计算机实现的系统和方法,用于使用单词和文档的上下文相关概率建模来检索文档。本发明使用多个重叠矢量来表示每个文档。每个向量都以文档中每个单词为中心,并包含本地环境。向量用于建立概率模型,该概率模型用于相关文档和相关关键字的预测。统计分析的结果用于检索索引文档,从文档中提取特征或在文档中查找单词。统计评估还用于评估文档中出现的关键字与建立通常一起找到的关键字词汇之间的关系概率。分析结果存储在存储库中。对数据存储库的搜索将生成相关文档的列表和相关术语的列表。用户可以从文档列表和/或相关术语列表中进行选择,以精炼搜索并以最少的无关数据检索满足用户搜索目标的那些文档。

著录项

  • 公开/公告号US9064005B2

    专利类型

  • 公开/公告日2015-06-23

    原文格式PDF

  • 申请/专利权人 JAN MAGNUS STENSMO;

    申请/专利号US20070776634

  • 发明设计人 JAN MAGNUS STENSMO;

    申请日2007-07-12

  • 分类号G06F7;G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 15:19:55

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号