Systems, methods and computer program products for discovering a text query from example documents

Abstract

Discovering a keyword query that corresponds to an input set of documents taken from a candidate pool includes selecting a document from a working set, which starts as the input set, and extracting a list of snippets from the selected document. For each snippet, a set of proximity queries based on selected terms in that snippet is executed, and all proximity queries that return fewer than N results from the candidate pool are found. From these, the query that returns the greatest number of working-set documents and the smallest number of documents not in the working set is selected. Documents returned by the selected query are removed from the working set, and the steps are repeated until no documents remain in the working set. The disjunction of the selected queries is returned as the discovered query.
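
The greedy loop described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation, and several details are assumptions the abstract does not specify: a snippet is taken to be a sentence, a proximity query is a pair of non-stopword terms within `window` tokens of each other, the next document is picked by smallest id, and a document with no qualifying query is set aside in `uncovered`. All function names here are hypothetical.

```python
import re
from itertools import combinations

# Assumption: a tiny stopword list stands in for "selected terms".
STOPWORDS = {"the", "a", "an", "of", "and", "in", "to", "is", "for", "on", "with"}

def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())

def snippets(text):
    # Assumption: one snippet per sentence.
    return [s for s in re.split(r"[.!?]+", text) if s.strip()]

def matches(tokens, query, window):
    # True if both query terms occur within `window` tokens of each other.
    a, b = query
    pos_a = [i for i, t in enumerate(tokens) if t == a]
    pos_b = [i for i, t in enumerate(tokens) if t == b]
    return any(abs(i - j) <= window for i in pos_a for j in pos_b)

def run_query(query, pool_tokens, window):
    return {d for d, toks in pool_tokens.items() if matches(toks, query, window)}

def candidate_queries(text, window):
    # Every pair of distinct non-stopword terms close together in a snippet.
    out = set()
    for snip in snippets(text):
        toks = tokenize(snip)
        for (i, a), (j, b) in combinations(enumerate(toks), 2):
            if a != b and j - i <= window and not {a, b} & STOPWORDS:
                out.add(tuple(sorted((a, b))))
    return out

def discover_query(pool, input_ids, max_results, window=5):
    pool_tokens = {d: tokenize(t) for d, t in pool.items()}
    working = set(input_ids)          # working set starts as the input set
    selected, uncovered = [], []
    while working:
        doc_id = min(working)         # assumption: deterministic pick
        best = None
        for q in sorted(candidate_queries(pool[doc_id], window)):
            hits = run_query(q, pool_tokens, window)
            if len(hits) >= max_results:   # keep only queries with < N results
                continue
            # Most working-set documents first, then fewest outside documents.
            score = (len(hits & working), -len(hits - working))
            if best is None or score > best[0]:
                best = (score, q, hits)
        if best is None:              # assumption: skip documents with no query
            uncovered.append(doc_id)
            working.discard(doc_id)
            continue
        selected.append(best[1])
        working -= best[2]            # drop every document the query returned
    return selected, uncovered

pool = {
    1: "Solar panels convert sunlight into electricity.",
    2: "Rooftop solar panels lower energy bills.",
    3: "Wind turbines convert wind into electricity.",
    4: "The bakery sells fresh bread daily.",
}
selected, uncovered = discover_query(pool, {1, 2}, max_results=3)
# The discovered query is the disjunction (OR) of the selected pairs.
print(" OR ".join(f"({a} NEAR {b})" for a, b in selected))
```

Because every candidate query is built from terms that sit close together in the selected document, the chosen query always returns that document, so the working set shrinks on each pass and the loop terminates.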

Bibliographic data

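  • Publication number: US8862605B2

  • Publication date: 2014-10-14

  • Original format: PDF

  • Applicant/assignee: WILLIAM S. SPANGLER

  • Application number: US201113300431

  • Inventor: WILLIAM S. SPANGLER

  • Filing date: 2011-11-18

  • Classification: G06F17/30

  • Country: US

  • Added to database: 2022-08-21 16:05:47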
