首页> 外国专利> Phrase extraction using subphrase scoring

Phrase extraction using subphrase scoring

机译:使用子短语评分的短语提取

摘要

An information retrieval system uses phrases to index, retrieve, organize and describe documents. Phrases are extracted from the document collection. Documents are the indexed according to their included phrases, using phrase posting lists. The phrase posting lists are stored in an cluster of index servers. The phrase posting lists can be tiered into groups, and sharded into partitions. Phrases in a query are identified based on possible phrasifications. A query schedule based on the phrases is created from the phrases, and then optimized to reduce query processing and communication costs. The execution of the query schedule is managed to further reduce or eliminate query processing operations at various ones of the index servers.
机译:信息检索系统使用短语来索引,检索,组织和描述文档。短语是从文档集中提取的。使用短语发布列表根据文档包含的短语对文档进行索引。短语发布列表存储在索引服务器的群集中。短语发布列表可以分为几组,并分成多个分区。根据可能的短语识别查询中的短语。从短语中创建基于短语的查询计划,然后对其进行优化以减少查询处理和通信成本。管理查询调度的执行以进一步减少或消除各个索引服务器上的查询处理操作。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号