...
首页> 外文期刊>Journal of supercomputing >Relevance maximization for high-recall retrieval problem: finding all needles in a haystack
【24h】

Relevance maximization for high-recall retrieval problem: finding all needles in a haystack

机译:高回忆检索问题的相关性最大化:在干草堆中找到所有针

获取原文
获取原文并翻译 | 示例

摘要

High-recall retrieval problem, aiming at finding the full set of relevant documents in a huge result set by effective mining techniques, is particularly useful for patent information retrieval, legal document retrieval, medical document retrieval, market information retrieval, and literature review. The existing high-recall retrieval methods, however, have been far from satisfactory to retrieve all relevant documents due to not only high-recall and precision threshold measurements but also a sheer minimize the number of reviewed documents. To address this gap, we generalize the problem to a novel high-recall retrieval model, which can be represented as finding all needles in a giant haystack. To compute candidate groups consisting ofkrelevant documents efficiently, we propose dynamic diverse retrieval algorithms specialized for the patent-searching method, in which an effective dynamic interactive retrieval can be achieved. In the various types of datasets, the dynamic ranking method shows considerable improvements with respect to time and cost over the conventional static ranking approaches.
机译:高召回检索问题,旨在通过有效采矿技术的巨大结果找到全套相关文件,特别适用于专利信息检索,法律文件检索,医学文件检索,市场信息检索和文献综述。然而,现有的高召回检索方法远未令人满意地检索所有相关文件,因为不仅是高召回和精确的阈值测量,而且纯粹最小化了审查文档的数量。为了解决这一差距,我们将问题概括为新的高召回检索模型,这可以表示为在巨大的干草堆中找到所有针。为了有效地计算OFKREVANT文件组成的候选组,我们提出了专门用于专利搜索方法的动态定样检索算法,其中可以实现有效的动态交互式检索。在各种类型的数据集中,动态排名方法对传统静态排名方法的时间和成本表示相当大的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号