...
首页> 外文期刊>Knowledge-Based Systems >Fast anytime retrieval with confidence in large-scale temporal case bases
【24h】

Fast anytime retrieval with confidence in large-scale temporal case bases

机译:在大规模的时间案例基础上充满信心地快速检索

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This work is about speeding up retrieval in Case-Based Reasoning (CBR) for large-scale case bases (CBs) comprised of temporally related cases in metric spaces. A typical example is a CB of electronic health records where consecutive sessions of a patient forms a sequence of related cases. k-Nearest Neighbors (kNN) search is a widely used algorithm in CBR retrieval. However, brute-force kNN is impossible for large CBs. As a contribution to efforts for speeding up kNN search, we introduce an anytime kNN search methodology and algorithm. Anytime Lazy kNN finds exact kNNs when allowed to run to completion with remarkable gain in execution time by avoiding unnecessary neighbor assessments. For applications where the gain in exact kNN search may not suffice, it can be interrupted earlier and it returns best-so-far kNNs together with a confidence value attached to each neighbor. We describe the algorithm and methodology to construct a probabilistic model that we use both to estimate confidence upon interruption and to automatize the interruption at desired confidence thresholds. We present the results of experiments conducted with publicly available datasets. The results show superior gains compared to brute-force search. We reach to an average gain of 87.18% with 0.98 confidence and to 96.84% with 0.70 confidence. (C) 2020 Elsevier B.V. All rights reserved.
机译:这项工作是关于在基于案例的推理(CBR)的基于案例基础(CBS)中加速检索,包括在度量空间中的时间相关案例。典型的例子是电子健康记录的CB,其中患者的连续会话形成了一系列相关情况。 K-最近邻居(knn)搜索是CBR检索中广泛使用的算法。但是,大型CBS不可能蛮力KNN。作为加快KNN搜索的努力的贡献,我们介绍了任何时间的KNN搜索方法和算法。如果允许在执行时间内允许在执行时间内完成时,懒惰的knn会发现精确的knns,通过避免不必要的邻居评估。对于精确Knn搜索的增益可能不足的应用,它可以更早地中断,并且它与附加到每个邻居的置信度值一起返回最佳的knns。我们描述了构建概率模型的算法和方法,以便在中断时使用两者估计置信度,并以所需的置信阈值自动化中断。我们介绍了与公共数据集进行的实验结果。与暴力搜索相比,结果显示出卓越的收益。我们达到87.18%的平均收益,自信为0.98,达到96.84%,自信为0.70。 (c)2020 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号