...
首页> 外文期刊>Information Systems >Exploiting the structure of furthest neighbor search for fast approximate results
【24h】

Exploiting the structure of furthest neighbor search for fast approximate results

机译:利用最远邻居搜索的结构获得快速的近似结果

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We present a novel strategy for approximate furthest neighbor search that selects a set of candidate points using the data distribution. This strategy leads to an algorithm, which we callDrusillaSelect, that is able to outperform existing approximate furthest neighbor strategies. Our strategy is motivated by a study of the behavior of the furthest neighbor search problem, which has significantly different structure than the nearest neighbor search problem, and can be understood with the help of an information-theoretic hardness measure that we introduce. We also present a variant of the algorithm that gives an absolute approximation guarantee; under some assumptions, the guaranteed approximation can be achieved in provably less time than brute-force search. Performance studies indicate thatDrusillaSelectcan achieve comparable levels of approximation to other algorithms, even on the hardest datasets, while giving up to an order of magnitude speedup. An implementation is available in themlpackmachine learning library (found athttp://www.mlpack.org).
机译:我们提出一种用于近似最远邻居搜索的新颖策略,该策略使用数据分布选择一组候选点。这种策略导致了一种我们称为DrusillaSelect的算法,该算法的性能优于现有的近似最远邻居策略。我们的策略是通过对最远的邻居搜索问题的行为进行研究而获得的,该问题的结构与最近的邻居搜索问题大不相同,并且可以借助我们引入的信息理论硬度度量加以理解。我们还提出了一种算法的变体,它可以提供绝对逼近保证。在某些假设下,与暴力搜索相比,可以在更短的时间内实现保证的近似。性能研究表明,即使在最困难的数据集上,DrusillaSelect仍可以达到与其他算法相当的逼近水平,同时放弃了一个数量级的加速。 mlpackmachine学习库(位于http://www.mlpack.org)中提供了一种实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号