Sparse Randomized Partition Trees for Nearest Neighbor Search

Kaushik Sinha; Omid Keivani

Abstract

Randomized partition trees have recently been shown to be very effective at solving the nearest neighbor search problem. Despite enjoying strong theoretical guarantees, they suffer from high space complexity, since each internal node of the tree needs to store a $d$-dimensional projection direction, leading to $O(nd)$ space complexity for a dataset of size $n$. Inspired by the fast Johnson-Lindenstrauss transform, in this paper we propose a sparse version of the randomized partition tree in which each internal node needs to store only a few non-zero entries, as opposed to all $d$ entries, leading to significant space savings without sacrificing much nearest neighbor search accuracy. As a by-product, the query time of our proposed method is slightly better than that of its non-sparse counterpart for large datasets. Our theoretical results indicate that our proposed method enjoys the same theoretical guarantees as the original non-sparse RP-tree. Experimental evaluations on four real-world datasets strongly suggest that the nearest neighbor search performance of our proposed sparse RP-tree is very similar to that of its non-sparse counterpart in terms of accuracy and number of retrieved points.
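
The idea of storing only a few non-zero entries per internal node can be illustrated with a minimal sketch. This is not the authors' construction: the abstract does not specify how the sparse directions are drawn, so the code below assumes `k` uniformly chosen coordinates with Gaussian values, a median split, and a simple route-to-one-leaf query. The function names and the parameters `k` and `leaf_size` are hypothetical.

```python
import random

def sparse_direction(d, k, rng):
    """Assumed scheme: k random coordinates with Gaussian weights, stored as (index, value)."""
    idx = rng.sample(range(d), k)
    return [(i, rng.gauss(0.0, 1.0)) for i in idx]

def project(x, direction):
    """Dot product that touches only the k stored coordinates."""
    return sum(x[i] * v for i, v in direction)

def build(points, ids, k, leaf_size, rng):
    """Recursively split the point ids at the median projection (assumed split rule)."""
    if len(ids) <= leaf_size:
        return {"leaf": ids}
    direction = sparse_direction(len(points[0]), k, rng)
    ordered = sorted(ids, key=lambda i: project(points[i], direction))
    mid = len(ordered) // 2
    return {
        "direction": direction,  # k entries per node instead of d
        "threshold": project(points[ordered[mid]], direction),
        "left": build(points, ordered[:mid], k, leaf_size, rng),
        "right": build(points, ordered[mid:], k, leaf_size, rng),
    }

def query(tree, points, q):
    """Route q to a single leaf, then scan that leaf for the closest point."""
    node = tree
    while "leaf" not in node:
        go_left = project(q, node["direction"]) < node["threshold"]
        node = node["left"] if go_left else node["right"]
    return min(
        node["leaf"],
        key=lambda i: sum((a - b) ** 2 for a, b in zip(points[i], q)),
    )

if __name__ == "__main__":
    rng = random.Random(0)
    data = [[rng.random() for _ in range(16)] for _ in range(100)]
    tree = build(data, list(range(len(data))), k=3, leaf_size=10, rng=rng)
    print(query(tree, data, data[7]))
```

In this sketch each internal node keeps `k` index-value pairs rather than a full $d$-dimensional vector, and each routing step costs $O(k)$ instead of $O(d)$, which mirrors the space and query-time savings the abstract describes.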

Bibliographic details

Source: JMLR: Workshop and Conference Proceedings, 2017, Issue 3, 9 pages
Authors: Kaushik Sinha; Omid Keivani
Original format: PDF
Chinese Library Classification: Artificial intelligence theory
