首页> 外文会议>International Conference on Cloud Computing and Internet of Things >Fast KNN search for big data with set compression tree and best bin first
【24h】

Fast KNN search for big data with set compression tree and best bin first

机译:通过设置压缩树和最佳bin优先级快速KNN搜索大数据

获取原文

摘要

This paper proposes k nearest neighbors (kNN) search based on set compression tree (SCT) and best bin first (BBF) to deal with the problem for big data. The large compression rate by set compression tree is achieved by compressing the set of descriptors jointly instead of compressing on a per-descript or basis. So set compression tree has a good performance in kNN search at a low bit rate. At the same time, the best bin first (BBF) is a very efficient algorithm to find the approximately kNN from a large number of high dimensional feature descriptors. SCT-BBF is a novel exploration and it improves search performance in three aspects: First, SCT-BBF requires less memory footprint, which is important in big data age. Second, it increases accuracy compared traditional method like KD-Tree and original SCT. SCT-BBF can be used with other data processing methods like PCA and SIFT to perform better. Third, this paper adopts finer search to increase accuracy at a slight loss of speed. And it can extend to big data easily.
机译:本文提出了基于集合压缩树(SCT)和最佳bin优先(BBF)的k个最近邻搜索(kNN)来解决大数据问题。通过集合压缩树而不是按每个描述或基础进行压缩,可以通过集合压缩树实现较大的压缩率。因此,设置压缩树在低比特率的kNN搜索中具有良好的性能。同时,最佳bin优先(BBF)是一种非常有效的算法,可以从大量的高维特征描述符中找到近似kNN。 SCT-BBF是一项新颖的探索,它从三个方面提高了搜索性能:首先,SCT-BBF需要更少的内存空间,这在大数据时代很重要。其次,与传统方法(例如KD-Tree和原始SCT)相比,它提高了准确性。 SCT-BBF可以与其他数据处理方法(例如PCA和SIFT)一起使用,以实现更好的性能。第三,本文采用更精细的搜索来提高精度,但速度略有下降。它可以轻松扩展到大数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号