首页> 外文会议>2015 IEEE Fifth International Conference on Big Data and Cloud Computing >Efficient k-Nearest Neighbors Search in High Dimensions Using MapReduce
【24h】

Efficient k-Nearest Neighbors Search in High Dimensions Using MapReduce

机译:使用MapReduce在高维中进行有效的k最近邻搜索

获取原文
获取原文并翻译 | 示例

摘要

Finding the k-Nearest Neighbors (kNN) of a query object for a given dataset S is a primitive operation in many application domains. kNN search is very costly, especially many applications witness a quick increase in the amount and dimension of data to be processed. Locality sensitive hashing (LSH) has become a very popular method for this problem. However, most such methods can't obtain good performance in terms of search quality, search efficiency and space cost at the same time, such as RankReduce, which gains good search efficiency at the sacrifice of the search quality. Motivated by these, we propose a novel LSH-based inverted index scheme and design an efficient search algorithm, called H-c2kNN, which enables fast high-dimensional kNN search with excellent quality and low space cost. For efficiency and scalability concerns, we implemented our proposed approach to solve the kNN search in high dimensional space using MapReduce, which is a well-known framework for data-intensive applications and conducted extensive experiments to evaluate our proposed approach using both synthetic and real datasets. The results show that our proposed approach outperforms baseline methods in high dimensional space.
机译:查找给定数据集S的查询对象的k最近邻(kNN)是许多应用程序领域中的原始操作。 kNN搜索非常昂贵,特别是许多应用程序见证了要处理的数据量和维度的快速增长。局部敏感哈希(LSH)已成为解决此问题的一种非常流行的方法。但是,大多数此类方法无法同时在搜索质量,搜索效率和空间成本方面获得良好的性能,例如RankReduce,它牺牲了搜索质量而获得了良好的搜索效率。出于这些原因,我们提出了一种基于LSH的新型倒排索引方案,并设计了一种有效的搜索算法H-c2kNN,该算法可实现高质量,低空间成本的快速高维kNN搜索。出于效率和可伸缩性方面的考虑,我们使用MapReduce实现了我们提出的方法来解决高维空间中的kNN搜索,MapReduce是数据密集型应用程序的著名框架,并进行了广泛的实验以使用合成数据集和实际数据集评估我们提出的方法。结果表明,我们提出的方法在高维空间中优于基线方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号