首页> 外文会议>International Conference on Networked Systems >The Out-of-core KNN Awakens: The Light Side of Computation Force on Large Datasets
【24h】

The Out-of-core KNN Awakens: The Light Side of Computation Force on Large Datasets

机译:核心外的knn唤醒:大型数据集上计算力的轻侧

获取原文

摘要

K-Nearest Neighbors (KNN) is a crucial tool for many applications, e.g. recommender systems, image classification and web-related applications. However, KNN is a resource greedy operation particularly for large datasets. We focus on the challenge of KNN computation over large datasets on a single commodity PC with limited memory. We propose a novel approach to compute KNN on large datasets by leveraging both disk and main memory efficiently. The main rationale of our approach is to minimize random accesses to disk, maximize sequential accesses to data and efficient usage of only the available memory. We evaluate our approach on large datasets, in terms of performance and memory consumption. The evaluation shows that our approach requires only 7% of the time needed by an in-memory baseline to compute a KNN graph.
机译:K-CORMALY邻居(KNN)是许多应用的重要工具,例如,推荐系统,图像分类和与Web相关的应用程序。然而,KNN是一个尤其是用于大型数据集的资源贪婪操作。我们专注于KNN计算在单个商品PC上具有有限的商品PC上的大型数据集的挑战。我们提出了一种通过有效利用两个磁盘和主记忆来计算大型数据集上的knn的新方法。我们方法的主要基本原理是最小化对磁盘的随机访问,最大化与可用内存仅对数据的顺序访问和有效使用。在性能和内存消耗方面,我们在大型数据集中评估我们的方法。评估表明,我们的方法只需要一个内存基线所需的7%来计算KNN图。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号