首页> 外文会议>Annual German conference on artificial intelligence >On GPU-Based Nearest Neighbor Queries for Large-Scale Photometric Catalogs in Astronomy
【24h】

On GPU-Based Nearest Neighbor Queries for Large-Scale Photometric Catalogs in Astronomy

机译:基于GPU的天文学大型测光目录的最近邻查询

获取原文

摘要

Nowadays astronomical catalogs contain patterns of hundreds of millions of objects with data volumes in the terabyte range. Upcoming projects will gather such patterns for several billions of objects with peta- and exabytes of data. From a machine learning point of view, these settings often yield unsupervised, semi-supervised, or fully supervised tasks, with large training and huge test sets. Recent studies have demonstrated the effectiveness of prototype-based learning schemes such as simple nearest neighbor models. However, although being among the most computationally efficient methods for such settings (if implemented via spatial data structures), applying these models on all remaining patterns in a given catalog can easily take hours or even days. In this work, we investigate the practical effectiveness of GPU-based approaches to accelerate such nearest neighbor queries in this context. Our experiments indicate that carefully tuned implementations of spatial search structures for such multi-core devices can significantly reduce the practical runtime. This renders the resulting frameworks an important algorithmic tool for current and upcoming data analyses in astronomy.
机译:如今,天文目录包含数亿个对象的模式,其数据量在TB范围内。即将到来的项目将为数十亿个具有PB和EB数据的对象收集这种模式。从机器学习的角度来看,这些设置通常会产生大量的训练和庞大的测试集,从而产生无人监督,半监督或完全监督的任务。最近的研究证明了基于原型的学习方案(例如简单的最近邻居模型)的有效性。但是,尽管是用于此类设置的计算效率最高的方法之一(如果通过空间数据结构实现),但将这些模型应用于给定目录中的所有其余模式可能很容易花费数小时甚至数天。在这项工作中,我们研究了基于GPU的方法在这种情况下加速此类最近邻居查询的实际有效性。我们的实验表明,针对此类多核设备精心调整空间搜索结构的实现方式可以大大减少实际运行时间。这使得结果框架成为天文学中当前和即将进行的数据分析的重要算法工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号