...
首页> 外文期刊>EURASIP journal on advances in signal processing >A Data-Driven Multidimensional Indexing Method for Data Mining in Astrophysical Databases
【24h】

A Data-Driven Multidimensional Indexing Method for Data Mining in Astrophysical Databases

机译:天体数据库中数据挖掘的数据驱动多维索引方法

获取原文
           

摘要

Large archives and digital sky surveys with dimensions of bytes currently exist, while in the near future they will reach sizes of the order of . Numerical simulations are also producing comparable volumes of information. Data mining tools are needed for information extraction from such large datasets. In this work, we propose a multidimensional indexing method, based on a static R-tree data structure, to efficiently query and mine large astrophysical datasets. We follow a top-down construction method, called VAMSplit, which recursively splits the dataset on a near median element along the dimension with maximum variance. The obtained index partitions the dataset into nonoverlapping bounding boxes, with volumes proportional to the local data density. Finally, we show an application of this method for the detection of point sources from a gamma-ray photon list.
机译:当前存在具有字节大小的大型档案和数字天空勘测,而在不久的将来,它们的大小将达到。数值模拟也产生了可比较的信息量。从这样的大型数据集中提取信息需要数据挖掘工具。在这项工作中,我们提出了一种基于静态R树数据结构的多维索引方法,以有效地查询和挖掘大型天体数据集。我们遵循一种称为VAMSplit的自上而下的构造方法,该方法将数据集沿具有最大方差的维递归地分割在接近中值的元素上。获得的索​​引将数据集划分为不重叠的边界框,其体积与局部数据密度成比例。最后,我们展示了该方法在伽马射线光子列表中检测点源的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号