首页> 外文会议>International Conference for High Performance Computing, Networking, Storage and Analysis >Data-Intensive Spatial Filtering in Large Numerical Simulation Datasets
【24h】

Data-Intensive Spatial Filtering in Large Numerical Simulation Datasets

机译:大型数值模拟数据集中的数据密集型空间滤波

获取原文

摘要

We present a query processing framework for the efficient evaluation of spatial filters on large numerical simulation datasets stored in a data-intensive cluster. Previously, filtering of large numerical simulations stored in scientific databases has been impractical owing to the immense data requirements. Rather, filtering is done during simulation or by loading snapshots into the aggregate memory of an HPC cluster. Our system performs filtering within the database and supports large filter widths. We present two complementary methods of execution: I/O streaming computes a batch filter query in a single sequential pass using incremental evaluation of decomposable kernels, summed volumes generates an intermediate data set and evaluates each filtered value by accessing only eight points in this dataset. We dynamically choose between these methods depending upon workload characteristics. The system allows us to perform filters against large data sets with little overhead: query performance scales with the cluster's aggregate I/O throughput.
机译:我们提出了一个查询处理框架,用于在数据密集型集群中存储的大型数值模拟数据集上的空间滤波器的有效评估。以前,由于巨大的数据要求,存储在科学数据库中的大数数值模拟的过滤是不切实际的。相反,在模拟期间或通过将快照加载到HPC集群的聚合存储器中进行过滤。我们的系统在数据库中执行过滤,并支持大的滤波器宽度。我们呈现了两个互补方法执行方法:I / O流使用可分解内核的增量评估计算单个顺序传输中的批量滤波器查询,总结卷生成中间数据集,并通过访问该数据集中的仅八个点来评估每个过滤值。我们根据工作负载特性动态选择这些方法。该系统允许我们对大型数据集执行过滤器,几乎没有开销:查询性能尺度与群集的聚合I / O吞吐量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号