International Journal of High Performance Computing Applications

SUPPORTING SCALABLE AND DISTRIBUTED DATA SUBSETTING AND AGGREGATION IN LARGE-SCALE SEISMIC DATA ANALYSIS

Abstract

The ability to query and process very large, terabyte-scale datasets has become a key step in many scientific and engineering applications. In this paper, we describe how two middleware frameworks are applied in an integrated fashion to provide a scalable and efficient system for executing seismic data analysis on large datasets in a distributed environment. We investigate different strategies for efficiently querying large datasets, as well as parallel implementations of a seismic image reconstruction algorithm. Our results on a state-of-the-art mass storage system coupled with a high-end compute cluster show that our implementation is scalable and achieves a data processing rate of about 2.9 GB/s, roughly 70% of the storage platform's maximum application-level raw I/O bandwidth of 4.2 GB/s.
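
The subsetting-and-aggregation pattern the abstract refers to can be illustrated with a minimal sketch: partition the trace archive into chunks, let each worker keep only the traces that fall inside a spatial query box, reduce each chunk to a partial image, and sum the partial images in a final aggregation step. The chunk size, the query box, and the toy "reconstruction" (accumulating trace energy into a coarse grid) below are illustrative assumptions, not the paper's actual middleware or reconstruction algorithm.

# A minimal sketch of distributed data subsetting and aggregation, assuming a
# synthetic in-memory trace archive; the real system queries a mass storage
# platform and runs a parallel seismic image reconstruction on a compute cluster.
from multiprocessing import Pool
import numpy as np

N_TRACES = 100_000                    # synthetic stand-in for a terabyte-scale archive
SAMPLES_PER_TRACE = 64
GRID_SHAPE = (50, 50)                 # coarse "image" accumulated from selected traces
QUERY_BOX = ((0.2, 0.2), (0.6, 0.6))  # spatial subset of interest (x/y ranges), assumed

rng = np.random.default_rng(0)
coords = rng.random((N_TRACES, 2))                                   # trace midpoints
traces = rng.standard_normal((N_TRACES, SAMPLES_PER_TRACE)).astype(np.float32)

def process_chunk(idx_range):
    """Subset one chunk by the query box and aggregate its traces into a partial image."""
    lo, hi = idx_range
    (x0, y0), (x1, y1) = QUERY_BOX
    c = coords[lo:hi]
    keep = (c[:, 0] >= x0) & (c[:, 0] <= x1) & (c[:, 1] >= y0) & (c[:, 1] <= y1)
    partial = np.zeros(GRID_SHAPE, dtype=np.float64)
    for xy, tr in zip(c[keep], traces[lo:hi][keep]):
        gx = min(int(xy[0] * GRID_SHAPE[0]), GRID_SHAPE[0] - 1)
        gy = min(int(xy[1] * GRID_SHAPE[1]), GRID_SHAPE[1] - 1)
        partial[gx, gy] += float(tr.sum())        # toy stand-in for reconstruction
    return partial

if __name__ == "__main__":
    chunk = 10_000
    ranges = [(i, min(i + chunk, N_TRACES)) for i in range(0, N_TRACES, chunk)]
    with Pool() as pool:
        partials = pool.map(process_chunk, ranges)  # per-chunk subsetting + local aggregation
    image = np.sum(partials, axis=0)                # global reduction across workers
    print("aggregated image energy:", image.sum())

In this sketch the per-chunk filter plays the role of data subsetting close to the storage system, while the final sum over partial grids mirrors the aggregation step performed on the compute cluster; in the paper these roles are handled by the two middleware frameworks rather than a single Python process pool.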