首页> 外文学位 >Gridfields: Model-driven data transformation in the physical sciences.
【24h】

Gridfields: Model-driven data transformation in the physical sciences.

机译:网格域:物理科学中模型驱动的数据转换。

获取原文
获取原文并翻译 | 示例

摘要

Scientists' ability to generate and store simulation results is outpacing their ability to analyze them via ad hoc programs. We observe that these programs exhibit an algebraic structure that can be used to facilitate reasoning and improve performance. In this dissertation, we present a formal data model that exposes this algebraic structure, then implement the model, evaluate it, and use it to express, optimize, and reason about data transformations in a variety of scientific domains.; Simulation results are defined over a logical grid structure that allows a continuous domain to be represented discretely in the computer. Existing approaches for manipulating these gridded datasets are incomplete. The performance of SQL queries that manipulate large numeric datasets is not competitive with that of specialized tools, and the up-front effort required to deploy a relational database makes them unpopular for dynamic scientific applications. Tools for processing multidimensional arrays can only capture regular, rectilinear grids. Visualization libraries accommodate arbitrary grids, but no algebra has been developed to simplify their use and afford optimization. Further, these libraries are data dependent---physical changes to data characteristics break user programs.; We adopt the grid as a first-class citizen, separating topology from geometry and separating structure from data. Our model is agnostic with respect to dimension, uniformly capturing, for example, particle trajectories (1-D), sea-surface temperatures (2-D), and blood flow in the heart (3-D). Equipped with data, a grid becomes a gridfield. We provide operators for constructing, transforming, and aggregating gridfields that admit algebraic laws useful for optimization. We implement the model by analyzing several candidate data structures and incorporating their best features. We then show how to deploy gridfields in practice by injecting the model as middleware between heterogeneous, ad hoc file formats and a popular visualization library.; In this dissertation, we define, develop, implement, evaluate and deploy a model of gridded datasets that accommodates a variety of complex grid structures and a variety of complex data products. We evaluate the applicability and performance of the model using datasets from oceanography, seismology, and medicine and conclude that our model-driven approach offers significant advantages over the status quo.
机译:科学家生成和存储模拟结果的能力已经超过了他们通过即席程序进行分析的能力。我们观察到这些程序展现出可用于促进推理和提高性能的代数结构。在本文中,我们提出了一个正式的数据模型,该模型公开了这种代数结构,然后对其进行实现,评估,并用其表达,优化和推理各种科学领域中的数据转换。仿真结果是在逻辑网格结构上定义的,该逻辑网格结构允许在计算机中离散表示连续域。用于处理这些网格化数据集的现有方法尚不完善。处理大型数字数据集的SQL查询的性能与专用工具的性能不具竞争力,并且部署关系数据库所需的前期工作使其对于动态科学应用程序不受欢迎。用于处理多维数组的工具只能捕获规则的直线网格。可视化库可容纳任意网格,但尚未开发出代数来简化其使用并提供优化。此外,这些库是依赖于数据的-对数据特性的物理更改破坏了用户程序。我们将网格作为一流的公民,将拓扑结构与几何结构分离,将结构与数据分离。我们的模型在尺寸方面是不可知的,可以均匀地捕获例如粒子轨迹(1-D),海面温度(2-D)和心脏中的血流(3-D)。装有数据的网格成为网格域。我们提供用于构造,转换和聚合网格域的运算符,这些网格域接受对优化有用的代数定律。我们通过分析几种候选数据结构并结合其最佳功能来实施该模型。然后,我们通过将模型作为中间件注入异构,即席文件格式和流行的可视化库之间,展示了如何在实践中部署网格域。在本文中,我们定义,开发,实现,评估和部署了可容纳各种复杂网格结构和各种复杂数据产品的网格化数据集模型。我们使用海洋学,地震学和医学方面的数据集评估了该模型的适用性和性能,并得出结论,我们的模型驱动方法相对于现状具有明显优势。

著录项

  • 作者

    Howe, Bill.;

  • 作者单位

    Portland State University.;

  • 授予单位 Portland State University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2007
  • 页码 276 p.
  • 总页数 276
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号