首页> 外文学位 >Effective Large Scale Computing Software for Parallel Mesh Generation.
【24h】

Effective Large Scale Computing Software for Parallel Mesh Generation.

机译:用于并行网格生成的有效大型计算软件。

获取原文
获取原文并翻译 | 示例

摘要

Scientists commonly turn to supercomputers or Clusters of Workstations with hundreds (even thousands) of nodes to generate meshes for large-scale simulations. Parallel mesh generation software is then used to decompose the original mesh generation problem into smaller sub-problems that can be solved (meshed) in parallel. The size of the final mesh is limited by the amount of aggregate memory of the parallel machine. Also, requesting many compute nodes on a shared computing resource may result in a long waiting, far surpassing the time it takes to solve the problem.;These two problems (i.e., insufficient memory when computing on a small number of nodes, and long waiting times when using many nodes from a shared computing resource) can be addressed by using out-of-core algorithms. These are algorithms that keep most of the dataset out-of-core (i.e., outside of memory, on disk) and load only a portion in-core (i.e., into memory) at a time.;We explored two approaches to out-of-core computing. First, we presented a traditional approach, which is to modify the existing in-core algorithms to enable out-of-core computing. While we achieved good performance with this approach the task is complex and labor intensive. An alternative approach, we presented a runtime system designed to support out-of-core applications. It requires little modification of the existing in-core application code and still produces acceptable results. Evaluation of the runtime system showed little performance degradation while simplifying and shortening the development cycle of out-of-core applications. The overhead from using the runtime system for small problem sizes is between 12% and 41% while the overlap of computation, communication and disk I/O is above 50% and as high as 61% for large problems.;The main contribution of our work is the ability to utilize computing resources more effectively. The user has a choice of either solving larger problems, that otherwise would not be possible, or solving problems of the same size but using fewer computing nodes, thus reducing the waiting time on shared clusters and supercomputers. We demonstrated that the latter could potentially lead to substantially shorter wall-clock time.
机译:科学家通常转向具有数百(甚至数千)个节点的超级计算机或工作站集群来生成用于大规模仿真的网格。然后使用并行网格生成软件将原始网格生成问题分解为较小的子问题,这些子问题可以并行求解(网格化)。最终网格的大小受并行计算机的聚合内存量限制。同样,在共享计算资源上请求许多计算节点可能会导致漫长的等待时间,远远超过了解决问题所需的时间。这两个问题(即在少数节点上进行计算时内存不足,并且等待时间长)使用共享计算资源中的许多节点时,可以使用内核外算法解决。这些算法可将大多数数据集保持在内核外(即,在内存之外,在磁盘上),并且一次仅将一部分内核(即在内存中)加载。核心计算。首先,我们提出了一种传统方法,即修改现有的内核内算法以启用内核外计算。尽管我们通过这种方法获得了良好的性能,但任务却很复杂且劳动强度大。作为替代方法,我们提出了一种运行时系统,旨在支持核心外应用程序。它几乎不需要修改现有的核心应用程序代码,并且仍然可以产生可接受的结果。对运行时系统的评估显示,性能几乎没有下降,同时简化并缩短了核心应用程序的开发周期。对于较小的问题,使用运行时系统产生的开销在12%到41%之间,而计算,通信和磁盘I / O的重叠超过50%,对于大问题则高达61%。工作是更有效地利用计算资源的能力。用户可以选择解决更大的问题(否则将无法解决),或者解决相同大小但使用较少计算节点的问题,从而减少共享集群和超级计算机上的等待时间。我们证明了后者可以潜在地大大缩短挂钟时间。

著录项

  • 作者

    Kot, Andriy.;

  • 作者单位

    The College of William and Mary.;

  • 授予单位 The College of William and Mary.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 85 p.
  • 总页数 85
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号