首页> 外文期刊>PASJ: Publications of the Astronomical Society of Japan >GRAPE-6A: A single-card GRAPE-6 for parallel PC-GRAPE cluster systems
【24h】

GRAPE-6A: A single-card GRAPE-6 for parallel PC-GRAPE cluster systems

机译:GRAPE-6A:用于并行PC-GRAPE集群系统的单卡GRAPE-6

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we describe the design and performance of GRAPE-6A, a special-purpose computer for gravitational many-body simulations. It was designed to be used with a PC cluster, in which each node has one GRAPE-6A. Such a configuration is particularly cost-effective in running parallel tree algorithms. Though the use of parallel tree algorithms was possible with the original GRAPE-6 hardware, it was not very cost-effective since a single GRAPE-6 board was still too fast and too expensive. Therefore, we designed GRAPE-6A as a single PCI card to minimize the reproduction cost and to optimize the computing speed. The peak performance is 130 Gflops for one GRAPE-6A board and 3.1 Tflops for our 24 node cluster. We describe the implementation of the tree, TreePM and individual timestep algorithms on both a single GRAPE-6A system and GRAPE-6A cluster. Using the tree algorithm on our 16-node GRAPE-6A system, we can complete a collisionless simulation with 100 million particles (8000 steps) within 10 days.
机译:在本文中,我们描述了用于重力多体仿真的专用计算机GRAPE-6A的设计和性能。它设计用于PC群集,其中每个节点都有一个GRAPE-6A。这样的配置在运行并行树算法中特别具有成本效益。尽管可以在原始的GRAPE-6硬件上使用并行树算法,但是由于单个GRAPE-6板仍然太快和太昂贵,因此它不是很划算。因此,我们将GRAPE-6A设计为单个PCI卡,以最大程度地降低复制成本并优化计算速度。一块GRAPE-6A板的最高性能为130 Gflops,而我们的24节点集群的最高性能为3.1 Tflops。我们描述了单个GRAPE-6A系统和GRAPE-6A群集上树,TreePM和各个时间步算法的实现。使用我们的16节点GRAPE-6A系统上的树算法,我们可以在10天内完成1亿个粒子(8000步)的无碰撞仿真。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号