首页> 外文学位 >Analysis of task mapping for parallel supercomputers.
【24h】

Analysis of task mapping for parallel supercomputers.

机译:并行超级计算机的任务映射分析。

获取原文
获取原文并翻译 | 示例

摘要

This thesis concentrates on mapping applications to parallel computers with complex network architectures. The common practice of assigning tasks to processors without regard to the communication pattern of the problem or the network topology of the machine is an inefficient one. This approach has not caused any serious performance degradation for systems with small numbers of processors connected by simple, usually all-connected, networks. However, near-optimal performance for most general applications and architectures cannot be achieved without the incorporation of a sound mathematical model which represents the problem and the machine and predicts the relative runtimes for various mappings.;Many models have been developed, each appropriate under some circumstances. Few, however, deliver decent performance for selected scientific computing applications on a variety of architectures. We analyze the performance of several models on an assortment of problems on four computers with different network topologies. We attempt to improve upon models currently in use by developing methodologies to incorporate factors that are recognized as significant yet often ignored or poorly represented.;The two major problems studied in this thesis are integral components of many common applications: matrix multiplication and the fast Fourier transform. Each has been implemented on a Beowulf cluster, a distributed symmetric multiprocessing system, and two cellular architectures of differing topology. Our results reveal the great dependence of the performance of an application on the mapping model.;In addition to illustrating the significance of task mapping, we also address the difficulty that determining an efficient model can be a time-consuming operation. Our work seeks to remedy this problem by proposing guidelines for choosing an optimal method of task assignment, based on the applications and the architectures of the networks to be utilized. The goal is to use these guidelines as the foundation for a much more desirable programming paradigm: automatic parallelization.
机译:本文的重点是将应用程序映射到具有复杂网络架构的并行计算机。在不考虑问题的通信模式或机器的网络拓扑的情况下,将任务分配给处理器的通常做法是一种低效的做法。对于通过简单的,通常为全连接的网络连接的处理器数量较少的系统,该方法不会导致严重的性能下降。但是,如果不采用代表问题和机器的合理数学模型并预测各种映射的相对运行时间,就无法实现大多数通用应用程序和体系结构的近乎最佳的性能。情况。但是,很少有人能为各种架构上的选定科学计算应用程序提供出色的性能。我们在具有不同网络拓扑的四台计算机上分析各种问题的几种模型的性能。我们试图通过开发方法论来改进当前使用的模型,以纳入被认为是重要的但经常被忽略或表现不佳的因素。本论文研究的两个主要问题是许多常见应用的不可或缺的组成部分:矩阵乘法和快速傅立叶转变。每个都已在Beowulf集群,分布式对称多处理系统以及两个拓扑不同的蜂窝体系结构上实现。我们的结果揭示了应用程序性能对映射模型的极大依赖性。除了说明任务映射的重要性之外,我们还解决了确定有效模型可能是一项耗时的操作的难题。我们的工作旨在通过基于应用程序和要使用的网络体系结构,提出用于选择任务分配的最佳方法的准则,来解决此问题。目标是将这些准则用作更理想的编程范例的基础:自动并行化。

著录项

  • 作者

    Braunstein, Janet Laura.;

  • 作者单位

    State University of New York at Stony Brook.;

  • 授予单位 State University of New York at Stony Brook.;
  • 学科 Mathematics.;Computer Science.
  • 学位 Ph.D.
  • 年度 2007
  • 页码 111 p.
  • 总页数 111
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 数学;自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号