
Optimizing for a many-core architecture without compromising ease-of-programming.



Abstract

Faced with nearly stagnant clock speed advances, chip manufacturers have turned to parallelism as the source for continuing performance improvements. But even though numerous parallel architectures have already been brought to market, a universally accepted methodology for programming them for general purpose applications has yet to emerge. Existing solutions tend to be hardware-specific, rendering them difficult to use for the majority of application programmers and domain experts, and not providing scalability guarantees for future generations of the hardware.

This dissertation advances the validation of the following thesis: it is possible to develop efficient general-purpose programs for a many-core platform using a model recognized for its simplicity. To prove this thesis, we refer to the eXplicit Multi-Threading (XMT) architecture designed and built at the University of Maryland. XMT is an attempt at re-inventing parallel computing with a solid theoretical foundation and an aggressive scalable design. Algorithmically, XMT is inspired by the PRAM (Parallel Random Access Machine) model and the architecture design is focused on reducing inter-task communication and synchronization overheads and providing an easy-to-program parallel model.

This thesis builds upon the existing XMT infrastructure to improve support for efficient execution with a focus on ease-of-programming. Our contributions aim at reducing the programmer's effort in developing XMT applications and improving the overall performance. More concretely, we: (1) present a work-flow guiding programmers to produce efficient parallel solutions starting from a high-level problem; (2) introduce an analytical performance model for XMT programs and provide a methodology to project running time from an implementation; (3) propose and evaluate RAP, an improved resource-aware compiler loop prefetching algorithm targeted at fine-grained many-core architectures; we demonstrate performance improvements of up to 34.79% on average over the GCC loop prefetching implementation and up to 24.61% on average over a simple hardware prefetching scheme; and (4) implement a number of parallel benchmarks and evaluate the overall performance of XMT relative to existing serial and parallel solutions, showing speedups of up to 13.89x vs. a serial processor and 8.10x vs. parallel code optimized for an existing many-core (GPU). We also discuss the implementation and optimization of the Max-Flow algorithm on XMT, a problem which is among the more advanced in terms of complexity, benchmarking and research interest in the parallel algorithms community. We demonstrate better speed-ups compared to a best serial solution than previous attempts on other parallel platforms.

Bibliographic record

  • Author

    Caragea, George Constantin

  • Author affiliation

    University of Maryland, College Park

  • Degree-granting institution University of Maryland, College Park
  • Subject Engineering Computer; Computer Science
  • Degree Ph.D.
  • Year 2011
  • Pages 165 p.
  • Total pages 165
  • Original format PDF
  • Text language English (eng)
