首页> 外文学位 >Systematic code partitioning for the disjoint-memory co-processor accelerated execution model.
【24h】

Systematic code partitioning for the disjoint-memory co-processor accelerated execution model.

机译:不相干内存协处理器加速执行模型的系统代码分区。

获取原文
获取原文并翻译 | 示例

摘要

In the heterogeneous computing execution model, one or more general-purpose processors are accelerated using one or more co-processors. In this model, general-purpose CPUs are generally assigned portions of the software that either do not map well to the available co-processor microarchitectures or whose low execution time does not warrant the extra effort required to adapt the code to the co-processor's programming model. The co-processors, on the other hand, are assigned the most computationally expensive portions of the software, and this code is adapted to the co-processor's specialized programming model.;In order for legacy code to take advantage of a heterogeneous computer, a programmer must partition its code to select which portions of it to map to the co-processor. The selection criteria typically involves finding and selecting the application's most expensive computations, or kernels. However, this methodology only considers execution time while ignoring memory behavior. In heterogeneous systems where the general-purpose processor and co-processor have disjoint memory spaces, there is a penalty required to exchange data between processors, and it is important to minimize communication cost. We refer to this category of heterogeneous systems as "Disjoint Memory Co-Processor Accelerated Computing (DiMCAC).";The partitioning procedure is typically performed in an ad hoc manner due to the lack of existing automation tools designed for this task. The tools that do exist are not specially designed for the DiMCAC model, or require that a programmer perform manual analysis which can take a considerable amount of time when the programmer is not familiar with the application.;To address this issue, this research presents a Partitioning Analysis Tool for Heterogeneous Systems (PATHS). PATHS is an analysis toolchain that performs a fully automated behavioral analysis of applications to be partitioned for execution on computing platforms that correspond to the DiMCAC model. In this case, making effective partitioning decisions may require optimizing against competing constraints.;In this dissertation, we describe new instrumentation, measurement, presentation and selection components that are implemented in PATHS to support a systematic partitioning methodology. PATHS' primary contributions are the development of (1) a novel methodology for instrumentation and runtime data collection to monitor execution time and transferred data movement at the loop level, (2) an objective function for determining the fitness of an arbitrary set of assignments, and (3) a heuristic search technique for finding an effective solution.;In an experimental evaluation of five different computationally intensive applications, PATHS provides a top ranked candidate accelerator for each application with a fitness evaluation that is higher than candidate accelerators selected from application profiles performed by GNU Gprof.
机译:在异构计算执行模型中,使用一个或多个协处理器来加速一个或多个通用处理器。在此模型中,通常为通用CPU分配软件的某些部分,这些部分可能无法很好地映射到可用的协处理器微体系结构,或者其执行时间短,因此无法保证将代码适应协处理器编程所需的额外工作。模型。另一方面,为协处理器分配了软件中计算最昂贵的部分,并且此代码适用于协处理器的专门编程模型。为了使旧代码能够利用异构计算机,程序员必须对代码进行分区,以选择映射到协处理器的哪些部分。选择标准通常涉及查找和选择应用程序最昂贵的计算或内核。但是,此方法仅考虑执行时间,而忽略内存行为。在通用处理器和协处理器的存储空间不相交的异构系统中,在处理器之间交换数据需要付出一定的代价,因此最小化通信成本非常重要。我们将此异构系统类别称为“不相干内存协处理器加速计算(DiMCAC)”。由于缺乏为此目的设计的现有自动化工具,因此分区过程通常以临时方式执行。确实存在的工具不是专门为DiMCAC模型设计的,或者要求程序员执行手动分析,而这在程序员不熟悉应用程序时可能会花费大量时间。为了解决此问题,本研究提出了一种异构系统分区分析工具(PATHS)。 PATHS是一种分析工具链,它对要分区的应用程序执行全自动的行为分析,以在与DiMCAC模型相对应的计算平台上执行。在这种情况下,做出有效的分区决策可能需要针对竞争性约束进行优化。本文将介绍在PATHS中实现的新仪器,测量,表示和选择组件,以支持系统的分区方法。 PATHS的主要贡献是(1)一种用于检测和运行时数据的新颖方法,以在循环级别监视执行时间和传输的数据运动;(2)一个目标函数,用于确定任意一组分配的适用性; (3)寻找有效解决方案的启发式搜索技术。在对五个不同计算密集型应用程序的实验评估中,PATHS为每个应用程序提供了排名最高的候选加速器,其适应性评估高于从应用程序配置文件中选择的候选加速器由GNU Gprof执行。

著录项

  • 作者

    Mintz, Tiffany M.;

  • 作者单位

    University of South Carolina.;

  • 授予单位 University of South Carolina.;
  • 学科 Engineering Computer.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 105 p.
  • 总页数 105
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号