首页> 外文会议>Innovative Architecture for Future Generation High-Performance Processors and Systems Conference >An approach towards an analytical characterization of locality and its portability
【24h】

An approach towards an analytical characterization of locality and its portability

机译:一种探讨局部性分析及其可移植性的方法

获取原文

摘要

The evolution of computing technology towards the ultimate physical limits makes communication the dominant cost of computing. It would then be desirable to have a framework for the study of locality, which we define as the property of an algorithm that enables implementations with reduced communication overheads. We view as part of the algorithm all those aspects that define the operations to be performed and their data dependences, at a functional level. We view as part of the implementation all those aspects that pertain to the use of machine resources during execution of the algorithm, such as operation scheduling, memory management, and message routing. We discuss the issue of useful characterizations of the locality of an algorithm with reference to both single machines and classes of machines. We then consider the question of portability of locality, viewed as the existence of a single implementation which is optimal across a class of machines. We also formulate a less stringent notion of portability, where the implementation is allowed to be parametrized, so as to adapt to the executing platform, and suboptimal performance is accepted, within specified bounds. We illustrate the proposed approach with its application to the study of temporal locality, the property of an algorithm that enables efficient implementations on machines where memory accesses have a variable latency depending on the location being accessed. As a first step, we only consider serial implementations, which can be viewed as defined by the choice of an operation schedule and by a memory management. We discuss how, for a fixed operation schedule, temporal locality can be characterized for interesting classes of uniform hierarchical machines by a set of metrics, the width lengths of the schedule, which are only logarithmically many in the number of operations. Moreover, a portable memory management of any schedule can be obtained for such classes of machines. The situation becomes more complex when the schedule is a degree of freedom of the implementation. Then, while some computations do admit a single schedule optimal across many machines, this is not always the case. Thus, in general, only the less stringent notion of portability, based on parametrized schedules, can be pursued. Correspondingly, a concise characterization of temporal locality becomes harder to achieve and still remains an interesting open problem.
机译:计算技术朝向最终物理限制的演变使得沟通计算的主要计算成本。然后,希望具有用于研究局部性的框架,我们将其定义为实现具有减少通信开销的实现的算法的属性。我们视为算法的一部分,所有这些方面都定义要执行的操作的所有方面以及它们在功能级别的数据依赖性。我们将作为在执行算法执行期间使用机器资源的所有这些方面的一部分,例如操作调度,存储器管理和消息路由。我们将参考单机和机器类讨论算法局部性的有用特征问题。然后,我们考虑局部性的可移植性问题,视为在一类机器上最佳的单一实现的存在。我们还制定了更严格的可移植性概念,其中允许实现参数化,以便适应执行平台,并且在指定边界内接受次优的性能。我们以其应用于时间局部的研究,算法的属性来说明所提出的方法,该算法在存储器访问的计算机上实现有效的实现,其中存储器访问的变量延迟根据被访问的位置。作为第一步,我们只考虑串行实现,可以通过选择操作计划和存储器管理来查看。我们讨论如何,对于固定的操作计划,可以通过一组度量来表征如何对统一分层机的有趣类,时间表的宽度长度仅在操作的数量上对数来进行。此外,可以获得任何计划的便携式存储器管理,用于这些类别的机器。当计划是实施的自由度时,情况变得更加复杂。然后,虽然某些计算确实承认在许多机器上最佳的单个计划,但情况并非总是如此。因此,通常只能追求基于参数化计划的便携性的严格概念。相应地,时间局部的简洁表征变得难以实现,并且仍然是一个有趣的公开问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号