【24h】

GPU-Accelerated Static Timing Analysis

机译:GPU加速的静态时序分析

获取原文

摘要

The ever-increasing power of graphics processing units (GPUs) has opened new opportunities for accelerating static timing analysis (STA) to a new milestone. Developing a CPU-GPU parallel STA engine is an extremely challenging job. We need to consider the unique problem characteristics of STA and distinct performance models between CPU and GPU, both of which require very strategic decomposition to benefit from heterogeneous parallelism. In this paper, we propose an efficient implementation for accelerating STA on a GPU. We leverage task-based approaches to decompose the STA workload into CPU-GPU dependent tasks where kernel computation and data processing overlap effectively. We develop GPU-efficient data structures and high-performance kernels to speed up various tasks of STA including levelization, delay calculation, and graph update. Our acceleration framework is flexible and adaptive. When tasks are scarce such as incremental timing, we run the normal CPU mode, and we enable GPU when tasks are massive. We have implemented our algorithms on top of OpenTimer and demonstrated promising performance speed-up on large designs. As an example, we achieved up to 3.69× speed-up on a large design of 1.6M gates and 1.6M nets using one GPU.
机译:图形处理单元(GPU)不断增强的功能为加速将静态时序分析(STA)推向新的里程碑提供了新的机遇。开发CPU-GPU并行STA引擎是一项极富挑战性的工作。我们需要考虑STA的独特问题特征,以及CPU和GPU之间的不同性能模型,这两者都需要进行非常有策略的分解才能从异构并行性中受益。在本文中,我们提出了一种在GPU上加速STA的有效实现。我们利用基于任务的方法将STA工作负载分解为依赖CPU-GPU的任务,其中内核计算和数据处理有效地重叠。我们开发了GPU高效的数据结构和高性能内核,以加快STA的各种任务,包括级别化,延迟计算和图形更新。我们的加速框架是灵活和自适应的。当任务稀少时(例如增量计时),我们将运行正常的CPU模式,并在任务繁重时启用GPU。我们已经在OpenTimer之上实现了我们的算法,并展示了在大型设计上有望实现的性能加速。例如,我们使用一个GPU在160万个门和160万个网的大型设计上实现了高达3.69倍的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号