首页> 外文学位 >A partitioning approach for GPU accelerated level-based on-chip variation static timing analysis.

【24h】

A partitioning approach for GPU accelerated level-based on-chip variation static timing analysis.

机译：一种基于GPU加速级的片上变化静态时序分析的分区方法。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Technology and design trends have made timing analysis the bottleneck of electronic design automation (EDA) tools. Efficient and accurate timing analysis is a challenge that the EDA industry must overcome in order to move forward. Using LLCOCV leverages Physical Location, Path Level, and Cell type information to further increase timing accuracy. This model introduces increased data complexity as a result of maintaining delays for each unique path-level. We parallelize this computation for co-processing on a CUDA enabled GPU. We introduce a novel divide-and-conquer partitioning approach for computing the per-level delay data used in the level-based aspect of LLC-OCV. Partitioning the circuit graph halves the inherently serial structure of a topological traversal of the circuit graph with a costly but more parallel merge step that combines the solutions of the two partitions. Using a massively parallel GPU-based approach allows us to absorb the cost of merging by performing it in parallel. Our experimental results on the ISCAS '85 benchmark demonstrate our parallel algorithm scales with timing graph size more efficiently than the serial algorithm. Results also show that our partitioning approach allows us to more fully utilize the massively parallel computational resources of the GPU. Our experiments on artificial test cases demonstrate that the parallel algorithms outperform the serial algorithm on large non-linear graph structures. We also find that LOCV timing analysis is a memory bound computation. We expect our algorithm to perform better on the newer Fermi architecture because of the new cached memory architecture.

机译：技术和设计趋势已使时序分析成为电子设计自动化（EDA）工具的瓶颈。高效，准确的时序分析是EDA行业必须克服的挑战，才能向前发展。使用LLCOCV可利用物理位置，路径级别和小区类型信息来进一步提高定时精度。由于为每个唯一路径级别维持延迟，因此该模型引入了增加的数据复杂性。我们将这种计算并行化，以便在支持CUDA的GPU上进行协同处理。我们介绍了一种新颖的分治法分区方法，用于计算在LLC-OCV的基于级别的方面使用的每个级别的延迟数据。对电路图进行分区，可以通过将两个分区的解决方案组合在一起的昂贵但更并行的合并步骤，将电路图的拓扑遍历的固有串行结构减半。使用基于大规模并行GPU的方法，我们可以通过并行执行合并来吸收合并的成本。我们在ISCAS '85基准上的实验结果证明，与时序算法相比，具有时序图大小的并行算法规模更有效。结果还表明，我们的分区方法使我们能够更充分地利用GPU的大规模并行计算资源。我们在人工测试用例上的实验表明，在大型非线性图形结构上，并行算法的性能优于串行算法。我们还发现LOCV时序分析是一个内存限制的计算。由于新的缓存内存架构，我们希望我们的算法在更新的Fermi架构上能表现更好。

著录项

作者
Zhang, Michael Longqiang.;
展开▼
作者单位

University of California, San Diego.;

展开▼
授予单位 University of California, San Diego.;
学科 Engineering Electronics and Electrical.;Computer Science.
学位 M.S.
年度 2010
页码 66 p.
总页数 66
原文格式 PDF
正文语种 eng
中图分类
关键词
入库时间 2022-08-17 11:36:45

相似文献

外文文献
中文文献
专利

1. Timing-Driven Variation-Aware Partitioning and Optimization of Mixed Static-Dynamic CMOS Circuits [J] . Kumar Yelamarthi Circuits and Systems . 2013,第2期

机译：时序驱动的变化感知分区和混合静态动态CMOS电路的优化
2. Timing-Driven Variation-Aware Partitioning and Optimization of Mixed Static-Dynamic CMOS Circuits [J] . Kumar Yelamarthi Circuits and systems . 2013,第2期

机译：时序驱动的变化感知分区和混合静态动态CMOS电路的优化
3. Accelerating calculations of RNA secondary structure partition functions using GPUs [J] . Harry A Stern, David H Mathews Algorithms for Molecular Biology . 2013,第1期

机译：使用GPU加速RNA二级结构分区功能的计算
4. GPU-Accelerated Static Timing Analysis [C] . Zizheng Guo, Tsung-Wei Huang, Yibo Lin IEEE/ACM International Conference On Computer Aided Design . 2020

机译：GPU加速的静态时序分析
5. Compact Variation-Aware Standard Cell Models for Statistical Static Timing Analysis. [D] . Aftabjahani, Seyed-Abdollah. 2011

机译：用于统计静态时序分析的紧凑型变异感知标准单元模型。
6. Accelerating calculations of RNA secondary structure partition functions using GPUs [O] . Harry A Stern, David H Mathews 2013

机译：使用GPU加速RNA二级结构分区功能的计算
7. Timing-Driven Variation-Aware Partitioning and Optimization of Mixed Static-Dynamic CMOS Circuits [O] . Kumar Yelamarthi 2013

机译：时序驱动的变化感知分区和混合静态动态CMOS电路的优化

A partitioning approach for GPU accelerated level-based on-chip variation static timing analysis.

摘要

著录项

相似文献

相关主题

期刊订阅