首页> 外文学位 >A partitioning approach for GPU accelerated level-based on-chip variation static timing analysis.
【24h】

A partitioning approach for GPU accelerated level-based on-chip variation static timing analysis.

机译:一种基于GPU加速级的片上变化静态时序分析的分区方法。

获取原文
获取原文并翻译 | 示例

摘要

Technology and design trends have made timing analysis the bottleneck of electronic design automation (EDA) tools. Efficient and accurate timing analysis is a challenge that the EDA industry must overcome in order to move forward. Using LLCOCV leverages Physical Location, Path Level, and Cell type information to further increase timing accuracy. This model introduces increased data complexity as a result of maintaining delays for each unique path-level. We parallelize this computation for co-processing on a CUDA enabled GPU. We introduce a novel divide-and-conquer partitioning approach for computing the per-level delay data used in the level-based aspect of LLC-OCV. Partitioning the circuit graph halves the inherently serial structure of a topological traversal of the circuit graph with a costly but more parallel merge step that combines the solutions of the two partitions. Using a massively parallel GPU-based approach allows us to absorb the cost of merging by performing it in parallel. Our experimental results on the ISCAS '85 benchmark demonstrate our parallel algorithm scales with timing graph size more efficiently than the serial algorithm. Results also show that our partitioning approach allows us to more fully utilize the massively parallel computational resources of the GPU. Our experiments on artificial test cases demonstrate that the parallel algorithms outperform the serial algorithm on large non-linear graph structures. We also find that LOCV timing analysis is a memory bound computation. We expect our algorithm to perform better on the newer Fermi architecture because of the new cached memory architecture.
机译:技术和设计趋势已使时序分析成为电子设计自动化(EDA)工具的瓶颈。高效,准确的时序分析是EDA行业必须克服的挑战,才能向前发展。使用LLCOCV可利用物理位置,路径级别和小区类型信息来进一步提高定时精度。由于为每个唯一路径级别维持延迟,因此该模型引入了增加的数据复杂性。我们将这种计算并行化,以便在支持CUDA的GPU上进行协同处理。我们介绍了一种新颖的分治法分区方法,用于计算在LLC-OCV的基于级别的方面使用的每个级别的延迟数据。对电路图进行分区,可以通过将两个分区的解决方案组合在一起的昂贵但更并行的合并步骤,将电路图的拓扑遍历的固有串行结构减半。使用基于大规模并行GPU的方法,我们可以通过并行执行合并来吸收合并的成本。我们在ISCAS '85基准上的实验结果证明,与时序算法相比,具有时序图大小的并行算法规模更有效。结果还表明,我们的分区方法使我们能够更充分地利用GPU的大规模并行计算资源。我们在人工测试用例上的实验表明,在大型非线性图形结构上,并行算法的性能优于串行算法。我们还发现LOCV时序分析是一个内存限制的计算。由于新的缓存内存架构,我们希望我们的算法在更新的Fermi架构上能表现更好。

著录项

  • 作者

    Zhang, Michael Longqiang.;

  • 作者单位

    University of California, San Diego.;

  • 授予单位 University of California, San Diego.;
  • 学科 Engineering Electronics and Electrical.;Computer Science.
  • 学位 M.S.
  • 年度 2010
  • 页码 66 p.
  • 总页数 66
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:36:45

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号