首页> 外文OA文献 >ADAPT : architectural and design exploration for application specific instruction-set processor technologies
【2h】

ADAPT : architectural and design exploration for application specific instruction-set processor technologies

机译:ADAPT:针对专用指令集处理器技术的体系结构和设计探索

摘要

This thesis presents design automation methodologies for extensible processor platforms in application specific domains. The work presents first a single processor approach for customization; a methodology that can rapidly create different processor configurations by the removal of unused instructions sets from the architecture. A profile directed approach is used to identify frequently used instructions and to eliminate unused opcodes from the available instruction pool.A coprocessor approach is next explored to create an SoC (System-on-Chip) to speedup the application while reducing energy consumption. Loops in applications are identified and accelerated by tightly coupling a coprocessor to an ASIP (Application Specific Instruction-set Processor). Latency hiding is used to exploit the parallelism provided by this architecture. A case study has been performed on a JPEG encoding algorithm; comparing two different coprocessor approaches: a high-level synthesis approach and our custom coprocessor approach.The thesis concludes by introducing a heterogenous multi-processor system using ASIPs as processing entities in a pipeline configuration. The problem of mapping each algorithmic stage in the system to an ASIP configuration is formulated. We proposed an estimation technique to calculate runtimes of the configured multiprocessor system without running cycle-accurate simulations, which could take a significant amount of time. We present two heuristics to efficiently search the design space of a pipeline-based multi ASIP system and compare the results against an exhaustive approach.In our first approach, we show that, on average, processor size can be reduced by 30%, energy consumption by 24%, while performance is improved by 24%. In the coprocessor approach, compared with the use of a main processor alone, a loop performance improvement of 2.57x is achieved using the custom coprocessor approach, as against 1.58x for the high level synthesis method, and 1.33x for the customized instruction approach. Energy savings are 57%, 28% and 19%, respectively. Our multiprocessor design provides a performance improvement of at least 4.03x for JPEG and 3.31x for MP3, for a single processor design system. The minimum cost obtained using our heuristic was within 0.43% and 0.29% of the optimum values for the JPEG and MP3 benchmarks respectively.
机译:本文提出了专用领域中可扩展处理器平台的设计自动化方法。这项工作首先提出了一种用于定制的单处理器方法。一种可以通过从体系结构中删除未使用的指令集来快速创建不同处理器配置的方法。使用基于配置文件的方法来识别常用的指令并从可用的指令池中消除未使用的操作码。接下来,将探索一种协处理器方法来创建SoC(片上系统)以加快应用程序的速度,同时降低能耗。通过将协处理器紧密耦合到ASIP(专用指令集处理器),可以识别和加速应用程序中的循环。延迟隐藏用于利用此体系结构提供的并行性。已对JPEG编码算法进行了案例研究;通过比较两种不同的协处理器方法:高级综合方法​​和我们的定制协处理器方法。本文的结论是通过介绍使用ASIP作为管道配置中的处理实体的异构多处理器系统来结束的。提出了将系统中的每个算法阶段映射到ASIP配置的问题。我们提出了一种估算技术,可以在不运行周期精确的仿真的情况下计算配置的多处理器系统的运行时间,这可能会花费大量时间。我们提出两种启发式方法,以有效地搜索基于管道的多ASIP系统的设计空间并将结果与​​详尽的方法进行比较。在我们的第一种方法中,我们表明,平均而言,处理器尺寸可以减少30%,能耗性能提高了24%,而性能提高了24%。在协处理器方法中,与单独使用主处理器相比,使用定制协处理器方法可将循环性能提高2.57倍,而高级综合方法​​为1.58x,定制指令方法为1.33x。节能分别为57%,28%和19%。对于单处理器设计系统,我们的多处理器设计将JPEG的性能至少提高了4.03倍,将MP3的性能提高了3.31倍。使用我们的启发式方法获得的最低成本分别在JPEG和MP3基准最佳值的0.43%和0.29%之内。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号