首页> 外文期刊>Very Large Scale Integration (VLSI) Systems, IEEE Transactions on >Multicore Signal Processing Platform With Heterogeneous Configurable Hardware Accelerators
【24h】

Multicore Signal Processing Platform With Heterogeneous Configurable Hardware Accelerators

机译:具有异构可配置硬件加速器的多核信号处理平台

获取原文
获取原文并翻译 | 示例
           

摘要

The computing demand of many signal processing algorithms is dramatically growing because of the increasing complexity of embedded software applications. Concurrently, as process technology scales, the design effort for realizing very large scale integrated circuits and the associated costs are becoming critically high. A possible solution to address this performance/costs challenge is given by customizable multiprocessor system-on-chips. The approach proposed in this paper leads to the customization of multi/many processor system-on-chip at two levels of abstraction: 1) customization through application-specific hardware accelerators implemented on configurable datapath that can target three kinds of structured application-specific integrated circuit technologies: metal, via, and runtime programmable and 2) customization of the architectural parameters of the platform. The proposed platform is equipped with a design framework that assists the user in the high-level design-space exploration of signal processing applications described using the Open Computing Language (OpenCL) language. A peculiar added value of the flow is to support the migration of OpenCL kernels and tasks into pipelined hardware accelerators described using a C-level language. The platform is able to provide an average performance of 90 GOPS on a set of reference signal processing applications, and an average computational energy efficiency of 130 GOPS/W in its metal-programmable configuration. This result shows the benefits in terms of energy efficiency of hardware customization applied to multiprocessor systems with respect to many core devices such as general-purpose graphic processing units, able to provide on average 2.5 GOPS/W for the applications under analysis.
机译:由于嵌入式软件应用程序的复杂性日益提高,许多信号处理算法的计算需求正在急剧增长。同时,随着处理技术的扩展,用于实现超大规模集成电路的设计工作和相关成本正变得越来越高。可定制的多处理器片上系统提供了解决此性能/成本挑战的可能解决方案。本文提出的方法导致在两个抽象级别上定制多/许多处理器片上系统:1)通过在可配置数据路径上实现的专用硬件加速器进行定制,该硬件加速器可以针对三种结构化的专用集成处理器。电路技术:金属,通孔和运行时可编程,以及2)定制平台的架构参数。所建议的平台配备有一个设计框架,该框架可帮助用户使用开放计算语言(OpenCL)语言描述的信号处理应用程序的高级设计空间探索。该流程的独特附加价值是支持将OpenCL内核和任务迁移到使用C级语言描述的流水线硬件加速器中。该平台能够在一组参考信号处理应用程序上提供90 GOPS的平均性能,并且在其金属可编程配置下的平均计算能效为130 GOPS / W。该结果表明,相对于许多核心设备(例如通用图形处理单元),应用于多处理器系统的硬件定制的能源效率方面具有优势,能够为所分析的应用程序平均提供2.5 GOPS / W。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号