首页> 外文会议>IEEE International Conference on Embedded and Real-Time Computing Systems and Applications >An evaluation of code generation of dataflow languages on manycore architectures
【24h】

An evaluation of code generation of dataflow languages on manycore architectures

机译:对许多核心架构上数据流语言的代码生成的评估

获取原文

摘要

Today computer architectures are shifting from single core to manycores due to several reasons such as performance demands, power and heat limitations. However, shifting to manycores results in additional complexities, especially with regard to efficient development of applications. Hence there is a need to raise the abstraction level of development techniques for the manycores while exposing the inherent parallelism in the applications. One promising class of programming languages is dataflow languages and in this paper we evaluate and optimize the code generation for one such language, CAL. We have also developed a communication library to support the intercore communication. The code generation can target multiple architectures, but the results presented in this paper is focused on Adapteva's many core architecture Epiphany. We use the two-dimensional inverse discrete cosine transform (2D-IDCT) as our benchmark and compare our code generation from CAL with a hand-written implementation developed in C. Several optimizations in the code generation as well as in the communication library are described, and we have observed that the most critical optimization is reducing the number of external memory accesses. Combining all optimizations we have been able to reduce the difference in execution time between auto-generated and handwritten implementations from a factor of 4.3× down to a factor of only 1.3×.
机译:如今,由于多种原因(例如性能要求,功耗和散热限制),计算机体系结构已从单核转变为多核。但是,转移到多核会导致额外的复杂性,尤其是在有效开发应用程序方面。因此,有必要提高许多内核的开发技术的抽象水平,同时公开应用程序中固有的并行性。数据流语言是一类很有前途的编程语言,在本文中,我们评估并优化了一种此类语言CAL的代码生成。我们还开发了一个通信库来支持内核间通信。代码生成可以针对多种体系结构,但是本文中介绍的结果集中于Adapteva的许多核心体系结构Epiphany。我们使用二维逆离散余弦变换(2D-IDCT)作为基准,并将我们从CAL生成的代码与用C语言开发的手写实现进行比较。描述了代码生成以及通信库中的一些优化,并且我们已经观察到最关键的优化是减少外部存储器访问的数量。结合所有优化,我们已经能够将自动生成和手写实现之间的执行时间差异从4.3倍降低到1.3倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号