首页> 外文会议>2012 IEEE 31st International Performance Computing and Communications Conference. >Parallelized benchmark-driven performance evaluation of SMPs and tiled multi-core architectures for embedded systems
【24h】

Parallelized benchmark-driven performance evaluation of SMPs and tiled multi-core architectures for embedded systems

机译:针对嵌入式系统的SMP和平铺多核架构的并行基准测试驱动的性能评估

获取原文
获取原文并翻译 | 示例

摘要

With Moore's law supplying billions of transistors on-chip, embedded systems are undergoing a transition from single-core to multi-core to exploit this high transistor density for high performance. However, there exists a plethora of multi-core architectures and the suitability of these multi-core architectures for different embedded domains (e.g., distributed, real-time, reliability-constrained) requires investigation. Despite the diversity of embedded domains, one of the critical applications in many embedded domains (especially distributed embedded domains) is information fusion. Furthermore, many other applications consist of various kernels, such as Gaussian elimination (used in network coding), that dominate the execution time. In this paper, we evaluate two embedded systems multi-core architectural paradigms: symmetric multiprocessors (SMPs) and tiled multi-core architectures (TMAs). We base our evaluation on a parallelized information fusion application and benchmarks that are used as building blocks in applications for SMPs and TMAs. We compare and analyze the performance of an Intel-based SMP and Tilera's TILEPro64 TMA based on our parallelized benchmarks for the following performance metrics: runtime, speedup, efficiency, cost, scalability, and performance per watt. Results reveal that TMAs are more suitable for applications requiring integer manipulation of data with little communication between the parallelized tasks (e.g., information fusion) whereas SMPs are more suitable for applications with floating point computations and a large amount of communication between processor cores.
机译:随着摩尔定律向芯片上提供数十亿个晶体管,嵌入式系统正经历从单核到多核的转变,以利用这种高晶体管密度实现高性能。但是,存在大量的多核体系结构,并且这些多核体系结构对于不同嵌入式域(例如,分布式,实时,可靠性受限)的适用性需要进行调查。尽管嵌入式域的多样性,但许多嵌入式域(尤其是分布式嵌入式域)中的关键应用之一是信息融合。此外,许多其他应用程序由各种内核组成,例如高斯消除(用于网络编码),这些内核决定了执行时间。在本文中,我们评估了两种嵌入式系统多核体系结构范例:对称多处理器(SMP)和平铺多核体系结构(TMA)。我们的评估基于并行信息融合应用程序和基准,这些基准被用作SMP和TMA应用程序的基础。我们基于以下性能指标的并行基准,比较并分析了基于Intel的SMP和Tilera的TILEPro64 TMA的性能:运行时,加速,效率,成本,可伸缩性和每瓦性能。结果表明,TMA更适用于需要对数据进行整数操作且并行任务之间的通信很少的应用(例如信息融合),而SMP更适用于具有浮点计算和处理器内核之间的大量通信的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号