首页> 外文期刊>Journal of electrical and computer engineering >Performance Analysis of Homogeneous On-Chip Large-Scale Parallel Computing Architectures for Data-Parallel Applications
【24h】

Performance Analysis of Homogeneous On-Chip Large-Scale Parallel Computing Architectures for Data-Parallel Applications

机译:数据并行应用程序的同类片上大规模并行计算体系结构的性能分析

获取原文
获取原文并翻译 | 示例
           

摘要

On-chip computing platforms are evolving from single-core bus-based systems to many-core network-based systems, which are referred to as On-chip Large-scale Parallel Computing Architectures (OLPCs) in the paper. Homogenous OLPCs feature strong regularity and scalability due to its identical cores and routers. Data-parallel applications have their parallel data subsets that are handled individually by the same program running in different cores. Therefore, data-parallel applications are able to obtain good speedup in homogenous OLPCs. The paper addresses modeling the speedup performance of homogeneous OLPCs for data-parallel applications. When establishing the speedup performance model, the network communication latency and the ways of storing data of data-parallel applications are modeled and analyzed in detail. Two abstract concepts (equivalent serial packet and equivalent serial communication) are proposed to construct the network communication latency model. The uniform and hotspot traffic models are adopted to reflect the ways of storing data. Some useful suggestions are presented during the performance model's analysis. Finally, three data-parallel applications are performed on our cycle-accurate homogenous OLPC experimental platform to validate the analytic results and demonstrate that our study provides a feasible way to estimate and evaluate the performance of data-parallel applications onto homogenous OLPCs.
机译:片上计算平台正在从基于单核总线的系统发展到基于多核网络的系统,在本文中被称为片上大规模并行计算架构(OLPC)。均质的OLPC具有相同的内核和路由器,因此具有强大的规则性和可伸缩性。数据并行应用程序具有其并行数据子集,这些子集由运行在不同内核中的同一程序分别处理。因此,数据并行应用程序能够在同类OLPC中获得良好的加速。该论文致力于为数据并行应用建模同类OLPC的加速性能。在建立加速性能模型时,对网络通信延迟和数据并行应用程序的数据存储方式进行了建模和详细分析。提出了两个抽象的概念(等效串行包和等效串行通信)来构建网络通信等待时间模型。采用统一和热点流量模型来反映存储数据的方式。在性能模型的分析过程中,提出了一些有用的建议。最后,在周期精确的均相OLPC实验平台上执行了三个数据并行应用程序以验证分析结果,并证明我们的研究提供了一种可行的方法来估计和评估均质OLPC上数据并行应用程序的性能。

著录项

  • 来源
    《Journal of electrical and computer engineering》 |2015年第2015期|902591.1-902591.20|共20页
  • 作者单位

    College of Computer, National University of Defense Technology, Changsha, Hunan 410073, China,Department of Electronic Systems, KTH-Royal Institute of Technology, Kista, 16440 Stockholm, Sweden;

    Department of Electronic Systems, KTH-Royal Institute of Technology, Kista, 16440 Stockholm, Sweden;

    Institute of Computer Technology, Vienna University of Technology, 1040 Vienna, Austria;

    College of Computer, National University of Defense Technology, Changsha, Hunan 410073, China;

    College of Computer, National University of Defense Technology, Changsha, Hunan 410073, China;

    College of Computer, National University of Defense Technology, Changsha, Hunan 410073, China;

    College of Computer, National University of Defense Technology, Changsha, Hunan 410073, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号