首页> 外文期刊>Parallel and Distributed Systems, IEEE Transactions on >Quantifying Intrinsic Parallelism Using Linear Algebra for Algorithm/Architecture Coexploration
【24h】

Quantifying Intrinsic Parallelism Using Linear Algebra for Algorithm/Architecture Coexploration

机译:使用线性代数量化固有并行性以进行算法/架构协同开发

获取原文
获取原文并翻译 | 示例
           

摘要

Degree of parallelism (DoP) is an essential complexity metric that characterizes the number of independent operation sets (IOSs) that can be concurrently executed within an algorithm. This paper presents a generic framework to identify IOSs and to quantify the DoP based on rank theorem in linear algebra. This framework is applied to extract algorithmic parallelisms at various granularities, namely, multigrain parallelism. Our parallelism is intrinsic and platform independent and can provide insights into architectural information, thus facilitating mapping onto generic platforms and early back annotation for modifying algorithms. It plays a significant role in the concurrent optimization of both algorithms and architectures, referred to as Algorithm/Architecture Coexploration (AAC), by trading off between the DoP and the number of operations (NoO). This paper reports three case studies for AAC. The case study on an IDCT reveals that our framework accurately quantifies the parallelism for mapping the algorithm onto generic platforms, including FPGA and multicore systems. The IDCT parallelized by our technique surpasses a conventional spectral parallelization. By exploiting fine-grain parallelism, this paper presents a better porting of a discrete wavelet transform (DWT) onto single instruction multiple data (SIMD) machines compared with a commercial compiler. A high-quality deinterlacer is implemented on a low-cost multicore platform for real-time high-definition applications by analyzing the multigrain parallelism. These case studies reveal the effectiveness of our parallel analysis framework which is applicable to generic systems. Compared with traditional graph traversal techniques, our linear algebraic approach impressively features low complexity and is practical for complicated algorithms.
机译:并行度(DoP)是一项重要的复杂性指标,它描述了可以在算法内同时执行的独立操作集(IOS)的数量。本文提出了一种基于线性代数中的秩定理来识别IOS和量化DoP的通用框架。该框架适用于以各种粒度提取算法并行性,即多粒度并行性。我们的并行性是固有的,并且与平台无关,并且可以提供对体系结构信息的见解,从而便于映射到通用平台和早期回注以修改算法。通过在DoP和操作数(NoO)之间进行权衡,它在同时优化算法和体系结构(称为算法/体系结构共同开发(AAC))中发挥着重要作用。本文报告了AAC的三个案例研究。在IDCT上进行的案例研究表明,我们的框架准确地量化了将算法映射到通用平台(包括FPGA和多核系统)的并行度。通过我们的技术并行化的IDCT超过了常规的频谱并行化。通过利用细粒度并行性,与商用编译器相比,本文提出了将离散小波变换(DWT)更好地移植到单指令多数据(SIMD)机器上的方法。通过分析多颗粒并行性,可以在低成本多核平台上为实时高清应用实现高质量的去隔行器。这些案例研究揭示了适用于通用系统的并行分析框架的有效性。与传统的图遍历技术相比,我们的线性代数方法具有较低的复杂度,并且对于复杂的算法非常实用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号