首页> 外文期刊>Journal of Real-Time Image Processing >Hardware-software optimizations of reconfigurable multi-core processors for floating-point computations of large sparse matrices
【24h】

Hardware-software optimizations of reconfigurable multi-core processors for floating-point computations of large sparse matrices

机译:可重构多核处理器的硬件-软件优化,用于大型稀疏矩阵的浮点计算

获取原文
获取原文并翻译 | 示例
       

摘要

State-of-the-art field-programmable gate array (FPGA) technologies have provided exciting opportunities to develop more flexible, less expensive, and better performance floating-point computing platforms for embedded systems. To better harness the full power of FPGAs and to bring FPGAs to more system designers, we investigate unique advantages and optimization opportunities in both software and hardware offered by multi-core processors on a programmable chip (MPoPCs). In this paper, we present our hardware customization and software dynamic scheduling solutions for LU factorization of large sparse matrices on in-house developed MPoPCs. Theoretical analysis is provided to guide the design. Implementation results on an Altera Stratix III FPGA for five benchmark matrices of size up to 7,917 × 7,917 are presented. Our hardware customization alone can reduce the execution time by up to 17.22 %. The integrated hardware-software optimization improves the speedup by an average of 60.30 %.
机译:最先进的现场可编程门阵列(FPGA)技术为开发用于嵌入式系统的更灵活,更便宜,性能更好的浮点计算平台提供了令人兴奋的机会。为了更好地利用FPGA的全部功能并将FPGA带给更多的系统设计人员,我们研究了可编程芯片(MPoPC)上的多核处理器在软件和硬件方面的独特优势和优化机会。在本文中,我们介绍了针对内部开发的MPoPC上的大型稀疏矩阵的LU分解的硬件定制和软件动态调度解决方案。提供理论分析以指导设计。给出了在Altera Stratix III FPGA上实现的五个基准矩阵的实现结果,这些矩阵的大小最大为7,917×7,917。仅我们的硬件定制就可以减少高达17.22%的执行时间。集成的硬件-软件优化使速度平均提高了60.30%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号