Journal: 《计算机工程与科学》 (Computer Engineering & Science)

Implementation of QTRSM for ARMv8 64-bit multi-core processors

         

Abstract

We implement a quad-precision triangular matrix solve with multiple right-hand sides (QTRSM) on an ARMv8 64-bit multi-core processor, based on OpenBLAS. QTRSM is implemented on two data formats. The first implementation relies on the GCC compiler's support for the long double data type as a quad-precision floating-point type. The second uses the double-double data format together with its corresponding quad-precision addition, subtraction, multiplication, and division algorithms. Taking the long double QTRSM as the baseline, we compare the accuracy and run time of the double-double QTRSM across different matrix sizes. Experimental results show that the two implementations produce numerical results of approximately the same accuracy, while the double-double QTRSM delivers on average 1.6 times the performance of the long double QTRSM. As the number of threads increases, the speedup of both QTRSM implementations is close to 2.0, indicating good scalability.
