首页> 外文期刊>IEEE Transactions on Image Processing >An efficient parallel-processing method for transposing large matrices in place
【24h】

An efficient parallel-processing method for transposing large matrices in place

机译:一种有效的将大型矩阵转置到位的并行处理方法

获取原文
获取原文并翻译 | 示例

摘要

We have developed an efficient algorithm for transposing large matrices in place. The algorithm is efficient because data are accessed either sequentially in blocks or randomly within blocks small enough to fit in cache, and because the same indexing calculations are shared among identical procedures operating on independent subsets of the data. This inherent parallelism makes the method well suited for a multiprocessor computing environment. The algorithm is easy to implement because the same two procedures are applied to the data in various groupings to carry out the complete transpose operation. Using only a single processor, we have demonstrated nearly an order of magnitude increase in speed over the previously published algorithm by Gate and Twigg (1977) for transposing a large rectangular matrix in place. With multiple processors operating in parallel, the processing speed increases almost linearly with the number of processors. A simplified version of the algorithm for square matrices is presented as well as an extension for matrices large enough to require virtual memory.
机译:我们已经开发出了一种有效的算法,可以将大型矩阵转置到位。该算法之所以有效,是因为可以按块顺序访问数据,也可以在足够小以适合高速缓存的块内随机访问数据,并且由于在独立于数据子集的相同过程之间共享相同的索引计算。这种固有的并行性使该方法非常适合于多处理器计算环境。该算法易于实现,因为将相同的两个过程应用于不同分组的数据以执行完整的转置操作。仅使用单个处理器,我们已经证明了比Gate和Twigg(1977)先前发布的用于将大型矩形矩阵转置到位的算法的速度提高了近一个数量级。当多个处理器并行运行时,处理速度几乎随处理器数量线性增加。提出了平方矩阵算法的简化版本,以及对足够大以需要虚拟内存的矩阵的扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号