Journal: Concurrency and Computation: Practice and Experience

Tsunami: massively parallel homomorphic hashing on many-core GPUs

Abstract

Homomorphic hash functions play a key role in securing distributed systems that use coding techniques such as erasure coding and network coding. Their computational complexity, however, remains a major challenge. In this paper, we present Tsunami, a massively parallel solution that exploits widely available many-core graphics processing units (GPUs). Tsunami includes the following optimization techniques to achieve the highest hashing throughput to date: (1) using Montgomery multiplication and precomputation to speed up modular exponentiations; (2) using a clean implementation of Montgomery multiplication to reduce the demand for registers and shared memory and to increase the utilization of GPU processing cores; (3) using our own assembly code to implement 32-bit integer multiplication, which outperforms the assembly code generated by the native compiler by 20%; and (4) exploiting memory alignment and constant memory on GPUs to improve the efficiency of memory access. Integrating these techniques, Tsunami achieves a significant improvement over existing results. Specifically, the hashing throughput achieved by Tsunami on a GTX295 GPU (NVIDIA, Santa Clara, CA, US) is about 33 times that of the existing solution on a quad-core CPU. We also show that the hashing throughput grows almost linearly with the number of GPU cores.
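
To make the two core ideas in the abstract concrete, the following is a minimal CPU-side sketch in Python. It is not Tsunami's GPU code. It assumes a hash of the common form h(b) = g_1^{b_1} · ... · g_m^{b_m} mod p, which the abstract itself does not spell out, and it uses toy parameters (`P`, `Q`, `G`, `R_BITS`) and function names that are all hypothetical and far too small to be secure. The modular exponentiations are carried out with Montgomery multiplication; the precomputation, assembly-level multiplication, and memory optimizations named in the abstract are omitted.

```python
# Toy parameters, chosen only for illustration (far too small to be secure).
P = 1019                # prime modulus; P - 1 = 2 * Q
Q = 509                 # prime order of the subgroup of squares mod P
G = [4, 9, 25, 49]      # hypothetical generators: squares != 1, so each has order Q

R_BITS = 10             # Montgomery radix R = 2**10 = 1024 > P
R = 1 << R_BITS
R_MASK = R - 1
P_NEG_INV = (-pow(P, -1, R)) % R   # P * P_NEG_INV ≡ -1 (mod R); needs Python 3.8+


def mont_mul(a, b):
    """Montgomery product: returns a * b * R^-1 mod P for 0 <= a, b < P."""
    t = a * b
    m = ((t & R_MASK) * P_NEG_INV) & R_MASK   # makes t + m*P divisible by R
    u = (t + m * P) >> R_BITS                 # exact division by R via a shift
    return u - P if u >= P else u


def mont_pow(base, exp):
    """Compute base**exp mod P by square-and-multiply in Montgomery form."""
    x = (base * R) % P      # base converted to Montgomery form
    acc = R % P             # the value 1 in Montgomery form
    while exp:
        if exp & 1:
            acc = mont_mul(acc, x)
        x = mont_mul(x, x)
        exp >>= 1
    return mont_mul(acc, 1)  # convert back to the ordinary representation


def homomorphic_hash(block):
    """h(block) = product of G[i] ** block[i] mod P, with block[i] in [0, Q)."""
    h = 1
    for g, b in zip(G, block):
        h = (h * mont_pow(g, b)) % P
    return h


def add_blocks(a, b):
    """Element-wise sum mod Q, standing in for a coded combination of blocks."""
    return [(x + y) % Q for x, y in zip(a, b)]


a = [3, 141, 59, 26]
b = [500, 35, 89, 79]
combined = add_blocks(a, b)
# Homomorphic property: the hash of the sum equals the product of the hashes.
print(homomorphic_hash(combined) == (homomorphic_hash(a) * homomorphic_hash(b)) % P)
```

The final line prints `True`: a receiver can check a coded block against the hashes of the original blocks without decoding it first. In `mont_mul`, the reduction needs only multiplications, a mask, and a shift rather than a division by `P`, which is the usual reason Montgomery multiplication suits repeated modular products.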
