首页> 外国专利> EFFICIENT IMPLEMENTATION OF RSA USING GPU/CPU ARCHITECTURE

EFFICIENT IMPLEMENTATION OF RSA USING GPU/CPU ARCHITECTURE

机译:使用GPU / CPU架构高效实现RSA

摘要

Various embodiments are directed to a heterogeneous processor architecture comprised of a CPU and a GPU on the same processor die. The heterogeneous processor architecture may optimize source code in a GPU compiler using vector strip mining to reduce instructions of arbitrary vector lengths into GPU supported vector lengths and loop peeling. It may be first determined that the source code is eligible for optimization if more than one machine code instruction of compiled source code under-utilizes GPU instruction bandwidth limitations. The initial vector strip mining results may be discarded and the first iteration of the inner loop body may be peeled out of the loop. The type of operands in the source code may be lowered and the peeled out inner loop body of source code may be vector strip mined again to obtain optimized source code.
机译:各种实施例针对一种异构处理器架构,该异构处理器架构包括同一处理器裸片上的CPU和GPU。异构处理器体系结构可以使用向量条挖掘在GPU编译器中优化源代码,以将任意向量长度的指令减少为GPU支持的向量长度和循环剥离。如果一个以上的已编译源代码的机器代码指令未充分利用GPU指令带宽限制,则可以首先确定该源代码符合优化条件。可以丢弃初始向量条的挖掘结果,并且可以将内部循环主体的第一次迭代从循环中剥离。可以降低源代码中的操作数的类型,并且可以再次剥离向量源中剥离的源代码的内部循环主体以获取优化的源代码。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号