首页>
外国专利>
Compiling a parallel loop with a complex access pattern for writing an array for GPU and CPU
Compiling a parallel loop with a complex access pattern for writing an array for GPU and CPU
展开▼
机译:编译具有复杂访问模式的并行循环以编写用于GPU和CPU的阵列
展开▼
页面导航
摘要
著录项
相似文献
摘要
Computer-implemented methods are provided for compiling a parallel loop and generating Graphics Processing Unit (GPU) code, and Central Processing Unit (CPU) code for writing an array for the CPU and the CPU. A method includes compiling the parallel loop by (i) checking, based on a range of array elements to be written, whether the parallel loop can update all of the array elements and (ii) checking whether an access order of the array elements that the parallel loop reads or writes is known at compilation time. The method further includes determining an approach, from among a plurality of available approaches, to generate the CPU code and the GPU code based on (i) the range of the array elements to be written and (ii) the access order to the array elements in the parallel loop.
展开▼