首页> 外文期刊>Publications of the Astronomical Society of the Pacific >A GPU-based Real-time Software Correlation System for the Murchison Widefield Array Prototype
【24h】

A GPU-based Real-time Software Correlation System for the Murchison Widefield Array Prototype

机译:Murchison宽场阵列原型的基于GPU的实时软件相关系统

获取原文
获取原文并翻译 | 示例
           

摘要

Modern graphics processing units (GPUs) are inexpensive commodity hardware that offer Tflop/s theoretical computing capacity. GPUs are well suited to many compute-intensive tasks including digital signal processing. We describe the implementation and performance of a GPU-based digital correlator for radio astronomy. The correlator is implemented using the NVIDIA CUDA development environment. We evaluate three design options on two generations of NVIDIA hardware. The different designs utilize the internal registers, shared memory, and multiprocessors in different ways. We find that optimal performance is achieved with the design that minimizes global memory reads on recent generations of hardware. The GPU-based correlator outperforms a single-threaded CPU equivalent by a factor of 60 for a 32-antenna array, and runs on commodity PC hardware. The extra compute capability provided by the GPU maximizes the correlation capability of a PC while retaining the fast development time associated with using standard hardware, networking, and programming languages. In this way, a GPU-based correlation system represents a middle ground in design space between high performance, custom-built hardware, and pure CPU-based software correlation. The correlator was deployed at the Murchison Widefield Array 32-antenna prototype system where it ran in real time for extended periods. We briefly describe the data capture, stream_ing, and correlation system for the prototype array.
机译:现代图形处理单元(GPU)是便宜的商用硬件,可提供Tflop / s的理论计算能力。 GPU非常适合许多计算密集型任务,包括数字信号处理。我们描述了用于射电天文的基于GPU的数字相关器的实现和性能。相关器是使用NVIDIA CUDA开发环境实现的。我们评估了两代NVIDIA硬件上的三个设计选项。不同的设计以不同的方式利用内部寄存器,共享存储器和多处理器。我们发现,通过使最新一代硬件上的全局内存读取最小化的设计可以实现最佳性能。对于32天线阵列,基于GPU的相关器的性能优于单线程CPU等效器的60倍,并且可以在商用PC硬件上运行。 GPU提供的额外计算能力最大程度地提高了PC的关联能力,同时保留了与使用标准硬件,网络和编程语言相关的快速开发时间。这样,基于GPU的关联系统就代表了高性能,定制硬件与基于纯CPU的软件关联之间的设计空间。该相关器已部署在Murchison宽场阵列32天线原型系统中,并在其中长时间实时运行。我们简要描述了原型数组的数据捕获,stream_ing和相关系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号