...
首页> 外文期刊>Concurrency, practice and experience >A comparative study of an X-ray tomography reconstruction algorithm in accelerated and cloud computing systems
【24h】

A comparative study of an X-ray tomography reconstruction algorithm in accelerated and cloud computing systems

机译:加速和云计算系统中X射线断层扫描重建算法的比较研究

获取原文
获取原文并翻译 | 示例

摘要

With the increase of resolution in medical image scanners and the need of faster reconstruction methods,rnnew ways of exploiting the inherent parallelism of reconstruction algorithms have arisen. In this paper, wernpresent Mangoose++, an application to perform X-ray computed tomography that supports multiple gradesrnof parallelism. This parallelism is tackled with two different approaches: the usage of parallel nodes withrnmulticore CPUs in a cloud environment and the usage of high-performance computing (HPC)-based parallelrnarchitectures such as general-purpose computing on graphics processing unit (GPGPU) or Intel Xeon Phi.rnIn this paper, we show the design and implementation of the application in three types of platforms relatedrnto the previous mentioned approaches, comparing and analyzing the performance, resource utilization, andrnscalability of each platform. Accelerators offer high performance for data sizes that fit inside the acceleratorrnmemory. This is the main advantage of Intel Xeon Phi that, in this work, obtains similar performance resultsrnthan compute unified device architecture (CUDA)-based GPGPU versions, comparing with cards with lessrnmemory capacity. In our evaluation experiments, we additionally analyze and discuss the costs and efficiencyrnof Mangoose++ over Amazon Compute Cloud platform, demonstrating that lower times can be achieved inrna reasonable price compared with owned HPC-based hardware. A comparison between distinct hardwarernconfigurations is provided for emphasizing on the advantages and disadvantages of each one.
机译:随着医学图像扫描仪分辨率的提高以及对更快的重建方法的需求,出现了利用重建算法的固有并行性的新方法。在本文中,我们介绍了Mangoose ++,这是一种用于执行X射线计算机断层扫描的应用程序,它支持多级srnof并行性。通过两种不同的方法解决了这种并行性:在云环境中使用具有多核CPU的并行节点,以及使用基于高性能计算(HPC)的并行体系结构,例如图形处理单元(GPGPU)或Intel Xeon上的通用计算在本文中,我们展示了在与上述方法相关的三种类型的平台上应用程序的设计和实现,比较并分析了每个平台的性能,资源利用率和可伸缩性。加速器可为加速器内存中适合的数据大小提供高性能。这是Intel Xeon Phi的主要优点,与内存容量较小的卡相比,它在工作中可获得比基于计算统一设备架构(CUDA)的GPGPU版本相似的性能结果。在我们的评估实验中,我们还分析和讨论了Amazon Compute Cloud平台上的成本和效率Mangoose ++,证明与拥有的基于HPC的硬件相比,可以合理的价格实现更短的时间。提供了不同硬件配置之间的比较,以强调每个硬件的优点和缺点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号