Efficient Execution of Microscopy Image Analysis on CPU, GPU, and MIC Equipped Cluster Systems

机译：在配备CPU，GPU和MIC的群集系统上高效执行显微镜图像分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

High performance computing is experiencing a major paradigm shift with the introduction of accelerators, such as graphics processing units (GPUs) and Intel Xeon Phi (MIC). These processors have made available a tremendous computing power at low cost, and are transforming machines into hybrid systems equipped with CPUs and accelerators. Although these systems can deliver a very high peak performance, making full use of its resources in real-world applications is a complex problem. Most current applications deployed to these machines are still being executed in a single processor, leaving other devices underutilized. In this paper we explore a scenario in which applications are composed of hierarchical dataflow tasks which are allocated to nodes of a distributed memory machine in coarse-grain, but each of them may be composed of several finer-grain tasks which can be allocated to different devices within the node. We propose and implement novel performance aware scheduling techniques that can be used to allocate tasks to devices. We evaluate our techniques using a pathology image analysis application used to investigate brain cancer morphology, and our experimental evaluation shows that the proposed scheduling strategies significantly outperforms other efficient scheduling techniques, such as Heterogeneous Earliest Finish Time - HEFT, in cooperative executions using CPUs, GPUs, and Masc. also experimentally show that our strategies are less sensitive to inaccuracy in the scheduling input data and that the performance gains are maintained as the application scales.

机译：高性能计算正在经历重大的范式转变，其中引入了加速器，例如图形处理单元（GPU）和英特尔至强融核（MIC）。这些处理器以低成本提供了巨大的计算能力，并且正在将机器转变为配备有CPU和加速器的混合系统。尽管这些系统可以提供非常高的峰值性能，但是在实际应用中充分利用其资源却是一个复杂的问题。部署到这些计算机上的大多数当前应用程序仍在单个处理器中执行，从而导致其他设备的利用不足。在本文中，我们探讨了一种方案，其中应用程序由分层数据流任务组成，这些任务以粗粒度分配给分布式存储计算机的节点，但是每个应用程序可能由几个细粒度任务组成，这些细粒度任务可以分配给不同的任务节点内的设备。我们提出并实现了可用于将任务分配给设备的新颖的性能感知调度技术。我们使用用于研究脑癌形态的病理图像分析应用程序评估了我们的技术，我们的实验评估表明，在使用CPU，GPU的协作执行中，所提出的调度策略明显优于其他有效的调度技术，例如异构最早完成时间-HEFT和Masc。还通过实验表明，我们的策略对调度输入数据中的不准确性较不敏感，并且随着应用程序的扩展，性能提升得以保持。

著录项

来源
《International symposium on computer architecture and high performance computing》|2014年|89-96|共8页
会议地点
作者
Andrade G.; Ferreira R.; Teodoro G.; Rocha L.; Saltz J.H.; Kurc T.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
brain; cancer; distributed memory systems; graphics processing units; medical image processing; parallel processing; scheduling; CPU equipped cluster systems; GPU equipped cluster systems; Intel Xeon Phi; MIC equipped cluster systems; Masc; accelerators; brain cancer morphology; distributed memory machine; graphics processing units; hierarchical dataflow tasks; high performance computing; microscopy image analysis; pathology image analysis application; performance aware scheduling techniques; task allocation; Central Processing Unit; Graphics processing units; Image analysis; Microwave integrated circuits; Performance evaluation; Processor scheduling;

机译：大脑;癌症;分布式存储系统;图形处理单元;医学图像处理;并行处理;调度;配备CPU的群集系统;配备GPU的群集系统;英特尔至强融核;配备MIC的群集系统; Masc;加速器;脑癌形态学;分布式内存机器;图形处理单元;分层数据流任务;高性能计算;显微镜图像分析;病理图像分析应用程序;性能感知调度技术;任务分配;中央处理单元;图形处理单元;图像分析;微波集成电路;性能评估;处理器排程;

相似文献

外文文献
中文文献
专利

1. Application performance analysis and efficient execution on systems with multi-core CPUs, GPUs and MICs: a case study with microscopy image analysis [J] . Teodoro George, Kurc Tahsin, Andrade Guilherme, International Journal of High Performance Computing Applications . 2017,第1期

机译：在具有多核CPU，GPU和MIC的系统上的应用程序性能分析和高效执行：带有显微镜图像分析的案例研究
2. An efficient implementation of 3D high-resolution imaging for large-scale seismic data with GPU/CPU heterogeneous parallel computing [J] . Xu Jincheng, Liu Wei, Wang Jin, Computers & geosciences . 2018,第feba期

机译：GPU / CPU异构并行计算可有效实现大规模地震数据的3D高分辨率成像
3. Quantitative analysis of spectroscopic low energy electron microscopy data: High-dynamic range imaging, drift correction and cluster analysis [J] . Ultramicroscopy . 2020,第期

机译：光谱低能电子显微镜数据的定量分析：高动态范围成像，漂移校正和聚类分析
4. Efficient Execution of Microscopy Image Analysis on CPU, GPU, and MIC Equipped Cluster Systems [C] . Andrade G., Ferreira R., Teodoro G., International symposium on computer architecture and high performance computing . 2014

机译：高效执行CPU，GPU和CPU和MIC的集群系统的显微镜图像分析
5. Shape dependent image processing tools for analysis of adhering medulloblastoma cells in custom designed microscopy system. [D] . Weissbach, Frederick. 2013

机译：形状相关的图像处理工具，用于在定制设计的显微镜系统中分析粘附的成髓细胞瘤细胞。
6. Efficient Execution of Microscopy Image Analysis on CPU GPU and MIC Equipped Cluster Systems [O] . G. Andrade, R. Ferreira, George Teodoro, -1

机译：在配备CPUGPU和MIC的群集系统上高效执行显微镜图像分析
7. High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster Platforms [O] . George Teodoro, Tony Pan, Tahsin M. Kurc, 2013

机译：CPU-GPU集群平台大显微镜图像数据集的高通量分析
8. Fast Methods for Analysis of Neurotransmitters from Single Cell and Monitoring Their Releases in Central Nervous System by Capillary Electrophoresis, Fluorescence Microscopy and Luminescence Imaging [R] . Wang, Z. 1999

机译：通过毛细管电泳，荧光显微镜和发光成像快速分析单个细胞的神经递质并监测其在中枢神经系统中的释放

Efficient Execution of Microscopy Image Analysis on CPU, GPU, and MIC Equipped Cluster Systems

摘要

著录项

相似文献

相关主题

期刊订阅