首页> 外文会议>SPIE Medical Imaging Conference >A Data Colocation Grid Framework for Big Data Medical Image Processing - Backend Design

【24h】

A Data Colocation Grid Framework for Big Data Medical Image Processing - Backend Design

机译：大数据医学图像处理的数据托管网格框架-后端设计

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

When processing large medical imaging studies, adopting high performance grid computing resources rapidly becomes important. We recently presented a "medical image processing-as-a-service" grid framework that offers promise in utilizing the Apache Hadoop ecosystem and HBase for data colocation by moving computation close to medical image storage. However, the framework has not yet proven to be easy to use in a heterogeneous hardware environment. Furthermore, the system has not yet validated when considering variety of multi-level analysis in medical imaging. Our target design criteria are (1) improving the framework's performance in a heterogeneous cluster. (2) performing population based summary statistics on large datasets, and (3) introducing a table design scheme for rapid NoSQL query. In this paper, we present a heuristic backend interface application program interface (API) design for Hadoop & HBase for Medical Image Processing (HadoopBase-MIP). The API includes: Upload, Retrieve, Remove, Load balancer (for heterogeneous cluster) and MapReduce templates. A dataset summary statistic model is discussed and implemented by MapReduce paradigm. We introduce a HBase table scheme for fast data query to better utilize the MapReduce model. Briefly, 5153 T1 images were retrieved from a university secure, shared web database and used to empirically access an in-house grid with 224 heterogeneous CPU cores. Three empirical experiments results are presented and discussed: (1) load balancer wall-time improvement of 1.5-fold compared with a framework with built-in data allocation strategy, (2) a summary statistic model is empirically verified on grid framework and is compared with the cluster when deployed with a standard Sun Grid Engine (SGE), which reduces 8-fold of wall clock time and 14-fold of resource time, and (3) the proposed HBase table scheme improves MapReduce computation with 7 fold reduction of wall time compare with a naive scheme when datasets are relative small. The source code and interfaces have been made publicly available.

机译：在处理大型医学成像研究时，迅速采用高性能网格计算资源变得很重要。我们最近提出了一个“医学图像处理即服务”网格框架，该框架通过将计算移至医学图像存储附近，为利用Apache Hadoop生态系统和HBase进行数据托管提供了希望。但是，尚未证明该框架易于在异构硬件环境中使用。此外，当考虑医学成像中的各种多级分析时，该系统尚未通过验证。我们的目标设计标准是（1）改善异构集群中框架的性能。（2）对大型数据集执行基于总体的汇总统计信息，以及（3）介绍用于快速NoSQL查询的表设计方案。在本文中，我们提出了用于Hadoop和医学图像处理的HBase（HadoopBase-MIP）的启发式后端接口应用程序接口（API）设计。该API包括：上传，检索，删除，负载均衡器（用于异构集群）和MapReduce模板。 MapReduce范例讨论并实现了数据集摘要统计模型。我们介绍了一种用于快速数据查询的HBase表方案，以更好地利用MapReduce模型。简要地说，从大学安全的共享Web数据库中检索了5153张T1图像，并用于凭经验访问具有224个异构CPU内核的内部网格。提出并讨论了三个实验结果：（1）与具有内置数据分配策略的框架相比，负载均衡器的墙时间缩短了1.5倍;（2）在网格框架上通过经验验证了摘要统计模型并进行了比较与群集一起使用标准的Sun Grid Engine（SGE）进行部署时，可以减少8倍的挂钟时间和14倍的资源时间，并且（3）提出的HBase表方案通过减少7倍的墙壁减少了MapReduce的计算当数据集相对较小时，将时间与天真方案进行比较。源代码和接口已公开可用。

著录项

来源
《SPIE Medical Imaging Conference》|2018年|105790A.1-105790A.10|共10页
会议地点
作者
Shunxing Bao; Yuankai Huo; Prasanna Parvathaneni; Andrew J. Plassard; Camilo Bermudez; Yuang Yao; Ilwoo Lyu; Aniruddha Gokhale; Bennett A. Landman;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Grid-free interactive and automated data processing for MR chemical shift imaging data [J] . Le Fur Y., Nicoli F., Guye M., Magma: Magnetic resonance materials in physics, biology, and medicine . 2010,第1期

机译：MR化学位移成像数据的无网格交互式自动数据处理
2. Erratum: Grid-free interactive and automated data processing for MR chemical shift imaging data (Magnetic Resonance Materials in Physics, Biology and Medicine (2010) 23 (23-30) DOI: 10.1007/s10334-009-0186-y) [J] . Le Fur Y., Nicoli F., Guye M., Magma: Magnetic resonance materials in physics, biology, and medicine . 2010,第2期

机译：勘误表：用于MR化学位移成像数据的无网格交互式自动数据处理（物理，生物学和医学磁共振材料（2010）23（23-30）DOI：10.1007 / s10334-009-0186-y）
3. Grid-free interactive and automated data processing for MR chemical shift imaging data [J] . Yann Le Fur, François Nicoli, Maxime Guye, Magnetic Resonance Materials in Physics, Biology and Medicine . 2010,第1期

机译：用于MR化学位移成像数据的无网格交互式自动数据处理
4. A Data Colocation Grid Framework for Big Data Medical Image Processing - Backend Design [C] . Shunxing Bao, Yuankai Huo, Prasanna Parvathaneni, SPIE Medical Imaging Conference . 2018

机译：大数据医学图像处理的数据分配网格框架 - 后端设计
5. A framework for tumor segmentation and interactive immersive visualization of medical image data for surgical planning. [D] . Foo, Jung Leng. 2008

机译：肿瘤分割和医学图像数据的交互式沉浸式可视化框架，用于外科手术计划。
6. A Data Colocation Grid Framework for Big Data Medical Image Processing: Backend Design [O] . Shunxing Bao, Yuankai Huo, Prasanna Parvathaneni, -1

机译：大数据医学图像处理的数据托管网格框架：后端设计
7. A Data Colocation Grid Framework for Big Data Medical Image Processing - Backend Design [O] . Bao, Shunxing, Huo, Yuankai, Parvathaneni, Prasanna, 2017

机译：用于大数据医学图像处理的数据共置网格框架 - 后端设计
8. Design and Implementation of A Backend Multiple-Processor Relational Data Base Computer System [R] . Fonden, R. W. 1981

机译：后端多处理器关系数据库计算机系统的设计与实现

A Data Colocation Grid Framework for Big Data Medical Image Processing - Backend Design

摘要

著录项

相似文献

相关主题

期刊订阅