首页> 外文会议>2016 Future Technologies Conference >Big data application in functional magnetic resonance imaging using apache spark
【24h】

Big data application in functional magnetic resonance imaging using apache spark

机译:大数据在基于Apache Spark的功能磁共振成像中的应用

获取原文
获取原文并翻译 | 示例

摘要

Recently, big data applications have been rapidly expanding into various industries. Healthcare is among those industries that are willing to use big data platforms, and as a result, some large data analytics tools have been adopted in this field. Medical imaging, which is a pillar in diagnostic healthcare, involves a high volume of data collection and processing. A massive number of 3D and 4D images are acquired in different forms and resolutions using a variety of medical imaging modalities. Preprocessing and analysis of imaging data is currently a costly and time-consuming process. However, few big data platforms have been created or modified for medical imaging purposes because of certain restrictions, such as data format. In this paper, we designed, developed and successfully tested a new pipeline for medical imaging data (in particular, functional magnetic resonance imaging - fMRI) using the Big Data Spark / PySpark platform on a single node, which allowed us to read and load imaging data, convert the data to Resilient Distributed Datasets in order to manipulate and perform in-memory data processing in parallel, and convert final results to an imaging format. Additionally, the pipeline provides an option to store the results in other formats, such as data frames. Using this new pipeline, we repeated our previous work, in which we extracted brain networks from fMRI data using template matching and the sum of squared differences (SSD) method. The final results revealed that our Spark (PySpark) based solution improved the performance (in terms of processing time) approximately fourfold when compared with the previous work developed in Python.
机译:最近,大数据应用已迅速扩展到各个行业。医疗保健是愿意使用大数据平台的行业之一,因此,该领域已采用了一些大数据分析工具。医学成像是诊断医疗保健的支柱,它涉及大量的数据收集和处理。使用各种医学成像方法,可以以不同的形式和分辨率获取大量的3D和4D图像。成像数据的预处理和分析当前是一个昂贵且耗时的过程。但是,由于诸如数据格式之类的某些限制,很少创建或修改用于医学成像目的的大数据平台。在本文中,我们使用Big Data Spark / PySpark平台在单个节点上设计,开发并成功测试了用于医学成像数据(特别是功能磁共振成像-fMRI)的新管道,这使我们能够读取和加载成像数据,将数据转换为弹性分布式数据集,以便并行操作和执行内存中数据处理,并将最终结果转换为成像格式。此外,管道还提供了一种以其他格式(例如数据帧)存储结果的选项。使用这个新管道,我们重复了以前的工作,其中我们使用模板匹配和平方差和(SSD)方法从fMRI数据中提取了大脑网络。最终结果表明,与以前用Python开发的工作相比,我们基于Spark(PySpark)的解决方案将性能(在处理时间方面)提高了大约四倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号