
Enabling large-scale storage and retrieval of whole slide images: A big data approach.

Abstract

Telepathology has the potential to transform the practice of pathology for both patients and pathologists. It can give hospitals across the US wider and faster access to expert pathologists, improve pathologists' daily workflow, support better diagnosis and treatment, reduce medical errors, and help hospitals cope with a constantly increasing caseload. Several technical challenges must be overcome to enable telepathology at large scale in US hospitals. First, advances in digital imaging allow a glass slide to be scanned into a whole slide image (WSI) of near-optical resolution, but WSIs are very large (about 6 GB per image). Cost-effective, scalable storage is needed to host millions of WSIs and serve thousands of requests per day from hundreds of pathologists. Second, the underlying network infrastructure must be capable of transferring terabytes of image data per day.

Because a pathologist may view a few hundred slides a day, WSIs must be accessible in real time, with minimal transmission delay and a smooth viewing experience. In this work, we propose a software system for large-scale storage and retrieval of WSIs using Apache Spark on a cluster. Each WSI is partitioned using a space-filling curve and stored using Apache Spark's collection abstraction together with range partitioning. This lets us place spatially close partitions of a WSI together on the same cluster node. During retrieval, the partitions of a WSI are read and transmitted over the network in parallel. In experiments on CloudLab with multi-gigabyte images, our approach was 2 times faster than remote copy.
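The partitioning idea in the abstract can be illustrated with a small sketch. The abstract does not say which space-filling curve the thesis uses, so a Z-order (Morton) curve is assumed here. The tile grid, the function names `morton_index` and `range_partition`, and the partition count are all hypothetical. The sketch uses plain Python rather than Spark so that it runs without a cluster. It only shows how ordering tiles along such a curve and cutting the order into contiguous ranges keeps spatially close tiles in the same partition.

```python
def morton_index(x: int, y: int, bits: int = 16) -> int:
    """Z-order (Morton) index of tile (x, y): interleave the bits of x and y.

    Assumption: the thesis may use a different space-filling curve.
    """
    z = 0
    for i in range(bits):
        z |= ((x >> i) & 1) << (2 * i)      # x bits go to even positions
        z |= ((y >> i) & 1) << (2 * i + 1)  # y bits go to odd positions
    return z


def range_partition(tiles, num_partitions):
    """Sort tiles along the curve, then cut into contiguous, near-equal ranges."""
    ordered = sorted(tiles, key=lambda t: morton_index(t[0], t[1]))
    size = -(-len(ordered) // num_partitions)  # ceiling division
    return [ordered[i:i + size] for i in range(0, len(ordered), size)]


# Hypothetical 4x4 grid of tiles cut from one whole slide image.
tiles = [(x, y) for y in range(4) for x in range(4)]
partitions = range_partition(tiles, num_partitions=4)

for pid, part in enumerate(partitions):
    print(pid, part)
```

With this grid, each of the four partitions holds one 2x2 quadrant of the image, so tiles that are neighbors on the slide tend to share a partition. In a Spark setting, the analogous step would be to key each tile by its curve index and range-partition on that key. How the thesis actually maps partitions to cluster nodes is not described in the abstract.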

Bibliographic record

  • Author: Nuchimaniyanda, Vinutha
  • Author affiliation: University of Missouri - Kansas City
  • Degree-granting institution: University of Missouri - Kansas City
  • Subjects: Computer science; Medical imaging; Pathology
  • Degree: M.S.
  • Year: 2016
  • Pages: 46
  • Original format: PDF
  • Language: English
