首页> 外文期刊>Cancer Informatics >Next Generation Distributed Computing for Cancer Research
【24h】

Next Generation Distributed Computing for Cancer Research

机译:用于癌症研究的下一代分布式计算

获取原文
           

摘要

Advances in next generation sequencing (NGS) and mass spectrometry (MS) technologies have provided many new opportunities and angles for extending the scope of translational cancer research while creating tremendous challenges in data management and analysis. The resulting informatics challenge is invariably not amenable to the use of traditional computing models. Recent advances in scalable computing and associated infrastructure, particularly distributed computing for Big Data, can provide solutions for addressing these challenges. In this review, the next generation of distributed computing technologies that can address these informatics problems is described from the perspective of three key components of a computational platform, namely computing, data storage and management, and networking. A broad overview of scalable computing is provided to set the context for a detailed description of Hadoop, a technology that is being rapidly adopted for large-scale distributed computing. A proof-of-concept Hadoop cluster, set up for performance benchmarking of NGS read alignment, is described as an example of how to work with Hadoop. Finally, Hadoop is compared with a number of other current technologies for distributed computing.
机译:下一代测序(NGS)和质谱(MS)技术的进步为扩展转化癌症研究的范围提供了许多新机会和新角度,同时给数据管理和分析带来了巨大挑战。由此产生的信息学挑战总是不适合传统计算模型的使用。可伸缩计算和相关基础架构(尤其是大数据的分布式计算)的最新进展可以提供解决这些挑战的解决方案。在这篇综述中,从计算平台的三个关键组件(即计算,数据存储和管理以及网络)的角度描述了可以解决这些信息学问题的下一代分布式计算技术。提供了可伸缩计算的广泛概述,为Hadoop的详细说明设置了背景,该技术已被大规模分布式计算迅速采用。描述了为NGS读取对齐的性能基准测试而建立的概念验证Hadoop集群,作为如何与Hadoop配合使用的示例。最后,将Hadoop与许多其他当前的分布式计算技术进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号