Conference: International Conference for Internet Technology and Secured Transactions

A workflow for parallel and distributed computing of large-scale genomic data

Abstract

Workflow management systems are emerging as a dominant solution in bioinformatics because they enable researchers to analyze the huge amounts of data generated by modern laboratory equipment. The growth of genomic data generated by next-generation sequencing (NGS) has increased the need to analyze data on distributed computer clusters. In this paper, we construct a semi-automated workflow system for analyzing large-scale sequence data sets, describe a pipeline that uses parallel computation to perform the optimal computational steps required to analyze whole-genome sequence data, and report the overall execution time of the pipeline when run on cores across multiple machines.
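
The abstract does not describe the pipeline's steps, tools, or scheduler, so the following is only a minimal sketch of the general idea it names: split a set of sequence reads into chunks, run one analysis step on each chunk in parallel, and report the elapsed time for a given number of workers. Everything in it is an assumption made for illustration and not the authors' implementation: the function names (`gc_count`, `split_into_chunks`, `process_chunk`, `run_pipeline`), the toy GC-counting step, and the use of Python's standard `concurrent.futures` on a single machine rather than a distributed cluster.

```python
import time
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor
from typing import List, Sequence, Tuple


def gc_count(read: str) -> int:
    """Toy per-read step (hypothetical): count G and C bases in one read."""
    return sum(1 for base in read.upper() if base in "GC")


def split_into_chunks(reads: Sequence[str], n_chunks: int) -> List[List[str]]:
    """Split reads into at most n_chunks contiguous chunks of near-equal size."""
    n_chunks = max(1, min(n_chunks, len(reads)))
    size, extra = divmod(len(reads), n_chunks)
    chunks, start = [], 0
    for i in range(n_chunks):
        # The first `extra` chunks take one additional read each.
        end = start + size + (1 if i < extra else 0)
        chunks.append(list(reads[start:end]))
        start = end
    return chunks


def process_chunk(chunk: List[str]) -> int:
    """Work done by one worker: total GC count over its chunk."""
    return sum(gc_count(read) for read in chunk)


def run_pipeline(
    reads: Sequence[str],
    workers: int,
    executor_cls=ProcessPoolExecutor,
) -> Tuple[int, float]:
    """Run the toy step over all reads in parallel.

    Returns (total GC count, elapsed wall-clock seconds). The timing covers
    pool start-up, the parallel map, and pool shutdown.
    """
    chunks = split_into_chunks(reads, workers)
    start = time.perf_counter()
    with executor_cls(max_workers=workers) as pool:
        partials = list(pool.map(process_chunk, chunks))
    elapsed = time.perf_counter() - start
    return sum(partials), elapsed
```

Calling `run_pipeline(reads, workers)` for several worker counts (for example 1, 2, and 4) gives a simple table of execution time against core count. With the default `ProcessPoolExecutor`, the call should sit under an `if __name__ == "__main__":` guard so that worker processes can import the module safely; passing `executor_cls=ThreadPoolExecutor` avoids that requirement but does not use multiple cores for CPU-bound work in standard CPython. Spreading chunks across several machines, as the abstract reports, would need a cluster scheduler or a distributed workflow system, which this sketch does not attempt.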
