首页> 外文会议>IEEE International Conference on High Performance Computing and Communications >Quantifying and Mitigating Computational Inefficiency of Genomics Data Analysis
【24h】

Quantifying and Mitigating Computational Inefficiency of Genomics Data Analysis

机译:量化和减轻基因组学数据分析的计算效率

获取原文
获取外文期刊封面目录资料

摘要

In this paper, we performed a comprehensive study of quantifying and mitigating computational inefficiency of current genomic analysis approaches. First, we found current parallelization approaches that have limited scalability due to either unexploited parallelism or low utilization of system resource. Thus, we proposed Spark-Gene, which is on the basis of Spark in-memory programming model. To test the performance of our Spark-Gene, we used WGS in the GATK as the test case. We show that Spark-Gene reduces the execution time of WGS analysis from 19 hours to 30 minutes with a speedup in excess of 37-fold at 256 CPU cores. Furthermore, we identified that garbage collection is the major scalable bottleneck of better parallel efficiency for native in-memory computing model. Second, we quantified microarchitectural inefficiency for typical genomic applications and uncovered opportunities for microarchitectural optimizations for the design of genomic domain-specific accelerator, especially on specializing concurrency, computation and memory hierarchy. This paper is to leverage state-of-art big-data technologies to improve parallelization for genomics analysis and motivate the integration of accelerators into the genomic analysis computing system.
机译:在本文中,我们对量化和减轻当前基因组分析方法的计算效率综合研究。首先,我们发现由于未分发的并行性或系统资源的利用率低,所以目前的并行化方法具有有限的可扩展性。因此,我们提出了火花基因,这是基于火花内记录编程模型的基础。为了测试我们的Spark-基因的性能,我们在GATK中使用了WG作为测试用例。我们表明Spark-Gene将WGS分析的执行时间降低到19小时至30分钟,其加速超过256个CPU核心。此外,我们确定了垃圾收集是本机内存计算模型更好并行效率的主要可扩展瓶颈。其次,我们量化了典型基因组应用的微架构低效率,以及对基因组结构域 - 特异性加速器设计的微体建筑优化的机会,尤其是专用并发性,计算和存储器层次结构。本文是利用最先进的大数据技术来改善基因组学分析的并行化,并激励加速器将加速器的集成到基因组分析计算系统中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号