首页> 外文会议>Workshop on big data management in clouds >Multicore and Cloud-Based Solutions for Genomic Variant Analysis
【24h】

Multicore and Cloud-Based Solutions for Genomic Variant Analysis

机译:基于多核和基于云的基因组变体分析解决方案

获取原文

摘要

Genomic variant analysis is a complex process that allows to find and study genome mutations. For this purpose, analysis and tests from both biological and statistical points of view must be conducted. Biological data for this kind of analysis are typically stored according to the Variant Call Format (VCF), in gigabytes-sized files that cannot be efficiently processed using conventional software. In this paper, we introduce part of the High Performance Genomics (HPG) project, whose goal is to develop a collection of efficient and open-source software applications for the genomics area. The paper is mainly focused on HPG Variant, a suite that allows to get the effect of mutations and to conduct genomic-wide and family-based analysis, using a multi-tier architecture based on CellBase Database and a RESTful web service API. Two user clients are also provided: an HTML5 web client and a command-line interface, both using a back-end parallelized using OpenMP. Along with HPG Variant, a library for VCF files handling and a collection of utilities for VCF files preprocessing have been developed. Positive performance results are shown in comparison with other applications such as PLINK, GenABEL, SNPTEST or VCFtools.
机译:基因组变体分析是一种复杂的方法,允许寻找和研究基因组突变。为此目的,必须进行从生物学和统计观点的分析和测试。这种分析的生物数据通常根据变型呼叫格式(VCF)来存储在无法使用传统软件的千兆字节的大小文件中存储。在本文中,我们介绍了一部分高性能基因组学(HPG)项目,其目标是为基因组学区开发一系列有效和开源的软件应用。本文主要集中在HPG变体上,这是一种允许获得突变的效果并通过基于CellBase数据库的多层架构和RESTful Web服务API来实现突变效果和基于基于基于基于家族的分析的套件。还提供了两个用户客户端:HTML5 Web客户端和命令行界面,都使用使用OpenMP并行化的后端。除了HPG Variant之外,还开发了VCF文件处理的库和VCF文件预处理的实用程序集合。与其他应用相比,阳性绩效结果如Plink,Genabel,SNPTEST或VCFTOOLS相比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号