【24h】

Parallel Processing of Genomics Data

机译:基因组学数据的并行处理

获取原文

摘要

The availability of high-throughput experimental platforms for the analysis of biological samples, such as mass spectrometry, microarrays and Next Generation Sequencing, have made possible to analyze a whole genome in a single experiment. Such platforms produce an enormous volume of data per single experiment, thus the analysis of this enormous flow of data poses several challenges in term of data storage, preprocessing, and analysis. To face those issues, efficient, possibly parallel, bioinfor-matics software needs to be used to preprocess and analyze data, for instance to highlight genetic variation associated with complex diseases. In this paper we present a parallel algorithm for the parallel preprocessing and statistical analysis of genomics data, able to face high dimension of data and resulting in good response time. The proposed system is able to find statistically significant biological markers able to discriminate classes of patients that respond to drugs in different ways. Experiments performed on real and synthetic genomic datasets show good speed-up and scalability.
机译:用于分析生物样品的高通量试验平台,例如质谱,微阵列和下一代测序,可以在单一实验中分析整个基因组。这些平台每单一实验产生巨大的数据量,因此对数据存储,预处理和分析期限的这种巨大数据流程的分析构成了几个挑战。要面对这些问题,有效,可能并行,生物中的生物中的软件需要用于预处理和分析数据,例如以突出与复杂疾病相关的遗传变异。在本文中,我们提出了一种平行算法,用于基因组学数据的并行预处理和统计分析,能够面对数据的高维度并导致良好的响应时间。该拟议的系统能够找到能够以不同方式对药物响应药物的患者的统计上显着的生物标记。在实际和合成基因组数据集上进行的实验显示出良好的加速和可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号