首页> 外文期刊>Annals of the American Thoracic Society >A Model for Distributed Processing and Analyses of NGS Data under Map-Reduce Paradigm
【24h】

A Model for Distributed Processing and Analyses of NGS Data under Map-Reduce Paradigm

机译:地图减少范式下的NGS数据分布式处理和分析模型

获取原文
获取原文并翻译 | 示例
           

摘要

Massively parallel sequencing technique, introduced by NGS technology, has resulted in an exponential growth of sequencing data, with greatly reduced cost and increased throughput. This huge explosion of data has introduced new challenges in regard to its storage, integration, processing, and analyses. In this paper, we have proposed a novel distributed model under Map-Reduce paradigm to address the NGS big data problem. The architecture of the model involves Map-Reduce based modularized approach involving three different phases that support various analytical pipelines. The first phase will generate detailed base level information of various individual genomes, by granulating the alignment data. The other two phases independently process this base level information in parallel. One of these two phases will provide an integrated DNA profile of multiple individuals, whereas the other phase will generate contigs with similar features in an individual. Each of these three phases will generate a repository of genomic information that will facilitate other analytical pipelines. A simulated and real experimental prototypes has been provided as results to show the effectiveness of the model and its superiority over a few existing popular models and tools. A detailed description of the scope of applications of this model is also included in this article.
机译:由NGS技术引入的大规模平行测序技术导致测序数据的指数增长,成本大大降低和产量增加。这种巨大的数据爆炸在其存储,集成,处理和分析方面引入了新的挑战。在本文中,我们提出了一种在地图减少范式下的新型分布式模型来解决NGS大数据问题。该模型的体系结构涉及地图 - 减少基于模块化的方法,涉及三个不同阶段支持各种分析管道的阶段。通过造粒对准数据,第一阶段将产生各种子种组的详细基础级别信息。另外两个阶段独立地并行地处理该基础级别信息。这两种阶段中的一种将提供多个个体的集成DNA曲线,而另一个阶段将产生具有个体中具有类似特征的Contigs。这三个阶段中的每一个都将生成基因组信息的存储库,其将促进其他分析管道。已经提供了模拟和实际实验原型的结果,以显示模型的有效性及其在少数现有流行型号和工具上的优越性。本文还包含对该模型的应用范围的详细描述。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号