...
首页> 外文期刊>Ecology and Evolution >4P: fast computing of population genetics statistics from large DNA polymorphism panels
【24h】

4P: fast computing of population genetics statistics from large DNA polymorphism panels

机译:4P:通过大型DNA多态性面板快速计算种群遗传统计数据

获取原文

摘要

SummaryMassive DNA sequencing has significantly increased the amount of data available for population genetics and molecular ecology studies. However, the parallel computation of simple statistics within and between populations from large panels of polymorphic sites is not yet available, making the exploratory analyses of a set or subset of data a very laborious task. Here, we present 4P (parallel processing of polymorphism panels), a stand-alone software program for the rapid computation of genetic variation statistics (including the joint frequency spectrum) from millions of DNA variants in multiple individuals and multiple populations. It handles a standard input file format commonly used to store DNA variation from empirical or simulation experiments. The computational performance of 4P was evaluated using large SNP (single nucleotide polymorphism) datasets from human genomes or obtained by simulations. 4P was faster or much faster than other comparable programs, and the impact of parallel computing using multicore computers or servers was evident. 4P is a useful tool for biologists who need a simple and rapid computer program to run exploratory population genetics analyses in large panels of genomic data. It is also particularly suitable to analyze multiple data sets produced in simulation studies. Unix, Windows, and MacOs versions are provided, as well as the source code for easier pipeline implementations.
机译:总结大规模的DNA测序大大增加了可用于群体遗传学和分子生态学研究的数据量。但是,尚无法从大型多态位点面板中对种群内部和种群之间的简单统计进行并行计算,这使得对一组数据或子集的探索性分析变得非常艰巨。在这里,我们介绍4P(多态性面板的并行处理),这是一个独立的软件程序,用于快速计算来自多个个体和多个人群的数百万个DNA变异的遗传变异统计量(包括联合频谱)。它处理标准输入文件格式,通常用于存储来自经验或模拟实验的DNA变异。使用来自人类基因组的大型SNP(单核苷酸多态性)数据集或通过仿真获得4P的计算性能。 4P比其他同类程序快或快得多,并且使用多核计算机或服务器进行并行计算的影响显而易见。 4P是生物学家的有用工具,他们需要简单,快速的计算机程序来在大量的基因组数据中进行探索性种群遗传学分析。它也特别适合分析模拟研究中产生的多个数据集。提供了Unix,Windows和MacOs版本,以及用于简化管道实现的源代码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号