首页> 外文期刊>BMC Bioinformatics >Boosting forward-time population genetic simulators through genotype compression
【24h】

Boosting forward-time population genetic simulators through genotype compression

机译:通过基因型压缩促进前瞻性种群遗传模拟器

获取原文
           

摘要

Background Forward-time population genetic simulations play a central role in deriving and testing evolutionary hypotheses. Such simulations may be data-intensive, depending on the settings to the various parameters controlling them. In particular, for certain settings, the data footprint may quickly exceed the memory of a single compute node. Results We develop a novel and general method for addressing the memory issue inherent in forward-time simulations by compressing and decompressing, in real-time, active and ancestral genotypes, while carefully accounting for the time overhead. We propose a general graph data structure for compressing the genotype space explored during a simulation run, along with efficient algorithms for constructing and updating compressed genotypes which support both mutation and recombination. We tested the performance of our method in very large-scale simulations. Results show that our method not only scales well, but that it also overcomes memory issues that would cripple existing tools. Conclusions As evolutionary analyses are being increasingly performed on genomes, pathways, and networks, particularly in the era of systems biology, scaling population genetic simulators to handle large-scale simulations is crucial. We believe our method offers a significant step in that direction. Further, the techniques we provide are generic and can be integrated with existing population genetic simulators to boost their performance in terms of memory usage.
机译:背景技术长期的人口遗传模拟在推导和检验进化假设中起着核心作用。这种模拟可能是数据密集型的,具体取决于控制它们的各种参数的设置。特别是,对于某些设置,数据足迹可能会迅速超过单个计算节点的内存。结果我们开发了一种新颖且通用的方法,通过实时,主动和祖先的基因型进行压缩和解压缩,从而解决了实时模拟中固有的内存问题,同时仔细考虑了时间开销。我们提出了一种用于压缩模拟运行中探索的基因型空间的通用图数据结构,以及用于构建和更新支持突变和重组的压缩基因型的有效算法。我们在非常大规模的仿真中测试了我们方法的性能。结果表明,我们的方法不仅可以很好地扩展,而且还解决了可能使现有工具瘫痪的内存问题。结论随着对基因组,途径和网络的进化分析越来越多,尤其是在系统生物学时代,对种群遗传模拟器进行扩展以处理大规模模拟至关重要。我们相信我们的方法朝着这个方向迈出了重要的一步。此外,我们提供的技术是通用的,可以与现有的种群遗传模拟器集成,以提高其在内存使用方面的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号