首页> 外文期刊>LIPIcs : Leibniz International Proceedings in Informatics >An Average-Case Sublinear Exact Li and Stephens Forward Algorithm
【24h】

An Average-Case Sublinear Exact Li and Stephens Forward Algorithm

机译:平均情况下的亚线性精确Li和Stephens正演算法

获取原文
           

摘要

Hidden Markov models of haplotype inheritance such as the Li and Stephens model allow for computationally tractable probability calculations using the forward algorithms as long as the representative reference panel used in the model is sufficiently small. Specifically, the monoploid Li and Stephens model and its variants are linear in reference panel size unless heuristic approximations are used. However, sequencing projects numbering in the thousands to hundreds of thousands of individuals are underway, and others numbering in the millions are anticipated.To make the Li and Stephens forward algorithm for these datasets computationally tractable, we have created a numerically exact version of the algorithm with observed average case O(nk^{0.35}) runtime in number of genetic sites n and reference panel size k. This avoids any tradeoff between runtime and model complexity. We demonstrate that our approach also provides a succinct data structure for general purpose haplotype data storage. We discuss generalizations of our algorithmic techniques to other hidden Markov models.
机译:只要模型中使用的代表性参考面板足够小,诸如Li和Stephens模型之类的单倍型继承的隐式Markov模型就可以使用正演算法进行易计算的概率计算。具体而言,除非使用启发式近似,否则单倍体Li和Stephens模型及其变体的参考面板大小是线性的。但是,正在进行的测序项目数量在数千到数十万之间,而其他项目预计在数以百万计。为了使这些数据集的Li和Stephens正向算法在计算上易于处理,我们创建了该算法的数值精确版本观察到的平均情况O(nk ^ {0.35})运行时间为遗传位点数n和参考面板大小k。这避免了运行时与模型复杂性之间的折衷。我们证明了我们的方法还为通用单倍型数据存储提供了简洁的数据结构。我们讨论了对其他隐马尔可夫模型的算法技术的概括。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号