首页> 美国卫生研究院文献>Journal of Computational Biology >Calculating Higher-Order Moments of Phylogenetic Stochastic Mapping Summaries in Linear Time
【2h】

Calculating Higher-Order Moments of Phylogenetic Stochastic Mapping Summaries in Linear Time

机译:计算线性时间系统发生随机映射摘要的高阶矩

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Stochastic mapping is a simulation-based method for probabilistically mapping substitution histories onto phylogenies according to continuous-time Markov models of evolution. This technique can be used to infer properties of the evolutionary process on the phylogeny and, unlike parsimony-based mapping, conditions on the observed data to randomly draw substitution mappings that do not necessarily require the minimum number of events on a tree. Most stochastic mapping applications simulate substitution mappings only to estimate the mean and/or variance of two commonly used mapping summaries: the number of particular types of substitutions (labeled substitution counts) and the time spent in a particular group of states (labeled dwelling times) on the tree. Fast, simulation-free algorithms for calculating the mean of stochastic mapping summaries exist. Importantly, these algorithms scale linearly in the number of tips/leaves of the phylogenetic tree. However, to our knowledge, no such algorithm exists for calculating higher-order moments of stochastic mapping summaries. We present one such simulation-free dynamic programming algorithm that calculates prior and posterior mapping variances and scales linearly in the number of phylogeny tips. Our procedure suggests a general framework that can be used to efficiently compute higher-order moments of stochastic mapping summaries without simulations. We demonstrate the usefulness of our algorithm by extending previously developed statistical tests for rate variation across sites and for detecting evolutionarily conserved regions in genomic sequences.
机译:>随机映射是一种基于模拟的方法,可以根据连续时间的马尔可夫演化模型将替换历史概率映射到系统发育上。此技术可用于推断系统发育进化过程的属性,并且与基于简约的映射不同,可根据观察到的数据条件随机绘制替代映射,而这些映射不一定需要树上最少的事件。大多数随机映射应用程序都模拟替代映射,只是为了估计两种常用映射摘要的均值和/或方差:特定类型的替代数量(标记为替代计数)和在特定状态组中花费的时间(标记为居住时间)在树上。存在用于计算随机映射摘要的平均值的快速,无仿真算法。重要的是,这些算法在系统发育树的尖端/叶的数量上线性增长。但是,据我们所知,不存在用于计算随机映射摘要的高阶矩的算法。我们提出了一种这样的无仿真动态规划算法,该算法可以计算前后映射的方差并在系统发育提示的数量上线性缩放。我们的过程提出了一个通用框架,该框架可用于有效计算随机映射摘要的高阶矩,而无需进行模拟。我们通过扩展先前开发的统计测试来跨站点进行速率变化并检测基因组序列中的进化保守区域,证明了该算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号