首页> 美国卫生研究院文献>Genome Biology and Evolution >Inferring Rates and Length-Distributions of Indels Using Approximate Bayesian Computation
【2h】

Inferring Rates and Length-Distributions of Indels Using Approximate Bayesian Computation

机译:使用近似贝叶斯计算推断Indel的速率和长度分布

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

The most common evolutionary events at the molecular level are single-base substitutions, as well as insertions and deletions (indels) of short DNA segments. A large body of research has been devoted to develop probabilistic substitution models and to infer their parameters using likelihood and Bayesian approaches. In contrast, relatively little has been done to model indel dynamics, probably due to the difficulty in writing explicit likelihood functions. Here, we contribute to the effort of modeling indel dynamics by presenting SpartaABC, an approximate Bayesian computation (ABC) approach to infer indel parameters from sequence data (either aligned or unaligned). SpartaABC circumvents the need to use an explicit likelihood function by extracting summary statistics from simulated sequences. First, summary statistics are extracted from the input sequence data. Second, SpartaABC samples indel parameters from a prior distribution and uses them to simulate sequences. Third, it computes summary statistics from the simulated sets of sequences. By computing a distance between the summary statistics extracted from the input and each simulation, SpartaABC can provide an approximation to the posterior distribution of indel parameters as well as point estimates. We study the performance of our methodology and show that it provides accurate estimates of indel parameters in simulations. We next demonstrate the utility of SpartaABC by studying the impact of alignment errors on the inference of positive selection. A C ++ program implementing SpartaABC is freely available in http://spartaabc.tau.ac.il.
机译:在分子水平上最常见的进化事件是单碱基取代以及短DNA片段的插入和缺失(indels)。大量的研究致力于开发概率替换模型,并使用似然法和贝叶斯方法推断其参数。相反,可能由于编写显式似然函数的困难而对indel动力学进行建模的工作相对较少。在这里,我们通过介绍SpartaABC(一种近似的贝叶斯计算(ABC)方法,从序列数据(对齐或未对齐)中推导indel参数)来为indel动力学建模做出贡献。 SpartaABC通过从模拟序列中提取摘要统计信息来规避使用显式似然函数的需求。首先,从输入序列数据中提取摘要统计信息。其次,SpartaABC从先前的分布中采样插入/缺失参数,并使用它们来模拟序列。第三,它从模拟的序列集中计算摘要统计量。通过计算从输入中提取的摘要统计量与每个模拟之间的距离,SpartaABC可以提供indel参数的后验分布以及点估计的近似值。我们研究了方法的性能,并表明它可以在仿真中提供indel参数的准确估计。接下来,我们将通过研究比对错误对正选择推断的影响来证明SpartaABC的实用性。可在http://spartaabc.tau.ac.il免费获得实现SpartaABC的C ++程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号