首页> 外文期刊>Proceedings of the National Academy of Sciences of the United States of America >Dynamic programming procedure for searching optimal models to estimate substitution rates based on the maximum-likelihood method
【24h】

Dynamic programming procedure for searching optimal models to estimate substitution rates based on the maximum-likelihood method

机译:基于最大似然法的搜索最优模型以估计替代率的动态编程程序

获取原文
获取原文并翻译 | 示例
           

摘要

The substitution rate in a gene can provide valuable information for understanding its functionality and evolution. A widely used method to estimate substitution rates is the maximum-likelihood method implemented in the CODEML program in the PAML package. A limited number of branch models, chosen based on a priori information or an interest in a particular lineage(s), are tested, whereas a large number of potential models are neglected. A complementary approach is also needed to test all or a large number of possible models to search for the globally optional model(s) of maximum likelihood. However, the computational time for this search even in a small number of sequences becomes impractically long. Thus, it is desirable to explore the most probable spaces to search for the optimal models. Using dynamic programming techniques, we developed a simple computational method for searching the most probable optimal branch-specific models in a practically feasible computational time. We propose three search methods to find the optimal models, which explored O(n) (method 1) to O(n2) (method 2 and method 3) models when the given phytogeny has n branches. In addition, we derived a formula to calculate the number of all possible models, revealing the complexity of finding the optimal branch-specific model. We show that in a reanalysis of over 50 previously published studies, the vast majority obtained better models with significantly higher likelihoods than the conventional hypothesis model methods.
机译:基因中的取代率可以为了解其功能和进化提供有价值的信息。估计替代率的一种广泛使用的方法是PAML软件包中CODEML程序中实现的最大似然法。测试了基于先验信息或对特定谱系的兴趣而选择的有限数量的分支模型,而忽略了大量潜在模型。还需要一种补充方法来测试所有或大量可能的模型,以搜索具有最大可能性的全局可选模型。然而,即使在少量序列中,用于该搜索的计算时间也变得不切实际地长。因此,期望探索最可能的空间以搜索最佳模型。使用动态编程技术,我们开发了一种简单的计算方法,可以在实际可行的计算时间内搜索最可能的最优分支特定模型。我们提出了三种搜索方法来找到最佳模型,当给定的植物遗传学具有n个分支时,它们探索了O(n)(方法1)到O(n2)(方法2和方法3)模型。此外,我们导出了一个公式来计算所有可能模型的数量,从而揭示了找到最佳分支特定模型的复杂性。我们显示,在对50多个先前发表的研究进行的重新分析中,绝大多数人获得了比传统假设模型方法更好的模型,并且可能性大大提高。

著录项

  • 来源
  • 作者单位

    National Key Laboratory of Crop Genetic Improvement and National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan 430070, China;

    National Key Laboratory of Crop Genetic Improvement and National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan 430070, China;

    National Key Laboratory of Crop Genetic Improvement and National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan 430070, China;

    National Key Laboratory of Crop Genetic Improvement and National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan 430070, China;

    Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637;

    National Key Laboratory of Crop Genetic Improvement and National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan 430070, China;

  • 收录信息 美国《科学引文索引》(SCI);美国《生物学医学文摘》(MEDLINE);美国《化学文摘》(CA);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    likelihood-ratio test; natural selection; positive selection; synonymous substitution; nonsynonymous substitution;

    机译:似然比检验自然选择;正选择;同义替换;非同义替换;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号