...
首页> 外文期刊>Evolutionary Bioinformatics >A New Effective Method for Estimating Missing Values in the Sequence Data Prior to Phylogenetic Analysis
【24h】

A New Effective Method for Estimating Missing Values in the Sequence Data Prior to Phylogenetic Analysis

机译:系统发育分析之前估算序列数据缺失值的新有效方法

获取原文
   

获取外文期刊封面封底 >>

       

摘要

In this article we address the problem of phylogenetic inference from nucleic acid data containing missing bases. We introduce a new effective approach, called “Probabilistic estimation of missing values” (PEMV), allowing one to estimate unknown nucleotides prior to computing the evolutionary distances between them. We show that the new method improves the accuracy of phylogenetic inference compared to the existing methods “Ignoring Missing Sites” (IMS), “Proportional Distribution of Missing and Ambiguous Bases” (PDMAB) included in the PAUP software [26]. The proposed strategy for estimating missing nucleotides is based on probabilistic formulae developed in the framework of the Jukes-Cantor [10] and Kimura 2-parameter [11] models. The relative performances of the new method were assessed through simulations carried out with the SeqGen program [20], for data generation, and the BioNJ method [7], for inferring phylogenies. We also compared the new method to the DNAML program [5] and “Matrix Representation using Parsimony” (MRP) [13, 19] considering an example of 66 eutherian mammals originally analyzed in [17].
机译:在本文中,我们从包含缺失碱基的核酸数据中解决了系统发育推断的问题。我们引入了一种新的有效方法,称为“丢失值的概率估计”(PEMV),允许人们在计算未知核苷酸之间的进化距离之前对其进行估计。我们证明,与PAUP软件中包含的现有方法“忽略缺失位点”(IMS),“缺失和歧义碱基的比例分布”(PDMAB)相比,新方法提高了系统发育推断的准确性[26]。提议的估计核苷酸缺失的策略是基于在Jukes-Cantor [10]和Kimura 2-parameter [11]模型的框架中开发的概率公式。通过使用SeqGen程序[20]进行的仿真(用于数据生成)和BioNJ方法[7]的用于进行系统进化的仿真,评估了该新方法的相对性能。我们还将新方法与DNAML程序[5]和“使用简约性的矩阵表示法”(MRP)[13,19]进行了比较,考虑了最初在[17]中分析的66个以太子哺乳动物的例子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号