A New Effective Method for Estimating Missing Values in the Sequence Data Prior to Phylogenetic Analysis

Abdoulaye Baniré Diallo; Fran?ois-Joseph Lapointe; Vladimir Makarenkov

首页> 外文期刊>Evolutionary Bioinformatics >A New Effective Method for Estimating Missing Values in the Sequence Data Prior to Phylogenetic Analysis

【24h】

A New Effective Method for Estimating Missing Values in the Sequence Data Prior to Phylogenetic Analysis

机译：系统发育分析之前估算序列数据缺失值的新有效方法

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this article we address the problem of phylogenetic inference from nucleic acid data containing missing bases. We introduce a new effective approach, called “Probabilistic estimation of missing values” (PEMV), allowing one to estimate unknown nucleotides prior to computing the evolutionary distances between them. We show that the new method improves the accuracy of phylogenetic inference compared to the existing methods “Ignoring Missing Sites” (IMS), “Proportional Distribution of Missing and Ambiguous Bases” (PDMAB) included in the PAUP software [26]. The proposed strategy for estimating missing nucleotides is based on probabilistic formulae developed in the framework of the Jukes-Cantor [10] and Kimura 2-parameter [11] models. The relative performances of the new method were assessed through simulations carried out with the SeqGen program [20], for data generation, and the BioNJ method [7], for inferring phylogenies. We also compared the new method to the DNAML program [5] and “Matrix Representation using Parsimony” (MRP) [13, 19] considering an example of 66 eutherian mammals originally analyzed in [17].

机译：在本文中，我们从包含缺失碱基的核酸数据中解决了系统发育推断的问题。我们引入了一种新的有效方法，称为“丢失值的概率估计”（PEMV），允许人们在计算未知核苷酸之间的进化距离之前对其进行估计。我们证明，与PAUP软件中包含的现有方法“忽略缺失位点”（IMS），“缺失和歧义碱基的比例分布”（PDMAB）相比，新方法提高了系统发育推断的准确性[26]。提议的估计核苷酸缺失的策略是基于在Jukes-Cantor [10]和Kimura 2-parameter [11]模型的框架中开发的概率公式。通过使用SeqGen程序[20]进行的仿真（用于数据生成）和BioNJ方法[7]的用于进行系统进化的仿真，评估了该新方法的相对性能。我们还将新方法与DNAML程序[5]和“使用简约性的矩阵表示法”（MRP）[13，19]进行了比较，考虑了最初在[17]中分析的66个以太子哺乳动物的例子。

著录项

来源
《Evolutionary Bioinformatics》 |2017年第4期|共页
作者
Abdoulaye Baniré Diallo; Fran?ois-Joseph Lapointe; Vladimir Makarenkov;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类生物工程学（生物技术）;
关键词

相似文献

外文文献
中文文献
专利

1. Bayesian sensitivity analysis methods to evaluate bias due to misclassification and missing data using informative priors and external validation data [J] . LutaG., FordM.B., BondyM., Cancer epidemiology . 2013,第2期

机译：贝叶斯敏感性分析方法，可使用信息丰富的先验数据和外部验证数据评估由于分类错误和数据丢失而造成的偏差
2. Estimating treatment effects from longitudinal clinical trial data with missing values: comparative analyses using different methods. [J] . Houck PR, Mazumdar S, Koru Sengul T, Psychiatry research . 2004,第2期

机译：根据纵向临床试验数据（缺少值）估算治疗效果：使用不同方法进行的比较分析。
3. Analysis of data including missing values in the Taguchi’s T method [J] . Yuto Nakao, Yasushi Nagata Total Quality Science . 2018,第2期

机译：Taguchi T方法中的数据分析，包括缺失值
4. Methods for estimating the autocorrelation and power spectral density functions when there are many missing data values [C] . Grossbard, N.J., Dewan, . 1990

机译：缺少许多数据值时估算自相关和功率谱密度函数的方法
5. Methodological and clinical issues in analysis of data from HIV cardiovascular research: Validity of ultrasound methods, impact of anti-retroviral therapy on atherosclerosis, and imputation of missing values. [D] . Odueyungbo, Adefowope. 2010

机译：HIV心血管研究数据分析中的方法学和临床问题：超声方法的有效性，抗逆转录病毒疗法对动脉粥样硬化的影响以及缺失值的归因。
6. A new effective method for estimating missing values in the sequence data prior to phylogenetic analysis [O] . Abdoulaye Baniré Diallo, François-Joseph Lapointe, Vladimir Makarenkov 2006

机译：一种在系统发育分析之前估算序列数据中缺失值的新有效方法
7. A new effective method for estimating missing valuesudin the sequence data prior to phylogenetic analysis [O] . Diallo Abdoulaye Baniré, Lapointe François-Joseph, Makarenkov Vladimir 2006

机译：一种估计缺失值的新有效方法 ud在系统发育分析之前的序列数据中

A New Effective Method for Estimating Missing Values in the Sequence Data Prior to Phylogenetic Analysis

摘要

著录项

相似文献

相关主题

期刊订阅