...
首页> 外文期刊>Journal of Molecular Evolution >Automated Removal of Noisy Data in Phylogenomic Analyses
【24h】

Automated Removal of Noisy Data in Phylogenomic Analyses

机译:自动删除系统分析中的嘈杂数据

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Noisy data, especially in combination with misalignment and model misspecification can have an adverse effect on phylogeny reconstruction; however, effective methods to identify such data are few. One particularly important class of noisy data is saturated positions. To avoid potential errors related to saturation in phylogenomic analyses, we present an automated procedure involving the step-wise removal of the most variable positions in a given data set coupled with a stopping criterion derived from correlation analyses of pairwise ML distances calculated from the deleted (saturated) and the remaining (conserved) subsets of the alignment. Through a comparison with existing methods, we demonstrate both the effectiveness of our proposed procedure for identifying noisy data and the effect of the removal of such data using a well-publicized case study involving placental mammals. At the least, our procedure will identify data sets requiring greater data exploration, and we recommend its use to investigate the effect on phylogenetic analyses of removing subsets of variable positions exhibiting weak or no correlation to the rest of the alignment. However, we would argue that this procedure, by identifying and removing noisy data, facilitates the construction of more accurate phylogenies by, for example, ameliorating potential long-branch attraction artefacts.
机译:嘈杂的数据,尤其是与未对齐和模型错误指定结合使用时,可能会对系统发育重建产生不利影响;但是,识别此类数据的有效方法很少。一类特别重要的噪声数据是饱和位置。为了避免与系统生物学分析中的饱和度有关的潜在错误,我们提出了一种自动化程序,该程序涉及逐步删除给定数据集中最多可变位置的步骤,并结合根据从删除的(饱和)和其余(保守)比对子集。通过与现有方法的比较,我们证明了我们提出的程序用于识别嘈杂数据的有效性以及使用涉及胎盘哺乳动物的广为宣传的案例研究去除此类数据的效果。至少,我们的程序将识别需要更多数据探索的数据集,并且我们建议使用它来调查系统发育分析中去除与其余比对关系较弱或没有相关性的可变位置子集的影响。但是,我们认为,此过程通过识别和消除嘈杂的数据,例如通过减轻潜在的长分支吸引伪像,有助于构建更准确的系统发育。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号