Journal: Genome Biology and Evolution

Estimates of Positive Darwinian Selection Are Inflated by Errors in Sequencing, Annotation, and Alignment



Abstract

Published estimates of the proportion of positively selected genes (PSGs) in human vary over three orders of magnitude. In mammals, estimates of the proportion of PSGs cover an even wider range of values. We used 2,980 orthologous protein-coding genes from human, chimpanzee, macaque, dog, cow, rat, and mouse as well as an established phylogenetic topology to infer the fraction of PSGs in all seven terminal branches. The inferred fraction of PSGs ranged from 0.9% in human through 17.5% in macaque to 23.3% in dog. We found three factors that influence the fraction of genes that exhibit telltale signs of positive selection: the quality of the sequence, the degree of misannotation, and ambiguities in the multiple sequence alignment. The inferred fraction of PSGs in sequences that are deficient in all three criteria of coverage, annotation, and alignment is 7.2 times higher than that in genes with high trace sequencing coverage, “known” annotation status, and perfect alignment scores. We conclude that some estimates on the prevalence of positive Darwinian selection in the literature may be inflated and should be treated with caution.
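To put the spread between lineages in perspective, the three branch-level fractions quoted in the abstract can be expressed as fold differences relative to human. The following is only an illustrative sketch: the dictionary and function names are hypothetical, and only the three percentages stated above are used. The 7.2-fold figure comes from the authors' own stratification by coverage, annotation, and alignment quality, and cannot be recomputed from the abstract alone.

```python
# PSG fractions (percent) quoted in the abstract for three terminal branches.
# The other four branches are not reported in the abstract and are omitted.
psg_percent = {"human": 0.9, "macaque": 17.5, "dog": 23.3}


def fold_relative_to(reference, percentages):
    """Return each lineage's PSG fraction divided by the reference lineage's."""
    base = percentages[reference]
    return {lineage: pct / base for lineage, pct in percentages.items()}


folds = fold_relative_to("human", psg_percent)
for lineage, fold in sorted(folds.items(), key=lambda item: item[1]):
    print(f"{lineage}: {fold:.1f}x the human fraction")
```

Read this way, the inferred fraction in dog is roughly 26 times that in human, which is the kind of gap the authors attribute in part to differences in sequence quality, annotation, and alignment rather than to biology alone.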
