首页> 外文期刊>BMC Genomics >Interspecies hybridization on DNA resequencing microarrays: efficiency of sequence recovery and accuracy of SNP detection in human, ape, and codfish mitochondrial DNA genomes sequenced on a human-specific MitoChip
【24h】

Interspecies hybridization on DNA resequencing microarrays: efficiency of sequence recovery and accuracy of SNP detection in human, ape, and codfish mitochondrial DNA genomes sequenced on a human-specific MitoChip

机译:在DNA重测序微阵列上进行种间杂交:在人类特异性MitoChip上测序的人类,猿和鳕鱼线粒体DNA基因组中的序列恢复效率和SNP检测的准确性

获取原文
           

摘要

Background Iterative DNA "resequencing" on oligonucleotide microarrays offers a high-throughput method to measure intraspecific biodiversity, one that is especially suited to SNP-dense gene regions such as vertebrate mitochondrial (mtDNA) genomes. However, costs of single-species design and microarray fabrication are prohibitive. A cost-effective, multi-species strategy is to hybridize experimental DNAs from diverse species to a common microarray that is tiled with oligonucleotide sets from multiple, homologous reference genomes. Such a strategy requires that cross-hybridization between the experimental DNAs and reference oligos from the different species not interfere with the accurate recovery of species-specific data. To determine the pattern and limits of such interspecific hybridization, we compared the efficiency of sequence recovery and accuracy of SNP identification by a 15,452-base human-specific microarray challenged with human, chimpanzee, gorilla, and codfish mtDNA genomes. Results In the human genome, 99.67% of the sequence was recovered with 100.0% accuracy. Accuracy of SNP identification declines log-linearly with sequence divergence from the reference, from 0.067 to 0.247 errors per SNP in the chimpanzee and gorilla genomes, respectively. Efficiency of sequence recovery declines with the increase of the number of interspecific SNPs in the 25b interval tiled by the reference oligonucleotides. In the gorilla genome, which differs from the human reference by 10%, and in which 46% of these 25b regions contain 3 or more SNP differences from the reference, only 88% of the sequence is recoverable. In the codfish genome, which differs from the reference by > 30%, less than 4% of the sequence is recoverable, in short islands ≥ 12b that are conserved between primates and fish. Conclusion Experimental DNAs bind inefficiently to homologous reference oligonucleotide sets on a re-sequencing microarray when their sequences differ by more than a few percent. The data suggest that interspecific cross-hybridization will not interfere with the accurate recovery of species-specific data from multispecies microarrays, provided that the species' DNA sequences differ by > 20% (mean of 5b differences per 25b oligo). Recovery of DNA sequence data from multiple, distantly-related species on a single multiplex gene chip should be a practical, highly-parallel method for investigating genomic biodiversity.
机译:背景技术寡核苷酸微阵列上的迭代DNA“重测序”提供了一种高通量方法来测量种内生物多样性,该方法特别适合于SNP密集基因区域,例如脊椎动物线粒体(mtDNA)基因组。然而,单一物种设计和微阵列制造的成本高昂。一种经济高效的多物种​​策略是将来自不同物种的实验DNA杂交到一个普通的微阵列,该微阵列上铺有来自多个同源参考基因组的寡核苷酸集。这种策略要求实验DNA和来自不同物种的参考寡核苷酸之间的交叉杂交不干扰物种特异性数据的准确恢复。为了确定这种种间杂交的模式和限度,我们比较了15,452个碱基的人特异性微阵列对人类,黑猩猩,大猩猩和银鳕鱼mtDNA基因组的攻击,从而比较了序列恢复的效率和SNP鉴定的准确性。结果在人类基因组中,以100.0%的准确率回收了99.67%的序列。黑猩猩和大猩猩基因组中,每个SNP的SNP识别准确度随序列差异呈对数线性下降,分别为0.067至0.247个错误。序列恢复的效率随着参考寡核苷酸平铺的25b区间中种间SNP数量的增加而降低。在大猩猩基因组中,与人类参考基因相差10%,并且在这25b区域中有46%与参考基因有3个或更多的SNP差异,只有88%的序列是可恢复的。在与参考值相差> 30%的鳕鱼基因组中,在灵长类动物和鱼类之间保守的≥12b的短岛中,不到4%的序列是可恢复的。结论当实验DNA的序列差异超过百分之几时,它们与重测序微阵列上的同源参考寡核苷酸组的结合效率不高。数据表明,种间交叉杂交不会干扰从多物种微阵列中准确恢复物种特异性数据,前提是物种的DNA序列相差> 20%(每25b寡核苷酸5b差异的平均值)。在单个多重基因芯片上从多个远缘物种恢复DNA序列数据应该是一种实用,高度平行的调查基因组生物多样性的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号