...
首页> 外文期刊>BMC Genomics >Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping
【24h】

Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping

机译:通过基因组重测序和高通量基因分型对牛基因组变异进行全球评估

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Background Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. Results We report the integration of the whole genome sequence of a single Holstein Friesian bull with data from single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array technologies to determine a comprehensive spectrum of genomic variation. The performance of resequencing SNP detection was assessed by combining SNPs that were identified to be either in identity by descent (IBD) or in copy number variation (CNV) with results from SNP array genotyping. Coding insertions and deletions (indels) were found to be enriched for size in multiples of 3 and were located near the N- and C-termini of proteins. For larger indels, a combination of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. Conclusions Our results provide high resolution mapping of diverse classes of genomic variation in an individual bovine genome and demonstrate that structural variation surpasses sequence variation as the main component of genomic variability. Better accuracy of SNP detection was achieved with little loss of sensitivity when algorithms that implemented mapping quality were used. IBD regions were found to be instrumental for calculating resequencing SNP accuracy, while SNP detection within CNVs tended to be less reliable. CNV discovery was affected dramatically by platform resolution and coverage biases. The combined data for this study showed that at a moderate level of sequencing coverage, an ensemble of platforms and tools can be applied together to maximize the accurate detection of sequence and structural variants.
机译:背景基因组变异与表型信息的整合是发现基因型-表型关联的有效方法。这需要准确鉴定单个基因组中不同类型的变异。结果我们报告了单头荷斯坦黑白花公牛的全基因组序列与单核苷酸多态性(SNP)和比较基因组杂交(CGH)阵列技术的数据的整合,以确定基因组变异的综合范围。通过将通过血缘鉴定(IBD)或在拷贝数变异(CNV)中被鉴定为同一性的SNP与SNP阵列基因分型的结果相结合,来评估重测序SNP检测的性能。发现编码插入和缺失(indels)的大小以3的倍数富集,并且位于蛋白质的N末端和C末端附近。对于更大的插入缺失,分离阅读和阅读对方法的组合被证明在寻找不同特征上是互补的。基于测序读段的深度并使用SNP和CGH阵列鉴定CNV。结论我们的结果为单个牛基因组中不同类别的基因组变异提供了高分辨率的图谱,并证明结构变异超越了序列变异,成为基因组变异的主要组成部分。当使用实现映射质量的算法时,SNP检测的准确性更高,灵敏度损失很小。发现IBD区域有助于计算重测序SNP的准确性,而CNV内的SNP检测往往较不可靠。 CNV发现受到平台分辨率和覆盖率偏差的极大影响。这项研究的综合数据表明,在中等程度的测序覆盖率下,可以将平台和工具集成在一起,以最大程度地准确检测序列和结构变体。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号