首页> 外文期刊>The Plant Genome >Large-Scale Discovery of Gene-Enriched SNPs
【24h】

Large-Scale Discovery of Gene-Enriched SNPs

机译:大规模发现富含基因的SNP

获取原文
           

摘要

Whole-genome association studies of complex traits in higher eukaryotes require a high density of single nucleotide polymorphism (SNP) markers at genome-wide coverage. To design high-throughput, multiplexed SNP genotyping assays, researchers must first discover large numbers of SNPs by extensively resequencing multiple individuals or lines. For SNP discovery approaches using short read-lengths that next-generation DNA sequencing technologies offer, the highly repetitive and duplicated nature of large plant genomes presents additional challenges. Here, we describe a genomic library construction procedure that facilitates pyrosequencing of genic and low-copy regions in plant genomes, and a customized computational pipeline to analyze and assemble short reads (100–200 bp), identify allelic reference sequence comparisons, and call SNPs with a high degree of accuracy. With maize (Zea mays L.) as the test organism in a pilot experiment, the implementation of these methods resulted in the identification of 126,683 putative SNPs between two maize inbred lines at an estimated false discovery rate (FDR) of 15.1%. We estimated rates of false SNP discovery using an internal control, and we validated these FDR rates with an external SNP dataset that was generated using locus-specific PCR amplification and Sanger sequencing. These results show that this approach has wide applicability for efficiently and accurately detecting gene-enriched SNPs in large, complex plant genomes.
机译:对高等真核生物复杂性状的全基因组关联研究要求在全基因组覆盖范围内具有高密度的单核苷酸多态性(SNP)标记。要设计高通量,多重SNP基因分型测定法,研究人员必须首先通过对多个个体或品系进行广泛的重测序来发现大量SNP。对于下一代DNA测序技术提供的使用短读长的SNP发现方法,大型植物基因组的高度重复性和重复性带来了其他挑战。在这里,我们描述了一个基因组文库构建程序,该程序可促进植物基因组中基因和低拷贝区域的焦磷酸测序,以及定制的计算管道,以分析和组装短读(100-200 bp),识别等位基因参考序列比较并调用SNP高度准确。以玉米(Zea mays L.)作为试点实验中的测试生物,这些方法的实施导致鉴定出两个玉米自交系之间的126,683个推定的SNP,估计的错误发现率(FDR)为15.1%。我们使用内部对照来估计错误SNP发现的发生率,并使用外部SNP数据集(使用基因座特异性PCR扩增和Sanger测序生成的数据)来验证这些FDR发生率。这些结果表明,该方法对于有效,准确地检测大型复杂植物基因组中富含基因的SNP具有广泛的适用性。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号