BMC Genomics

Mining of haplotype-based expressed sequence tag single nucleotide polymorphisms in citrus

Abstract

Background: Single nucleotide polymorphisms (SNPs), the most abundant variations in a genome, have been widely used in various studies. Detection and characterization of citrus haplotype-based expressed sequence tag (EST) SNPs will greatly facilitate further utilization of these gene-based resources.

Results: In this paper, haplotype-based SNPs were mined from publicly available citrus ESTs, both individually for each citrus cultivar (genotype) and collectively, for comparison. A total of 567,297 ESTs from 27 cultivars were used; the number of ESTs varied by cultivar and consequently yielded different numbers of haplotype-based quality SNPs. Sweet orange (SO) had the most ESTs (213,830), generating 11,182 quality SNPs in 3,327 out of 4,228 usable contigs. Summed across all the individual mining results, a total of 25,417 quality SNPs were discovered: 15,010 (59.1%) were transitions (A/G and C/T), 9,114 (35.9%) were transversions (A/C, G/T, C/G, and A/T), and 1,293 (5.0%) were insertions/deletions (indels). As expected, the vast majority of SNP-containing contigs consisted of only 2 haplotypes, but the percentage of 2-haplotype contigs varied widely among these citrus cultivars. BLAST of the 25,417 25-mer SNP oligos against the Clementine reference genome scaffolds revealed that 2,947 SNPs had "no hits found", 19,943 had one unique hit and alignment, 1,571 had one hit with two or more alignments, and 956 had two or more hits with one or more alignments per hit. Of the total 24,293 scaffold hits, 23,955 (98.6%) were on the main scaffolds 1 to 9, and only 338 were on 87 minor scaffolds. Alignments with 100% (25/25) or 96% (24/25) nucleotide identity accounted for 93% of all alignments. Because almost all the nucleotide discrepancies in the 24/25 alignments were at the SNP sites, these alignments served as in silico validation of the SNPs, in addition to and consistent with the 81% validation rate obtained by sequencing and SNaPshot assay.

Conclusions: High-quality EST-SNPs from different citrus genotypes were detected and compared to estimate the heterozygosity of each genome. All the SNP oligo sequences were aligned with the Clementine citrus genome to determine their distribution and uniqueness and for in silico validation, in addition to SNaPshot and sequencing validation of selected SNPs.
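
To illustrate the SNP categories used in the Results, here is a minimal Python sketch of how a biallelic variant can be sorted into transition, transversion, or indel, followed by a consistency check on the counts quoted in the abstract. This is not the authors' mining pipeline. The function names (`classify_snp`, `tally_snps`), the dictionary names, and the use of `-` to mark a gap are assumptions made for illustration only.

```python
from collections import Counter

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
BASES = PURINES | PYRIMIDINES


def classify_snp(allele_a: str, allele_b: str) -> str:
    """Classify a biallelic variant.

    Returns 'transition' (purine<->purine or pyrimidine<->pyrimidine),
    'transversion' (purine<->pyrimidine), or 'indel' (one allele is a gap).
    A gap is written as '-' here; that convention is an assumption.
    """
    a, b = allele_a.upper(), allele_b.upper()
    if a == b:
        raise ValueError("alleles are identical; not a polymorphism")
    if "-" in (a, b):
        return "indel"
    if a not in BASES or b not in BASES:
        raise ValueError(f"unsupported allele pair: {allele_a}/{allele_b}")
    same_class = {a, b} <= PURINES or {a, b} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


def tally_snps(allele_pairs):
    """Count variant classes over an iterable of (allele_a, allele_b) pairs."""
    return Counter(classify_snp(a, b) for a, b in allele_pairs)


# Counts as quoted in the abstract (not recomputed from sequence data).
REPORTED_COUNTS = {"transition": 15_010, "transversion": 9_114, "indel": 1_293}
BLAST_CATEGORIES = {
    "no hits found": 2_947,
    "one unique hit and alignment": 19_943,
    "one hit, two or more alignments": 1_571,
    "two or more hits": 956,
}

# Both breakdowns should account for every one of the 25,417 quality SNPs.
assert sum(REPORTED_COUNTS.values()) == 25_417
assert sum(BLAST_CATEGORIES.values()) == 25_417
```

For example, `classify_snp("A", "G")` falls in the transition class and `classify_snp("A", "C")` in the transversion class, matching the allele pairs listed in the Results.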
