...
首页> 外文期刊>BMC Genomics >Re-annotation of the physical map of Glycine max for polyploid-like regions by BAC end sequence driven whole genome shotgun read assembly
【24h】

Re-annotation of the physical map of Glycine max for polyploid-like regions by BAC end sequence driven whole genome shotgun read assembly

机译:通过BAC末端序列驱动的全基因组shot弹枪读取装配对多倍体样区域的Glycine max物理图谱的重新注释

获取原文
           

摘要

Background Many of the world's most important food crops have either polyploid genomes or homeologous regions derived from segmental shuffling following polyploid formation. The soybean (Glycine max) genome has been shown to be composed of approximately four thousand short interspersed homeologous regions with 1, 2 or 4 copies per haploid genome by RFLP analysis, microsatellite anchors to BACs and by contigs formed from BAC fingerprints. Despite these similar regions,, the genome has been sequenced by whole genome shotgun sequence (WGS). Here the aim was to use BAC end sequences (BES) derived from three minimum tile paths (MTP) to examine the extent and homogeneity of polyploid-like regions within contigs and the extent of correlation between the polyploid-like regions inferred from fingerprinting and the polyploid-like sequences inferred from WGS matches. Results Results show that when sequence divergence was 1–10%, the copy number of homeologous regions could be identified from sequence variation in WGS reads overlapping BES. Homeolog sequence variants (HSVs) were single nucleotide polymorphisms (SNPs; 89%) and single nucleotide indels (SNIs 10%). Larger indels were rare but present (1%). Simulations that had predicted fingerprints of homeologous regions could be separated when divergence exceeded 2% were shown to be false. We show that a 5–10% sequence divergence is necessary to separate homeologs by fingerprinting. BES compared to WGS traces showed polyploid-like regions with less than 1% sequence divergence exist at 2.3% of the locations assayed. Conclusion The use of HSVs like SNPs and SNIs to characterize BACs wil improve contig building methods. The implications for bioinformatic and functional annotation of polyploid and paleopolyploid genomes show that a combined approach of BAC fingerprint based physical maps, WGS sequence and HSV-based partitioning of BAC clones from homeologous regions to separate contigs will allow reliable de-convolution and positioning of sequence scaffolds (see BES_scaffolds section of SoyGD). This approach will assist genome annotation for paleopolyploid and true polyploid genomes such as soybean and many important cereal and fruit crops.
机译:背景技术世界上许多最重要的粮食作物都具有多倍体基因组或来源于多倍体形成后节段改组的同源区域。通过RFLP分析,BAC的微卫星锚定以及BAC指纹形成的重叠群,大豆(Glycine max)基因组已显示出约4000个短散布的同源区域,每个单倍体基因组具有1、2或4个拷贝。尽管有这些相似的区域,但已通过全基因组shot弹枪序列(WGS)对基因组进行了测序。此处的目的是使用源自三个最小图块路径(MTP)的BAC末端序列(BES)来检查重叠群中多倍体样区域的范围和同质性以及从指纹图谱推断的多倍体样区域与从WGS匹配中推断出多倍体样序列。结果结果显示,当序列差异为1–10%时,可以从WGS读数重叠的BES中的序列变异中确定同源区域的拷贝数。同源序列变异体(HSV)是单核苷酸多态性(SNP; 89%)和单核苷酸插入/缺失(SNI 10%)。较大的插入缺失很少见,但存在(1%)。当差异超过2%时,可以分离出预测同源区域指纹的模拟结果是错误的。我们显示出5–10%的序列差异对于通过指纹分离同系物是必要的。与WGS痕迹相比,BES显示在2.3%的分析位置中存在少于1%序列差异的多倍体样区域。结论使用HSV像SNP和SNI来表征BAC可以改善重叠群构建方法。多倍体和古多倍体基因组的生物信息学和功能注释的含义表明,基于BAC指纹的物理图谱,WGS序列和BAC克隆从同源区域到单独重叠群的基于HSV划分的组合方法将允许可靠的反卷积和序列定位支架(请参见SoyGD的BES_scaffolds部分)。这种方法将有助于古多倍体和真正的多倍体基因组的基因组注释,例如大豆以及许多重要的谷物和水果作物。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号