首页> 外文期刊>BMC Genomics >Methylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domains
【24h】

Methylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domains

机译:甲基化敏感的连接库可增强复杂基因组的基因富集测序,并绘制DNA甲基化结构域图

获取原文
           

摘要

Background Many plant genomes are resistant to whole-genome assembly due to an abundance of repetitive sequence, leading to the development of gene-rich sequencing techniques. Two such techniques are hypomethylated partial restriction (HMPR) and methylation spanning linker libraries (MSLL). These libraries differ from other gene-rich datasets in having larger insert sizes, and the MSLL clones are designed to provide reads localized to "epigenetic boundaries" where methylation begins or ends. Results A large-scale study in maize generated 40,299 HMPR sequences and 80,723 MSLL sequences, including MSLL clones exceeding 100 kb. The paired end reads of MSLL and HMPR clones were shown to be effective in linking existing gene-rich sequences into scaffolds. In addition, it was shown that the MSLL clones can be used for anchoring these scaffolds to a BAC-based physical map. The MSLL end reads effectively identified epigenetic boundaries, as indicated by their preferential alignment to regions upstream and downstream from annotated genes. The ability to precisely map long stretches of fully methylated DNA sequence is a unique outcome of MSLL analysis, and was also shown to provide evidence for errors in gene identification. MSLL clones were observed to be significantly more repeat-rich in their interiors than in their end reads, confirming the correlation between methylation and retroelement content. Both MSLL and HMPR reads were found to be substantially gene-enriched, with the SalI MSLL libraries being the most highly enriched (31% align to an EST contig), while the HMPR clones exhibited exceptional depletion of repetitive DNA (to ~11%). These two techniques were compared with other gene-enrichment methods, and shown to be complementary. Conclusion MSLL technology provides an unparalleled approach for mapping the epigenetic status of repetitive blocks and for identifying sequences mis-identified as genes. Although the types and natures of epigenetic boundaries are barely understood at this time, MSLL technology flags both approximate boundaries and methylated genes that deserve additional investigation. MSLL and HMPR sequences provide a valuable resource for maize genome annotation, and are a uniquely valuable complement to any plant genome sequencing project. In order to make these results fully accessible to the community, a web display was developed that shows the alignment of MSLL, HMPR, and other gene-rich sequences to the BACs; this display is continually updated with the latest ESTs and BAC sequences.
机译:背景技术由于大量的重复序列,许多植物基因组对全基因组装配有抵抗力,从而导致了基因富集的测序技术的发展。两种这样的技术是低甲基化部分限制性酶(HMPR)和甲基化跨度接头文库(MSLL)。这些文库与其他富含基因的数据集的不同之处在于,它们具有较大的插入片段大小,并且MSLL克隆旨在提供定位于甲基化开始或终止的“表观遗传边界”的读码。结果对玉米的大规模研究产生了40,299条HMPR序列和80,723条MSLL序列,包括超过100 kb的MSLL克隆。已显示MSLL和HMPR克隆的配对末端读段在将现有的富基因序列连接到支架中是有效的。另外,已表明MSLL克隆可用于将这些支架锚定至基于BAC的物理图谱。 MSLL末端读取有效识别的表观遗传边界,如它们优先于注释基因上游和下游区域的序列所示。精确定位长片段的完全甲基化DNA序列的能力是MSLL分析的独特成果,并且还被证明可提供基因鉴定错误的证据。观察到MSLL克隆的内部比其末端读段的重复序列富集得多,这证实了甲基化与反转录元素含量之间的相关性。发现MSLL和HMPR读物均实质上被基因富集,其中SalI MSLL文库的富集度最高(与EST重叠群比对为31%),而HMPR克隆表现出异常的重复DNA消耗(至〜11%)。 。将这两种技术与其他基因富集方法进行了比较,并证明是互补的。结论MSLL技术提供了一种无与伦比的方法,可用于绘制重复块的表观遗传状态并鉴定被误识别为基因的序列。尽管目前尚不了解表观遗传边界的类型和性质,但MSLL技术同时标记了近似边界和甲基化基因,值得进一步研究。 MSLL和HMPR序列为玉米基因组注释提供了宝贵的资源,并且是任何植物基因组测序项目的独特有价值的补充。为了使社区能够完全获得这些结果,开发了一个网络显示器,该显示器显示了MSLL,HMPR和其他富含基因的序列与BAC的比对;此显示将使用最新的EST和BAC序列进行不断更新。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号