首页> 外文期刊>BMC Genomics >SMRT genome assembly corrects reference errors, resolving the genetic basis of virulence in Mycobacterium tuberculosis
【24h】

SMRT genome assembly corrects reference errors, resolving the genetic basis of virulence in Mycobacterium tuberculosis

机译:SMRT基因组装配可纠正参考错误,解决结核分枝杆菌毒力的遗传基础

获取原文
           

摘要

Background The genetic basis of virulence in Mycobacterium tuberculosis has been investigated through genome comparisons of virulent (H37Rv) and attenuated (H37Ra) sister strains. Such analysis, however, relies heavily on the accuracy of the sequences. While the H37Rv reference genome has had several corrections to date, that of H37Ra is unmodified since its original publication. Results Here, we report the assembly and finishing of the H37Ra genome from single-molecule, real-time (SMRT) sequencing. Our assembly reveals that the number of H37Ra-specific variants is less than half of what the Sanger-based H37Ra reference sequence indicates, undermining and, in some cases, invalidating the conclusions of several studies. PE_PPE family genes, which are intractable to commonly-used sequencing platforms because of their repetitive and GC-rich nature, are overrepresented in the set of genes in which all reported H37Ra-specific variants are contradicted. Further, one of the sequencing errors in H37Ra masks a true variant in common with the clinical strain CDC1551 which, when considered in the context of previous work, corresponds to a sequencing error in the H37Rv reference genome. Conclusions Our results constrain the set of genomic differences possibly affecting virulence by more than half, which focuses laboratory investigation on pertinent targets and demonstrates the power of SMRT sequencing for producing high-quality reference genomes.
机译:背景技术已经通过对有毒(H37Rv)和减毒(H37Ra)姐妹菌株进行基因组比较研究了结核分枝杆菌中毒力的遗传基础。但是,这种分析在很大程度上依赖于序列的准确性。尽管迄今为止,H37Rv参考基因组已进行了多次校正,但自其最初发表以来,H37Ra的基因组未经过修改。结果在这里,我们报道了单分子实时(SMRT)测序的H37Ra基因组的组装和完成。我们的大会揭示,H37Ra特异性变体的数量少于基于Sanger的H37Ra参考序列所指示序列的一半,从而破坏了某些研究的结论,并在某些情况下使这些研究结论无效。 PE_PPE家族基因由于其重复性和富含GC的性质而对常用测序平台难以处理,因此在所有报告的H37Ra特异性变体均与之相矛盾的基因集中被过度表达。此外,H37Ra中的一种测序错误掩盖了与临床菌株CDC1551相同的真实变异,当在先前工作中考虑时,它对应于H37Rv参考基因组中的测序错误。结论我们的结果将可能影响毒力的基因组差异限制了一半以上,这将实验室研究集中在相关靶标上,并证明了SMRT测序对产生高质量参考基因组的作用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号