首页> 外文学位 >Novel strategies to increase read length and accuracy for DNA sequencing by synthesis.
【24h】

Novel strategies to increase read length and accuracy for DNA sequencing by synthesis.

机译:通过合成增加DNA测序的读取长度和准确性的新策略。

获取原文
获取原文并翻译 | 示例

摘要

The completion of the Human Genome Project has increased the need for high-throughput DNA sequencing technologies aimed at uncovering the genomic contributions to diseases. The DNA sequencing by synthesis (SBS) approach has shown great promise as a new platform for decoding the genome. This thesis focuses on the development and improvement of a chip-based four-color DNA SBS platform using molecular engineering approaches. In this approach, four nucleotides (A, C, G, T) are modified as fluorescent nucleotide reversible terminators (CF-NRTs) by tethering a cleavable fluorophore to the base and capping the 3'-OH with a small chemically reversible moiety so that the nucleotide analogues are still recognizable as substrates by DNA polymerase. First, we explored the potential of using an azido modified group for nucleotide modification. Based on our established rationale for nucleotide reversible terminator (NRT) design, we synthesized a complete set of NRTs capped at the 3' position with an azidomethyl group (3'-O-N3-dATP, 3'-O-N3-dCTP, 3'-O-N3-dGTP, 3'-O-N 3-dTTP). Through testing and optimization, it was apparent that these NRTs were good substrates of a DNA polymerase. Afterwards, we worked out an optimum chemical cleavage condition to remove the azidomethyl group capping the 3'-OH of the nucleotide analogues under conditions that were compatible with DNA, allowing the next NRT to be incorporated in the subsequent polymerase reaction. We then designed and synthesized two sets of azido-modified CF-NRTs for applications in SBS. The four CF-NRTs of the first set (3'-N 3-O-dNTP-azidomethylbenzoyl-fluorophores) were capped at the 3'-OH with an azidomethyl group identical to the NRTs and contained a substituted 2-azidomethylbenzoyl linker to tether a fluorophore. These CF-NRTs were used to produce four-color de novo DNA sequencing data on a chip based our sequencing by synthesis approach. After each round of sequencing, both the fluorophores linked to the CF-NRTs and the 3'-azidomethyl group on the DNA extension products generated by incorporating 3'-O-N 3-dNTP-azidomethylbenzoyl-fluorophores were removed using a TCEP [Tris(2-carboxyethyl)phosphine] cleavage solution. This one-step dual-cleavage process for reinitiating the polymerase reaction increased the overall SBS efficiency. After confirming the feasibility of implementing azido-modified CF-NRTs in SBS, we synthesized a second set of CF-NRTs (3'-O-N3-dNTP-N3-fluorophores) to further improve and optimize the sequencing process. During the incorporation stage of SBS, a mixture of CF-NRTs and NRTs was used to simultaneously extend the primer strand of various target DNA linear templates. This approach led to a more efficient DNA polymerase reaction since the smaller 3'-O-N 3-dNTPs were much easier to incorporate. Moreover primers extended with NRTs resembled nascent strands of DNA that had no traces of modification after cleavage of the 3'-azidomethyl capping group. After the incorporation reaction, two separate capping steps, first with 3'-O-N3-dNTPs and then with ddNTPs, were performed to synchronize all the templates on the surface. Without these precautionary synchronization procedures, mixed fluorescent signals would prevent the identification of the correctly incorporated nucleotide. Hence, we have successfully addressed one of the key drawbacks of SBS, which was the miscalling of the base due to lagging signals. In addition, since both 3'-O-N3-dNTP-N3-fluorophores and 3'-O-N 3-dNTPs were reversible terminators, which allow the sequencing of each base in a serial manner, they could accurately determine the homopolymeric regions of DNA. Finally, we developed a novel template walking strategy to increase read length for DNA SBS. The template walking method involved resetting the sequencing start site by extending the sequencing primer with three natural nucleotides and one NRT so that the polymerase reaction was temporarily paused when the NRT was incorporated. Upon restoring the 3'-OH group of the NRT incorporated into the primer via cleavage, the next cycle of walking could be carried out until the entire previously sequenced portion of the template was skipped. We have successfully demonstrated the integration of this template walking strategy into our four-color DNA SBS platform by performing one round of SBS, four cycles of template walking reactions, and then a second round of SBS. Through this effort, we were able to sequence a linear DNA template in its entirety, nearly doubling the read length of our previous sequencing results. We are also taking advantage of the massive throughput of a next generation sequencer that is based on our SBS technology to conduct digital gene expression study of Aplysia central nervous system in an ongoing project that explores the molecular mechanism of long-term memory formation.
机译:人类基因组计划的完成增加了对高通量DNA测序技术的需求,该技术旨在揭示疾病的基因组贡献。 DNA合成测序(SBS)方法作为解码基因组的新平台已显示出巨大的希望。本文主要研究和开发基于分子工程技术的基于芯片的四色DNA SBS平台。在这种方法中,通过将可裂解的荧光团连接至碱基并用小的化学可逆部分封端3'-OH,从而将四个核苷酸(A,C,G,T)修饰为荧光核苷酸可逆终止子(CF-NRT)核苷酸类似物仍可被DNA聚合酶识别为底物。首先,我们探索了使用叠氮基修饰基团进行核苷酸修饰的潜力。根据我们建立的核苷酸可逆终止子(NRT)设计原理,我们合成了一套完整的NRT,它们在3'位带有叠氮甲基(3'-O-N3-dATP,3'-O-N3-dCTP, 3'-O-N3-dGTP,3'-ON 3-dTTP)。通过测试和优化,很明显,这些NRT是DNA聚合酶的良好底物。之后,我们制定了一个最佳的化学裂解条件,以在与DNA相容的条件下去除覆盖核苷酸类似物3'-OH的叠氮基甲基,从而将下一个NRT掺入后续的聚合酶反应中。然后,我们设计并合成了两组用于ABS的叠氮基修饰CF-NRT。第一组的四个CF-NRT(3'-N 3-O-dNTP-叠氮基甲基苯甲酰基-荧光团)在3'-OH上被一个与NRT相同的叠氮基甲基覆盖,并包含一个连接到系链的取代的2-叠氮基甲基苯甲酰基接头荧光团。这些CF-NRT用于基于我们的合成测序方法在芯片上产生四色从头DNA测序数据。在每一轮测序之后,使用TCEP [Tris(2)将与CF-NRT连接的荧光团和通过掺入3'-ON 3-dNTP-叠氮基甲基苯甲酰荧光团而产生的DNA延伸产物上的3'-叠氮基甲基团去除。 -羧乙基)膦]裂解溶液。重新启动聚合酶反应的这一一步双裂解过程提高了整体SBS效率。在确认了在SBS中实施叠氮基修饰CF-NRT的可行性后,我们合成了第二组CF-NRT(3'-O-N3-dNTP-N3-荧光团)以进一步改善和优化测序过程。在SBS的整合阶段,使用CF-NRT和NRT的混合物同时延伸各种目标DNA线性模板的引物链。由于较小的3'-O-N 3-dNTPs易于整合,因此该方法可导致更有效的DNA聚合酶反应。此外,用NRT延伸的引物类似于新生的DNA链,在切割3'-叠氮基甲基封端基团后,没有修饰的痕迹。掺入反应后,首先进行3'-O-N3-dNTPs,然后进行ddNTPs两个单独的封盖步骤,以同步表面上的所有模板。没有这些预防性的同步程序,混合的荧光信号将阻止对正确掺入的核苷酸的鉴定。因此,我们成功地解决了SBS的主要缺点之一,那就是由于信号滞后而导致的基准误判。此外,由于3'-O-N3-dNTP-N3-荧光团和3'-ON 3-dNTPs都是可逆的终止子,允许以序列方式对每个碱基进行测序,因此它们可以准确地确定DNA的均聚物区域。最后,我们开发了一种新颖的模板遍历策略,以增加DNA SBS的阅读长度。模板步移法涉及通过用三个天然核苷酸和一个NRT延伸测序引物来重置测序起始位点,以便在掺入NRT时暂时停止聚合酶反应。在通过切割恢复掺入引物中的NRT的3'-OH基团之后,可以进行下一个循环,直到跳过模板的整个先前测序的部分。我们已经通过执行一轮SBS,四个循环的模板行走反应,然后进行第二轮SBS,成功地证明了将模板行走策略集成到我们的四色DNA SBS平台中。通过这项工作,我们能够对整个线性DNA模板进行测序,几乎使以前测序结果的读取长度翻了一番。在正在进行的探索长期记忆形成的分子机制的项目中,我们还利用了基于我们的SBS技术的下一代测序仪的大量处理能力,来进行Aplysia中枢神经系统的数字基因表达研究。

著录项

  • 作者

    Yu, Lin.;

  • 作者单位

    Columbia University.;

  • 授予单位 Columbia University.;
  • 学科 Engineering Chemical.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 195 p.
  • 总页数 195
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号