首页> 外文期刊>Genome research >Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases
【24h】

Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases

机译:对第三代DNA测序数据中的动力学速率变化建模,以检测对DNA碱基的假定修饰

获取原文
获取原文并翻译 | 示例
       

摘要

Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of interest, progress has not been as dramatic regarding epigenetic changes and base-level damage to DNA, largely due to technological limitations in assaying all known and unknown types of modifications at genome scale. Recently, single-molecule real time (SMRT) sequencing has been reported to identify kinetic variation (KV) events that have been demonstrated to reflect epigenetic changes of every known type, providing a path forward for detecting base modifications as a routine part of sequencing. However, to date no statistical framework has been proposed to enhance the power to detect these events while also controlling for false-positive events. By modeling enzyme kinetics in the neighborhood of an arbitrary location in a genomic region of interest as a conditional random field, we provide a statistical framework for incorporating kinetic information at a test position of interest as well as at neighboring sites that help enhance the power to detect KV events. The performance of this and related models is explored, with the best-performing model applied to plasmid DNA isolated from Escherichia coli and mitochondrial DNA isolated from human brain tissue. We highlight widespread kinetic variation events, some of which strongly associate with known modification events, while others represent putative chemically modified sites of unknown types.
机译:作为科学研究的常规部分,当前一代的DNA测序仪器正在接近无缝测序整个种群的基因组。然而,尽管已经确定了影响目标表型的DNA中的小核苷酸变异和结构变异方面取得了重大进展,但在表观遗传学改变和对DNA的碱基水平损伤方面,进展还不是那么显着,这主要是由于分析所有已知和已知技术的技术局限性基因组规模的未知修饰类型。近来,已经报道了单分子实时(SMRT)测序来鉴定动力学变化(KV)事件,该事件已被证明可反映每种已知类型的表观遗传学变化,从而为检测碱基修饰作为测序的常规部分提供了一条途径。但是,迄今为止,尚未提出统计框架来增强检测这些事件的能力,同时还控制假阳性事件。通过将感兴趣的基因组区域中任意位置附近的酶动力学建模为条件随机场,我们提供了一个统计框架,用于在感兴趣的测试位置以及相邻位点整合动力学信息,从而有助于增强检测KV事件。探索了该模型和相关模型的性能,并将性能最佳的模型应用于从大肠杆菌分离的质粒DNA和从人脑组织分离的线粒体DNA。我们重点介绍了广泛的动力学变化事件,其中一些与已知的修饰事件密切相关,而另一些则代表未知类型的假定化学修饰位点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号