首页> 外文会议>Research in computational molecular biology >Deep Sequencing of a Genetically Heterogeneous Sample: Local Haplotype Reconstruction and Read Error Correction
【24h】

Deep Sequencing of a Genetically Heterogeneous Sample: Local Haplotype Reconstruction and Read Error Correction

机译:遗传异质样品的深度测序:局部单倍型重建和读取错误校正。

获取原文
获取原文并翻译 | 示例

摘要

We present a computational method for analyzing deep sequencing data obtained from a genetically diverse sample. The set of reads obtained from a deep sequencing experiment represents a statistical sample of the underlying population. We develop a generative probabilistic model for assigning observed reads to unobserved haplotypes in the presence of sequencing errors. This clustering problem is solved in a Bayesian fashion using the Dirichlet process mixture to define a prior distribution on the unknown number of haplotypes in the mixture. We devise a Gibbs sampler for sampling from the joint posterior distribution of haplotype sequences, assignment of reads to haplotypes, and error rate of the sequencing process to obtain estimates of the local haplotype structure of the population. The method is evaluated on simulated data and on experimental deep sequencing data obtained from HIV samples.
机译:我们提出了一种计算方法,用于分析从遗传多样性样品中获得的深度测序数据。从深度测序实验中获得的一组读数代表基础人群的统计样本。我们开发了一种生成概率模型,用于在存在测序错误的情况下将观察到的读数分配给未观察到的单倍型。使用Dirichlet过程混合物以贝叶斯方式解决了该聚类问题,以定义混合物中未知数量单倍型的先验分布。我们设计了一种吉布斯采样器,用于从单倍型序列的联合后验分布,对单倍型的读段分配以及测序过程的错误率中进行采样,以获得总体种群单倍型结构的估计值。从艾滋病毒样本获得的模拟数据和实验性深度测序数据对方法进行了评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号