【24h】

Statistical Analysis of Electrophoresis Time Series for Improving Basecalling in DNA Sequencing

机译:用于改善DNA测序中碱基检出的电泳时间序列的统计分析

获取原文
获取原文并翻译 | 示例

摘要

In automated DNA sequencing, the final algorithmic phase, referred to as basecalling, consists of the translation of four time signals in the form of peak sequences (electropherogram) to the corresponding sequence of bases. Commercial basecallers detect the peaks based on heuristics, and are very efficient when the peaks are distinct and regular in spread, amplitude and spacing. Unfortunately, in the practice the signals are subject to several degradations, among which peak superposition and peak merging are the most frequent. In these cases the experiment must be repeated and human intervention is required. Recently, there have been attempts to provide methodological foundations to the problem and to use statistical models for solving it. In this paper, we exploit a priori information and Bayesian estimation to remove degradations and recover the signals in an impulsive form which makes basecalling straightforward.
机译:在自动DNA测序中,最后的算法阶段(称为碱基调用)包括将四个时间信号以峰序列(电泳图)的形式转换为相应的碱基序列。商业基础呼叫者根据启发式方法检测峰值,并且当峰值在扩展,幅度和间隔上不同且规则时,效率很高。不幸的是,在实践中,信号会经历几种劣化,其中最常见的是峰叠加和峰合并。在这些情况下,必须重复实验,并且需要人工干预。近来,已经尝试为该问题提供方法论基础并使用统计模型来解决该问题。在本文中,我们利用先验信息和贝叶斯估计来消除降级并以脉冲形式恢复信号,从而使基调变得简单。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号