The accuracy of base calls produced by Illumina sequencers is adversely affected by several processes, with laser cross-talk and cluster phasing being prominent. We introduce an explicit statistical model of the sequencing process that generalizes current models of phasing and cross-talk and forms the basis of a base calling method which improves on the best existing base callers, especially when comparing the number of error-free reads. The novel algorithms implemented in All Your Base (AYB) are comparable in speed to other competitive base-calling methods, do not require training data and are designed to be robust to gross errors, producing sensible results where other techniques struggle. AYB is available at