DNA Barcoding through Quaternary LDPC Codes

Abstract

Many parallel applications of Next-Generation Sequencing (NGS) technologies demand short barcodes that can accurately multiplex a large number of samples. To reconcile these competing requirements, the use of error-correcting codes is advised. Current barcoding systems are mostly built from short random error-correcting codes, a feature that strongly limits their multiplexing accuracy and experimental scalability. To overcome these problems on sequencing systems impaired by mismatch errors, the alternative use of binary BCH and pseudo-quaternary Hamming codes has been proposed. However, these codes either fail to offer a fine-grained choice of barcode sizes (BCH) or have intrinsically poor error-correcting abilities (Hamming). Here, the design of barcodes from shortened binary BCH codes and quaternary Low Density Parity Check (LDPC) codes is introduced. Simulation results show that although accurate barcoding systems of high multiplexing capacity can be obtained with any of these codes, quaternary LDPC codes may be particularly advantageous because of their lower rates of read losses and undetected sample misidentification errors. Even at mismatch error rates of 10^-2 per base, 24-nt LDPC barcodes can be used to multiplex roughly 2000 samples with a sample misidentification error rate on the order of 10^-9, at the expense of a read-loss rate of only about 10^-6.
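To put the figures quoted in the abstract in perspective, the following is a minimal sketch of two back-of-the-envelope calculations: how often a 24-nt barcode picks up mismatches at a per-base error rate of 10^-2, and how much of the barcode's raw capacity is spent on redundancy when roughly 2000 samples are multiplexed. It assumes independent, identically distributed mismatch errors, which is a simplification introduced here and not a claim from the paper. The function names are hypothetical, and the sketch does not implement the LDPC decoder or reproduce the reported simulation results.

```python
from math import comb, log2


def prob_mismatches(n: int, p: float, k: int) -> float:
    """Probability of exactly k mismatches in an n-nt barcode.

    Assumes each base is hit independently with probability p
    (an illustrative assumption, not taken from the paper).
    """
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def prob_more_than(n: int, p: float, t: int) -> float:
    """Probability that more than t mismatches occur in an n-nt barcode."""
    return 1.0 - sum(prob_mismatches(n, p, k) for k in range(t + 1))


def code_rate(num_samples: int, n: int) -> float:
    """Information bits per raw bit for num_samples barcodes of n bases.

    Each base carries 2 bits (quaternary alphabet A, C, G, T).
    """
    return log2(num_samples) / (2 * n)


if __name__ == "__main__":
    n, p = 24, 1e-2  # barcode length and per-base mismatch rate from the abstract
    for t in range(5):
        print(f"P(more than {t} mismatches) = {prob_more_than(n, p, t):.3e}")
    # "roughly 2000 samples" is the approximate figure quoted in the abstract
    print(f"approximate code rate = {code_rate(2000, n):.3f}")
```

Under these assumptions a barcode with no error protection would be corrupted in a substantial fraction of reads, which illustrates why the abstract argues for spending most of the 24 bases on redundancy.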