首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Overcoming High Nanopore Basecaller Error Rates for DNA Storage via Basecaller-Decoder Integration and Convolutional Codes
【24h】

Overcoming High Nanopore Basecaller Error Rates for DNA Storage via Basecaller-Decoder Integration and Convolutional Codes

机译:通过Basecaller-Decoder集成和卷积码克服DNA储存的高纳米孔Basecaller错误率

获取原文

摘要

As magnetization and semiconductor based storage technologies approach their limits, bio-molecules, such as DNA, have been identified as promising media for future storage systems, due to their high storage density (petabytes/gram) and long-term durability (thousands of years). Furthermore, nanopore DNA sequencing enables high-throughput sequencing using devices as small as a USB thumb drive and thus is ideally suited for DNA storage applications. Due to the high insertion/deletion error rates associated with base-called nanopore reads, current approaches rely heavily on consensus among multiple reads and thus incur very high reading costs. We propose a novel approach which overcomes the high error rates in basecalled sequences by integrating a Viterbi error correction decoder with the basecaller, enabling the decoder to exploit the soft information available in the deep learning based basecaller pipeline. Using convolutional codes for error correction, we experimentally observed 3x lower reading costs than the state-of-the-art techniques at comparable writing costs.The code, data and Supplementary Material is available at https://github.com/shubhamchandak94anopore_dna_storage.
机译:随着磁化和基于半导体的存储技术接近极限,由于其高存储密度(PB /克)和长期耐用性(数千年),DNA等生物分子已被认为是未来存储系统的有希望的介质。 )。此外,纳米孔DNA测序可使用像USB拇指驱动器这样的小型设备进行高通量测序,因此非常适合DNA存储应用。由于与称为碱基的纳米孔读段相关的高插入/缺失错误率,当前的方法严重依赖于多个读段之间的共识,因此产生非常高的读成本。我们提出了一种新颖的方法,该方法通过将Viterbi纠错解码器与基本调用程序集成在一起,克服了基本调用序列中的高错误率,使解码器能够利用基于深度学习的基本调用程序管道中可用的软信息。使用卷积码进行纠错,我们在实验上观察到在可比的写作成本下阅读成本比最新技术低3倍。代码,数据和补充材料可在https://github.com/shubhamchandak94/nanopore_dna_storage获得。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号