Constrained Coding with Error Control for DNA-Based Data Storage

Abstract

In this paper, we first propose coding techniques for DNA-based data storage that account for the maximum homopolymer runlength and the GC-content. In particular, for arbitrary ℓ, ϵ > 0, we propose simple and efficient (ℓ,ϵ)-constrained encoders that transform binary sequences into DNA base sequences (codewords) satisfying the following properties:

• Runlength constraint: the maximum homopolymer run in each codeword is at most ℓ.
• GC-content constraint: the GC-content of each codeword is within [0.5−ϵ, 0.5+ϵ].

For practical values of ℓ and ϵ, our codes achieve higher rates than existing results in the literature. We further design efficient (ℓ,ϵ)-constrained codes with error-correction capability. Specifically, the designed codes satisfy the runlength constraint and the GC-content constraint, and can correct a single edit (i.e., a single deletion, insertion, or substitution) and its variants. To the best of our knowledge, no such codes had been constructed prior to this work.
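
As a reading aid, the two constraints named in the abstract can be checked for a given sequence with a few lines of Python. This is only a minimal sketch of a constraint checker, not the encoder proposed in the paper; the function names (`max_homopolymer_run`, `gc_content`, `is_constrained`) and the parameter names `ell` and `eps` are illustrative assumptions and do not come from the source.

```python
from itertools import groupby


def max_homopolymer_run(seq: str) -> int:
    """Length of the longest run of identical consecutive bases (0 if empty)."""
    return max((sum(1 for _ in group) for _, group in groupby(seq)), default=0)


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence must be non-empty")
    return sum(base in "GC" for base in seq) / len(seq)


def is_constrained(seq: str, ell: int, eps: float) -> bool:
    """Check both constraints (hypothetical helper, not the paper's encoder):
    longest homopolymer run <= ell, and GC-content within [0.5 - eps, 0.5 + eps].
    """
    run_ok = max_homopolymer_run(seq) <= ell
    gc_ok = abs(gc_content(seq) - 0.5) <= eps
    return run_ok and gc_ok
```

For instance, `is_constrained("ACGTACGT", 1, 0.1)` holds because no base repeats and exactly half the bases are G or C, whereas `"AAACGT"` violates a runlength limit of ℓ = 2 and `"AATT"` violates any GC-content tolerance ϵ < 0.5.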

Bibliographic details

Source
IEEE International Symposium on Information Theory, 2020, pp. 694-699 (6 pages)
Authors
Tuan Thanh Nguyen; Kui Cai; Kees A. Schouhamer Immink; Han Mao Kiah
Original format: PDF
