Coding over Sets for DNA Storage,IEEE Transactions on Information Theory

当前位置： X-MOL 学术 › IEEE Trans. Inform. Theory › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Coding over Sets for DNA Storage
IEEE Transactions on Information Theory ( IF 2.2 ) Pub Date : 2020-04-01 , DOI: 10.1109/tit.2019.2961265
Andreas Lenz , Paul H. Siegel , Antonia Wachter-Zeh , Eitan Yaakobi

In this paper we study error-correcting codes for the storage of data in synthetic deoxyribonucleic acid (DNA). We investigate a storage model where a data set is represented by an unordered set of

$M$

sequences, each of length

$L$

. Errors within that model are a loss of whole sequences and point errors inside the sequences, such as insertions, deletions and substitutions. We derive Gilbert-Varshamov lower bounds and sphere packing upper bounds on achievable cardinalities of error-correcting codes within this storage model. We further propose explicit code constructions than can correct errors in such a storage system that can be encoded and decoded efficiently. Comparing the sizes of these codes to the upper bounds, we show that many of the constructions are close to optimal.

中文翻译：

对 DNA 存储的集合进行编码

在本文中，我们研究了用于在合成脱氧核糖核酸 (DNA) 中存储数据的纠错代码。我们研究了一个存储模型，其中数据集由一组无序的

百万美元

序列，每个长度

$L$

. 该模型中的错误是整个序列的丢失和序列内部的点错误，例如插入、删除和替换。我们根据该存储模型中可实现的纠错码基数推导出 Gilbert-Varshamov 下界和球体填充上限。我们进一步提出了显式代码结构，以便在可以有效编码和解码的存储系统中纠正错误。将这些代码的大小与上限进行比较，我们表明许多结构接近最优。

更新日期：2020-04-01

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南11