Encoding scheme for data storage and retrieval on DNA computers,IET Nanobiotechnology

当前位置： X-MOL 学术 › IET Nanobiotechnol. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Encoding scheme for data storage and retrieval on DNA computers
IET Nanobiotechnology ( IF 3.8 ) Pub Date : 2020-10-05 , DOI: 10.1049/iet-nbt.2020.0157
Dolly Sharma ₁ , Ranjit Kumar ₂ , Mayuri Gupta ₁ , Tanisha Saxena ₁

Affiliation

There has been exponential growth in the amount of data being generated on a daily basis. Such a huge amount of data creates a need for efficient data storage techniques. Due to the limitations of existing storage media, new storage solutions have always been of interest. There have been recent developments in order to efficiently use synthetic deoxyribonucleic acid (DNA) for information storage. DNA storage has attracted researchers because of its extremely high data storage density, about 1 exabyte/mm ³ and long life under easily achievable conditions. This work presents an encoding scheme for DNA-based data storage system with controllable redundancy and reliability, the authors have also talked about the feasibility of the proposed method. The authors have also analysed the proposed algorithm for time and space complexity. The proposed encoding scheme tries to minimise the bases per letter ratio while controlling the redundancy. They have experimented with three different types of data with a value of redundancy as 0.75. In the randomised simulation setup, it was observed that the proposed algorithm was able to correctly retrieve the stored data in our experiments about 94% of the time. In the situation, where redundancy was increased to 1, the authors were able to retrieve all the information correctly in the proposed experiments.

中文翻译：

DNA 计算机上数据存储和检索的编码方案

每天生成的数据量呈指数级增长。如此大量的数据需要高效的数据存储技术。由于现有存储介质的局限性，新的存储解决方案一直备受关注。为了有效地使用合成脱氧核糖核酸（DNA）进行信息存储，最近取得了进展。 DNA存储因其极高的数据存储密度（约1 exabyte/mm ^{3 ）}以及易于实现的条件下的长寿命而吸引了研究人员。该工作提出了一种基于DNA的数据存储系统的编码方案，具有可控冗余性和可靠性，作者还讨论了该方法的可行性。作者还分析了所提出算法的时间和空间复杂度。所提出的编码方案试图在控制冗余的同时最小化每个字母的碱基比率。他们用冗余值为 0.75 的三种不同类型的数据进行了实验。在随机模拟设置中，我们观察到所提出的算法能够在大约 94% 的时间内正确检索实验中存储的数据。在冗余度增加到 1 的情况下，作者能够在所提出的实验中正确检索所有信息。

更新日期：2020-10-06

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南11