Optimized code design for constrained DNA data storage with asymmetric errors

With ultra-high density and preservation longevity, deoxyribonucleic acid (DNA)-based data storage is becoming an emerging storage technology. Limited by the current biochemical techniques, data might be corrupted during the processes of DNA data storage. A hybrid coding architecture consisting of m...

Full description

Saved in:
Bibliographic Details
Main Authors: Md. Noor-A-Rahim, Guan, Yong Liang, Shi, Zhiping, Gunawan, Erry, Poh, Chueh Loo, Deng, Li, Wang, Yixin
Other Authors: School of Electrical and Electronic Engineering
Format: Article
Language:English
Published: 2019
Subjects:
Online Access:https://hdl.handle.net/10356/83513
http://hdl.handle.net/10220/49766
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-83513
record_format dspace
spelling sg-ntu-dr.10356-835132020-03-07T13:57:23Z Optimized code design for constrained DNA data storage with asymmetric errors Md. Noor-A-Rahim Guan, Yong Liang Shi, Zhiping Gunawan, Erry Poh, Chueh Loo Deng, Li Wang, Yixin School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering DNA Data Storage Protograph LDPC Codes With ultra-high density and preservation longevity, deoxyribonucleic acid (DNA)-based data storage is becoming an emerging storage technology. Limited by the current biochemical techniques, data might be corrupted during the processes of DNA data storage. A hybrid coding architecture consisting of modified variable-length run-length limited (VL-RLL) codes and optimized protograph low-density parity-check (LDPC) codes is proposed in order to suppress error occurrence and correct asymmetric substitution errors. Based on the analyses of the different asymmetric DNA sequencer channel models, a series of the protograph LDPC codes are optimized using a modified extrinsic information transfer algorithm (EXIT). The simulation results show the better error performance of the proposed protograph LDPC codes over the conventional good codes and the codes used in the existing DNA data storage system. In addition, the theoretical analysis shows that the proposed hybrid coding scheme stores ~1.98 bits per nucleotide (bits/nt) with only 1% gap from the upper boundary (2 bits/nt). Published version 2019-08-23T06:43:38Z 2019-12-06T15:24:37Z 2019-08-23T06:43:38Z 2019-12-06T15:24:37Z 2019 Journal Article Deng, L., Wang, Y., Md. Noor-A-Rahim, Guan, Y. L., Shi, Z., Gunawan, E., & Poh, C. L. (2019). Optimized code design for constrained DNA data storage with asymmetric errors. IEEE Access, 7, 84107-84121. doi:10.1109/ACCESS.2019.2924827 https://hdl.handle.net/10356/83513 http://hdl.handle.net/10220/49766 10.1109/ACCESS.2019.2924827 en IEEE Access © 2019 IEEE. This journal is 100% open access, which means that all content is freely available without charge to users or their institutions. All articles accepted after 12 June 2019 are published under a CC BY 4.0 license*, and the author retains copyright. Users are allowed to read, download, copy, distribute, print, search, or link to the full texts of the articles, or use them for any other lawful purpose, as long as proper attribution is given. 15 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
DNA Data Storage
Protograph LDPC Codes
spellingShingle Engineering::Electrical and electronic engineering
DNA Data Storage
Protograph LDPC Codes
Md. Noor-A-Rahim
Guan, Yong Liang
Shi, Zhiping
Gunawan, Erry
Poh, Chueh Loo
Deng, Li
Wang, Yixin
Optimized code design for constrained DNA data storage with asymmetric errors
description With ultra-high density and preservation longevity, deoxyribonucleic acid (DNA)-based data storage is becoming an emerging storage technology. Limited by the current biochemical techniques, data might be corrupted during the processes of DNA data storage. A hybrid coding architecture consisting of modified variable-length run-length limited (VL-RLL) codes and optimized protograph low-density parity-check (LDPC) codes is proposed in order to suppress error occurrence and correct asymmetric substitution errors. Based on the analyses of the different asymmetric DNA sequencer channel models, a series of the protograph LDPC codes are optimized using a modified extrinsic information transfer algorithm (EXIT). The simulation results show the better error performance of the proposed protograph LDPC codes over the conventional good codes and the codes used in the existing DNA data storage system. In addition, the theoretical analysis shows that the proposed hybrid coding scheme stores ~1.98 bits per nucleotide (bits/nt) with only 1% gap from the upper boundary (2 bits/nt).
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Md. Noor-A-Rahim
Guan, Yong Liang
Shi, Zhiping
Gunawan, Erry
Poh, Chueh Loo
Deng, Li
Wang, Yixin
format Article
author Md. Noor-A-Rahim
Guan, Yong Liang
Shi, Zhiping
Gunawan, Erry
Poh, Chueh Loo
Deng, Li
Wang, Yixin
author_sort Md. Noor-A-Rahim
title Optimized code design for constrained DNA data storage with asymmetric errors
title_short Optimized code design for constrained DNA data storage with asymmetric errors
title_full Optimized code design for constrained DNA data storage with asymmetric errors
title_fullStr Optimized code design for constrained DNA data storage with asymmetric errors
title_full_unstemmed Optimized code design for constrained DNA data storage with asymmetric errors
title_sort optimized code design for constrained dna data storage with asymmetric errors
publishDate 2019
url https://hdl.handle.net/10356/83513
http://hdl.handle.net/10220/49766
_version_ 1681049454170341376