Rates of DNA sequence profiles for practical values of read lengths

A recent study by one of the authors has demonstrated the importance of profile vectors in DNA-based data storage. We provide exact values and lower bounds on the number of profile vectors for finite values of alphabet size q, read length ℓ, and word length n. Consequently, we demonstrate that for q...
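
The counting question in the abstract can be explored by brute force for very small parameters. The sketch below is illustrative only and is not the authors' method: it assumes the usual definition from the profile-vector literature (which this record does not spell out), namely that the profile vector of a word records how often each length-ℓ substring occurs in it. The function names `profile_vector`, `count_profile_vectors`, and `rate` are hypothetical, and the rate is taken here to be log_q of the number of distinct profile vectors divided by n.

```python
from collections import Counter
from itertools import product
from math import log


def profile_vector(word, q, ell):
    """Assumed definition: count every length-ell substring of `word`.

    `word` is a tuple of symbols from range(q). The result has q**ell
    entries, ordered lexicographically by substring.
    """
    counts = Counter(tuple(word[i:i + ell]) for i in range(len(word) - ell + 1))
    return tuple(counts[gram] for gram in product(range(q), repeat=ell))


def count_profile_vectors(q, ell, n):
    """Exhaustively count distinct profile vectors over all q**n words."""
    return len({profile_vector(w, q, ell) for w in product(range(q), repeat=n)})


def rate(q, ell, n):
    """Hypothetical rate: log_q(number of distinct profile vectors) / n."""
    return log(count_profile_vectors(q, ell, n), q) / n
```

For example, with q = 2, ℓ = 2, n = 3 the words 010 and 101 share a profile vector, so `count_profile_vectors(2, 2, 3)` is smaller than 2^3. The enumeration visits all q^n words, so it is only feasible for tiny n; the paper's exact values and lower bounds address the general case.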

Bibliographic Details
Main Authors: Chang, Zuling; Chrisnata, Johan; Ezerman, Martianus Frederic; Kiah, Han Mao
Other Authors: School of Physical and Mathematical Sciences
Format: Article
Language: English
Published: 2019
Subjects: DRNTU::Science::Physics; Profile Vectors; DNA-based Data Storage
Online Access: https://hdl.handle.net/10356/101858
http://hdl.handle.net/10220/48560
Institution: Nanyang Technological University
id sg-ntu-dr.10356-101858
record_format dspace
spelling sg-ntu-dr.10356-1018582023-02-28T19:43:23Z Rates of DNA sequence profiles for practical values of read lengths Chang, Zuling Chrisnata, Johan Ezerman, Martianus Frederic Kiah, Han Mao School of Physical and Mathematical Sciences DRNTU::Science::Physics Profile Vectors DNA-based Data Storage A recent study by one of the authors has demonstrated the importance of profile vectors in DNA-based data storage. We provide exact values and lower bounds on the number of profile vectors for finite values of alphabet size q, read length ℓ, and word length n. Consequently, we demonstrate that for q ≥ 2 and n ≤ q^{ℓ/2-1}, the number of profile vectors is at least q^{κn} with κ very close to 1. In addition to enumeration results, we provide a set of efficient encoding and decoding algorithms for certain families of profile vectors. Accepted version 2019-06-06T05:50:03Z 2019-12-06T20:45:49Z 2019-06-06T05:50:03Z 2019-12-06T20:45:49Z 2017 Journal Article Chang, Z., Chrisnata, J., Ezerman, M. F., & Kiah, H. M. (2017). Rates of DNA sequence profiles for practical values of read lengths. IEEE Transactions on Information Theory, 63(11), 7166-7177. doi:10.1109/TIT.2017.2747557 0018-9448 https://hdl.handle.net/10356/101858 http://hdl.handle.net/10220/48560 10.1109/TIT.2017.2747557 en IEEE Transactions on Information Theory © 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/TIT.2017.2747557 12 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Science::Physics
Profile Vectors
DNA-based Data Storage
spellingShingle DRNTU::Science::Physics
Profile Vectors
DNA-based Data Storage
Chang, Zuling
Chrisnata, Johan
Ezerman, Martianus Frederic
Kiah, Han Mao
Rates of DNA sequence profiles for practical values of read lengths
description A recent study by one of the authors has demonstrated the importance of profile vectors in DNA-based data storage. We provide exact values and lower bounds on the number of profile vectors for finite values of alphabet size q, read length ℓ, and word length n. Consequently, we demonstrate that for q ≥ 2 and n ≤ q^{ℓ/2-1}, the number of profile vectors is at least q^{κn} with κ very close to 1. In addition to enumeration results, we provide a set of efficient encoding and decoding algorithms for certain families of profile vectors.
author2 School of Physical and Mathematical Sciences
author_facet School of Physical and Mathematical Sciences
Chang, Zuling
Chrisnata, Johan
Ezerman, Martianus Frederic
Kiah, Han Mao
format Article
author Chang, Zuling
Chrisnata, Johan
Ezerman, Martianus Frederic
Kiah, Han Mao
author_sort Chang, Zuling
title Rates of DNA sequence profiles for practical values of read lengths
title_sort rates of dna sequence profiles for practical values of read lengths
publishDate 2019
url https://hdl.handle.net/10356/101858
http://hdl.handle.net/10220/48560
_version_ 1759854111010848768