Rates of DNA sequence profiles for practical values of read lengths
A recent study by one of the authors has demonstrated the importance of profile vectors in DNA-based data storage. We provide exact values and lower bounds on the number of profile vectors for finite values of alphabet size $q$, read length $\ell$, and word length $n$. Consequently, we demonstrate that for q...
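The counting question in the abstract can be illustrated by brute force for very small parameters. The sketch below assumes the usual definition, which this record does not spell out: the profile vector of a word records how many times each $\ell$-gram occurs as a contiguous substring. The function names `profile_vector` and `count_profile_vectors` are hypothetical, and the enumeration is exponential in $n$, so it is only a sanity check and not the method of the paper.

```python
from collections import Counter
from itertools import product


def profile_vector(word, q, ell):
    """Return the ell-gram profile of `word` over the alphabet {0, ..., q-1}.

    Assumed definition: entry g counts the occurrences of the ell-gram g
    as a contiguous substring of `word`. Entries are listed in the
    lexicographic order produced by itertools.product.
    """
    counts = Counter(
        tuple(word[i:i + ell]) for i in range(len(word) - ell + 1)
    )
    return tuple(counts.get(gram, 0) for gram in product(range(q), repeat=ell))


def count_profile_vectors(q, ell, n):
    """Count distinct profile vectors over all q**n words of length n."""
    return len(
        {profile_vector(word, q, ell) for word in product(range(q), repeat=n)}
    )


if __name__ == "__main__":
    # Tiny parameters only: the loop visits every one of the q**n words.
    print(count_profile_vectors(2, 2, 3))
```

For $q = 2$, $\ell = 2$, $n = 3$ the eight binary words give seven distinct profile vectors, because 010 and 101 both contain exactly one copy of 01 and one copy of 10.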
Main Authors: | Chang, Zuling; Chrisnata, Johan; Ezerman, Martianus Frederic; Kiah, Han Mao |
---|---|
Other Authors: | School of Physical and Mathematical Sciences |
Format: | Article |
Language: | English |
Published: | 2019 |
Subjects: | DRNTU::Science::Physics; Profile Vectors; DNA-based Data Storage |
Online Access: | https://hdl.handle.net/10356/101858 http://hdl.handle.net/10220/48560 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-101858 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1018582023-02-28T19:43:23Z Rates of DNA sequence profiles for practical values of read lengths Chang, Zuling Chrisnata, Johan Ezerman, Martianus Frederic Kiah, Han Mao School of Physical and Mathematical Sciences DRNTU::Science::Physics Profile Vectors DNA-based Data Storage A recent study by one of the authors has demonstrated the importance of profile vectors in DNA-based data storage. We provide exact values and lower bounds on the number of profile vectors for finite values of alphabet size $q$, read length $\ell$, and word length $n$. Consequently, we demonstrate that for $q \ge 2$ and $n \le q^{\ell/2-1}$, the number of profile vectors is at least $q^{\kappa n}$ with $\kappa$ very close to 1. In addition to enumeration results, we provide a set of efficient encoding and decoding algorithms for certain families of profile vectors. Accepted version 2019-06-06T05:50:03Z 2019-12-06T20:45:49Z 2019-06-06T05:50:03Z 2019-12-06T20:45:49Z 2017 Journal Article Chang, Z., Chrisnata, J., Ezerman, M. F., & Kiah, H. M. (2017). Rates of DNA sequence profiles for practical values of read lengths. IEEE Transactions on Information Theory, 63(11), 7166-7177. doi:10.1109/TIT.2017.2747557 0018-9448 https://hdl.handle.net/10356/101858 http://hdl.handle.net/10220/48560 10.1109/TIT.2017.2747557 en IEEE Transactions on Information Theory © 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/TIT.2017.2747557 12 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Science::Physics Profile Vectors DNA-based Data Storage |
spellingShingle |
DRNTU::Science::Physics Profile Vectors DNA-based Data Storage Chang, Zuling Chrisnata, Johan Ezerman, Martianus Frederic Kiah, Han Mao Rates of DNA sequence profiles for practical values of read lengths |
description |
A recent study by one of the authors has demonstrated the importance of profile vectors in DNA-based data storage. We provide exact values and lower bounds on the number of profile vectors for finite values of alphabet size $q$, read length $\ell$, and word length $n$. Consequently, we demonstrate that for $q \ge 2$ and $n \le q^{\ell/2-1}$, the number of profile vectors is at least $q^{\kappa n}$ with $\kappa$ very close to 1. In addition to enumeration results, we provide a set of efficient encoding and decoding algorithms for certain families of profile vectors. |
author2 |
School of Physical and Mathematical Sciences |
author_facet |
School of Physical and Mathematical Sciences Chang, Zuling Chrisnata, Johan Ezerman, Martianus Frederic Kiah, Han Mao |
format |
Article |
author |
Chang, Zuling Chrisnata, Johan Ezerman, Martianus Frederic Kiah, Han Mao |
author_sort |
Chang, Zuling |
title |
Rates of DNA sequence profiles for practical values of read lengths |
title_short |
Rates of DNA sequence profiles for practical values of read lengths |
title_full |
Rates of DNA sequence profiles for practical values of read lengths |
title_fullStr |
Rates of DNA sequence profiles for practical values of read lengths |
title_full_unstemmed |
Rates of DNA sequence profiles for practical values of read lengths |
title_sort |
rates of dna sequence profiles for practical values of read lengths |
publishDate |
2019 |
url |
https://hdl.handle.net/10356/101858 http://hdl.handle.net/10220/48560 |
_version_ |
1759854111010848768 |