On the number of DNA sequence profiles for practical values of read lengths

A recent study by one of the authors has demonstrated the relevance of profile vectors in DNA-based data storage. We provide exact values and lower bounds on the number of profile vectors for finite values of alphabet size q, read length ℓ, and word length n. Consequently, we demonstrate that for q...

Full description

Saved in:
Bibliographic Details
Main Authors: Chang, Zuling, Chrisnata, Johan, Ezerman, Martianus Frederic, Kiah, Han Mao
Other Authors: School of Physical and Mathematical Sciences
Format: Conference or Workshop Item
Language:English
Published: 2019
Subjects:
Online Access:https://hdl.handle.net/10356/103253
http://hdl.handle.net/10220/48592
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-103253
record_format dspace
spelling sg-ntu-dr.10356-1032532023-02-28T19:17:56Z On the number of DNA sequence profiles for practical values of read lengths Chang, Zuling Chrisnata, Johan Ezerman, Martianus Frederic Kiah, Han Mao School of Physical and Mathematical Sciences 2016 IEEE International Symposium on Information Theory (ISIT) DNA-based Data Storage Profile Vectors DRNTU::Science::Mathematics A recent study by one of the authors has demonstrated the relevance of profile vectors in DNA-based data storage. We provide exact values and lower bounds on the number of profile vectors for finite values of alphabet size q, read length ℓ, and word length n. Consequently, we demonstrate that for q ≥ 3 and n = q a ℓ, a = o(ℓ), the number of profile vectors is at least q κn for some constant 0 <; κ ≤ 1. In addition to enumeration results, we provide a set of efficient encoding and decoding algorithms for a family of profile vectors. MOE (Min. of Education, S’pore) Accepted version 2019-06-07T02:26:44Z 2019-12-06T21:08:26Z 2019-06-07T02:26:44Z 2019-12-06T21:08:26Z 2016 Conference Paper Chang, Z., Chrisnata, J., Ezerman, M. F., & Kiah, H. M. (2016). On the number of DNA sequence profiles for practical values of read lengths. 2016 IEEE International Symposium on Information Theory (ISIT), 2654-2658. doi:10.1109/ISIT.2016.7541780 https://hdl.handle.net/10356/103253 http://hdl.handle.net/10220/48592 10.1109/ISIT.2016.7541780 en © 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/ISIT.2016.7541780 5 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DNA-based Data Storage
Profile Vectors
DRNTU::Science::Mathematics
spellingShingle DNA-based Data Storage
Profile Vectors
DRNTU::Science::Mathematics
Chang, Zuling
Chrisnata, Johan
Ezerman, Martianus Frederic
Kiah, Han Mao
On the number of DNA sequence profiles for practical values of read lengths
description A recent study by one of the authors has demonstrated the relevance of profile vectors in DNA-based data storage. We provide exact values and lower bounds on the number of profile vectors for finite values of alphabet size q, read length ℓ, and word length n. Consequently, we demonstrate that for q ≥ 3 and n = q a ℓ, a = o(ℓ), the number of profile vectors is at least q κn for some constant 0 <; κ ≤ 1. In addition to enumeration results, we provide a set of efficient encoding and decoding algorithms for a family of profile vectors.
author2 School of Physical and Mathematical Sciences
author_facet School of Physical and Mathematical Sciences
Chang, Zuling
Chrisnata, Johan
Ezerman, Martianus Frederic
Kiah, Han Mao
format Conference or Workshop Item
author Chang, Zuling
Chrisnata, Johan
Ezerman, Martianus Frederic
Kiah, Han Mao
author_sort Chang, Zuling
title On the number of DNA sequence profiles for practical values of read lengths
title_short On the number of DNA sequence profiles for practical values of read lengths
title_full On the number of DNA sequence profiles for practical values of read lengths
title_fullStr On the number of DNA sequence profiles for practical values of read lengths
title_full_unstemmed On the number of DNA sequence profiles for practical values of read lengths
title_sort on the number of dna sequence profiles for practical values of read lengths
publishDate 2019
url https://hdl.handle.net/10356/103253
http://hdl.handle.net/10220/48592
_version_ 1759857409027735552