Efficient loss-less compression for genetic data

Bibliographic Details
Main Author: Tye, Yong Meng
Other Authors: Anupam Chattopadhyay
Format: Final Year Project
Language: English
Published: 2016
Online Access: http://hdl.handle.net/10356/66707
Institution: Nanyang Technological University
Summary: As the use of technology grows rapidly, the amount of data created grows exponentially with it. DNA sequencing output in particular is increasing at an accelerating rate. Efficient compression significantly reduces storage and maintenance costs. This project will therefore look into available compression algorithms that work better on such data than general-purpose compression tools. The first algorithm to be examined is logic synthesis, which takes a binary string as input, converts it into a logic circuit, and produces an optimized logic circuit as output. It will be applied to a segment of DNA sequence to determine whether it works well with such data. The second algorithm comes from the Fqzcomp program, which won first prize in the Sequence Squeeze competition because it offered the best compression ratio on DNA sequences. It will be examined, and suggestions will be proposed to make it more efficient.
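
The summary says that logic synthesis takes a binary string as input, but this record does not say how a DNA segment is turned into one. The following is a minimal sketch of one common possibility, assuming a fixed 2-bit code per base. The mapping and the helper names `dna_to_bits`, `bits_to_dna`, and `compression_ratio` are hypothetical illustrations, not taken from the project.

```python
# Hypothetical 2-bit code per base; the project's actual encoding is not
# stated in this record.
BASE_TO_BITS = {"A": "00", "C": "01", "G": "10", "T": "11"}
BITS_TO_BASE = {bits: base for base, bits in BASE_TO_BITS.items()}


def dna_to_bits(seq: str) -> str:
    """Encode a DNA string as a binary string, two bits per base."""
    return "".join(BASE_TO_BITS[base] for base in seq.upper())


def bits_to_dna(bits: str) -> str:
    """Invert dna_to_bits, showing that the encoding is lossless."""
    if len(bits) % 2:
        raise ValueError("bit string length must be even")
    return "".join(BITS_TO_BASE[bits[i:i + 2]] for i in range(0, len(bits), 2))


def compression_ratio(original_bits: int, compressed_bits: int) -> float:
    """Original size divided by compressed size; higher is better."""
    return original_bits / compressed_bits


segment = "ACGTTGCA"
encoded = dna_to_bits(segment)
print(encoded)                          # 0001101111100100
print(bits_to_dna(encoded) == segment)  # True
# Ratio against 8-bit ASCII storage of the same segment.
print(compression_ratio(8 * len(segment), len(encoded)))  # 4.0
```

A fixed 2-bit code already gives a ratio of 4 against 8-bit text, so it is a natural baseline: a DNA-specific compressor is only worthwhile if it does better than 2 bits per base. This sketch handles only the four bases A, C, G, and T; other symbols such as N would raise a `KeyError`.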