Investigating the improvement of transcriptome assembly via a consensus-based ensemble approach

This paper investigates the development of a transcriptome assembly pipeline that provides as clean and accurate of a result as possible, for the plant Cycas edentata. De novo transcriptome assembly is a necessary process to understand the gene expression of non-model organisms that may not have a s...

Full description

Saved in:
Bibliographic Details
Main Author: Srinivasan, Niyathi
Other Authors: Marek Mutwil
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/156860
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-156860
record_format dspace
spelling sg-ntu-dr.10356-1568602023-02-28T18:09:28Z Investigating the improvement of transcriptome assembly via a consensus-based ensemble approach Srinivasan, Niyathi Marek Mutwil School of Biological Sciences Irene Julca mutwil@ntu.edu.sg Science::Biological sciences::Molecular biology Science::Biological sciences::Botany::Plant physiology This paper investigates the development of a transcriptome assembly pipeline that provides as clean and accurate of a result as possible, for the plant Cycas edentata. De novo transcriptome assembly is a necessary process to understand the gene expression of non-model organisms that may not have a sequenced genome available. It can be quite difficult to obtain transcriptomes free of extraneous material such as splicing isoforms and partially transcribed sequences. Therefore, this paper aims to investigate various methods that may address this issue, by comparing the performance of single assemblers and merged assemblers, as well as the impact of varying k-mer size and introducing post-assembly modifications. Results show that the best pipeline for Cycas edentata involves the program SOAPdenovo-Trans at K35, followed by the program CD-HIT-Est, to obtain the cleanest and most accurate transcriptome. This pipeline may be used in future for other datasets as well. Bachelor of Science in Biological Sciences 2022-04-27T00:46:11Z 2022-04-27T00:46:11Z 2022 Final Year Project (FYP) Srinivasan, N. (2022). Investigating the improvement of transcriptome assembly via a consensus-based ensemble approach. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156860 https://hdl.handle.net/10356/156860 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Biological sciences::Molecular biology
Science::Biological sciences::Botany::Plant physiology
spellingShingle Science::Biological sciences::Molecular biology
Science::Biological sciences::Botany::Plant physiology
Srinivasan, Niyathi
Investigating the improvement of transcriptome assembly via a consensus-based ensemble approach
description This paper investigates the development of a transcriptome assembly pipeline that provides as clean and accurate of a result as possible, for the plant Cycas edentata. De novo transcriptome assembly is a necessary process to understand the gene expression of non-model organisms that may not have a sequenced genome available. It can be quite difficult to obtain transcriptomes free of extraneous material such as splicing isoforms and partially transcribed sequences. Therefore, this paper aims to investigate various methods that may address this issue, by comparing the performance of single assemblers and merged assemblers, as well as the impact of varying k-mer size and introducing post-assembly modifications. Results show that the best pipeline for Cycas edentata involves the program SOAPdenovo-Trans at K35, followed by the program CD-HIT-Est, to obtain the cleanest and most accurate transcriptome. This pipeline may be used in future for other datasets as well.
author2 Marek Mutwil
author_facet Marek Mutwil
Srinivasan, Niyathi
format Final Year Project
author Srinivasan, Niyathi
author_sort Srinivasan, Niyathi
title Investigating the improvement of transcriptome assembly via a consensus-based ensemble approach
title_short Investigating the improvement of transcriptome assembly via a consensus-based ensemble approach
title_full Investigating the improvement of transcriptome assembly via a consensus-based ensemble approach
title_fullStr Investigating the improvement of transcriptome assembly via a consensus-based ensemble approach
title_full_unstemmed Investigating the improvement of transcriptome assembly via a consensus-based ensemble approach
title_sort investigating the improvement of transcriptome assembly via a consensus-based ensemble approach
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/156860
_version_ 1759855907510943744