High-throughput sequencing data and the impact of plant gene annotation quality
The use of draft genomes of different species and re-sequencing of accessions and populations are now common tools for plant biology research. The de novo assembled draft genomes make it possible to identify pivotal divergence points in the plant lineage and provide an opportunity to investigate the...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2019
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/105366 http://hdl.handle.net/10220/49526 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-105366 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1053662023-02-28T17:03:17Z High-throughput sequencing data and the impact of plant gene annotation quality Vaattovaara, Aleksia Leppälä, Johanna Salojärvi, Jarkko Wrzaczek, Michael School of Biological Sciences Gene Families Science::Biological sciences Genome Annotation The use of draft genomes of different species and re-sequencing of accessions and populations are now common tools for plant biology research. The de novo assembled draft genomes make it possible to identify pivotal divergence points in the plant lineage and provide an opportunity to investigate the genomic basis and timing of biological innovations by inferring orthologs between species. Furthermore, re-sequencing facilitates the mapping and subsequent molecular characterization of causative loci for traits, such as those for plant stress tolerance and development. In both cases high-quality gene annotation—the identification of protein-coding regions, gene promoters, and 5′- and 3′-untranslated regions—is critical for investigation of gene function. Annotations are constantly improving but automated gene annotations still require manual curation and experimental validation. This is particularly important for genes with large introns, genes located in regions rich with transposable elements or repeats, large gene families, and segmentally duplicated genes. In this opinion paper, we highlight the impact of annotation quality on evolutionary analyses, genome-wide association studies, and the identification of orthologous genes in plants. Furthermore, we predict that incorporating accurate information from manual curation into databases will dramatically improve the performance of automated gene predictors. Published version 2019-08-05T04:40:15Z 2019-12-06T21:50:04Z 2019-08-05T04:40:15Z 2019-12-06T21:50:04Z 2018 Journal Article Vaattovaara, A., Leppälä, J., Salojärvi, J., & Wrzaczek, M. (2019). High-throughput sequencing data and the impact of plant gene annotation quality. Journal of Experimental Botany, 70(4), 1069-1076. doi:10.1093/jxb/ery434 0022-0957 https://hdl.handle.net/10356/105366 http://hdl.handle.net/10220/49526 10.1093/jxb/ery434 en Journal of Experimental Botany © 2018 The Author(s). Published by Oxford University Press on behalf of the Society for Experimental Biology. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. 8 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Gene Families Science::Biological sciences Genome Annotation |
spellingShingle |
Gene Families Science::Biological sciences Genome Annotation Vaattovaara, Aleksia Leppälä, Johanna Salojärvi, Jarkko Wrzaczek, Michael High-throughput sequencing data and the impact of plant gene annotation quality |
description |
The use of draft genomes of different species and re-sequencing of accessions and populations are now common tools for plant biology research. The de novo assembled draft genomes make it possible to identify pivotal divergence points in the plant lineage and provide an opportunity to investigate the genomic basis and timing of biological innovations by inferring orthologs between species. Furthermore, re-sequencing facilitates the mapping and subsequent molecular characterization of causative loci for traits, such as those for plant stress tolerance and development. In both cases high-quality gene annotation—the identification of protein-coding regions, gene promoters, and 5′- and 3′-untranslated regions—is critical for investigation of gene function. Annotations are constantly improving but automated gene annotations still require manual curation and experimental validation. This is particularly important for genes with large introns, genes located in regions rich with transposable elements or repeats, large gene families, and segmentally duplicated genes. In this opinion paper, we highlight the impact of annotation quality on evolutionary analyses, genome-wide association studies, and the identification of orthologous genes in plants. Furthermore, we predict that incorporating accurate information from manual curation into databases will dramatically improve the performance of automated gene predictors. |
author2 |
School of Biological Sciences |
author_facet |
School of Biological Sciences Vaattovaara, Aleksia Leppälä, Johanna Salojärvi, Jarkko Wrzaczek, Michael |
format |
Article |
author |
Vaattovaara, Aleksia Leppälä, Johanna Salojärvi, Jarkko Wrzaczek, Michael |
author_sort |
Vaattovaara, Aleksia |
title |
High-throughput sequencing data and the impact of plant gene annotation quality |
title_short |
High-throughput sequencing data and the impact of plant gene annotation quality |
title_full |
High-throughput sequencing data and the impact of plant gene annotation quality |
title_fullStr |
High-throughput sequencing data and the impact of plant gene annotation quality |
title_full_unstemmed |
High-throughput sequencing data and the impact of plant gene annotation quality |
title_sort |
high-throughput sequencing data and the impact of plant gene annotation quality |
publishDate |
2019 |
url |
https://hdl.handle.net/10356/105366 http://hdl.handle.net/10220/49526 |
_version_ |
1759856822187982848 |