Recovery of high quality metagenome-assembled genomes from full-scale activated sludge microbial communities in a tropical climate using longitudinal metagenome sampling

The analysis of metagenome data based on the recovery of draft genomes (so called metagenome-assembled genomes, or MAG) has assumed an increasingly central role in microbiome research in recent years. Microbial communities underpinning the operation of wastewater treatment plants are particularly ch...

Full description

Saved in:
Bibliographic Details
Main Authors: Haryono, Mindia A. S., Law, Ying Yu, Arumugam, Krithika, Liew, Larry C. W., Nguyen, Thi Quynh Ngoc, Drautz-Moses, Daniela Isabel, Schuster, Stephan Christoph, Wuertz, Stefan, Williams, Rohan B H
Other Authors: School of Biological Sciences
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/163813
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-163813
record_format dspace
spelling sg-ntu-dr.10356-1638132022-12-24T23:31:39Z Recovery of high quality metagenome-assembled genomes from full-scale activated sludge microbial communities in a tropical climate using longitudinal metagenome sampling Haryono, Mindia A. S. Law, Ying Yu Arumugam, Krithika Liew, Larry C. W. Nguyen, Thi Quynh Ngoc Drautz-Moses, Daniela Isabel Schuster, Stephan Christoph Wuertz, Stefan Williams, Rohan B H School of Biological Sciences Singapore Centre for Environmental Life Sciences and Engineering (SCELSE) Science::Biological sciences Genome-Resolved Metagenomics Environmental Microbiomes The analysis of metagenome data based on the recovery of draft genomes (so called metagenome-assembled genomes, or MAG) has assumed an increasingly central role in microbiome research in recent years. Microbial communities underpinning the operation of wastewater treatment plants are particularly challenging targets for MAG analysis due to their high ecological complexity, and remain important, albeit understudied, microbial communities that play ssa key role in mediating interactions between human and natural ecosystems. Here we consider strategies for recovery of MAG sequence from time series metagenome surveys of full-scale activated sludge microbial communities. We generate MAG catalogs from this set of data using several different strategies, including the use of multiple individual sample assemblies, two variations on multi-sample co-assembly and a recently published MAG recovery workflow using deep learning. We obtain a total of just under 9,100 draft genomes, which collapse to around 3,100 non-redundant genomic clusters. We examine the strengths and weaknesses of these approaches in relation to MAG yield and quality, showing that co-assembly may offer advantages over single-sample assembly in the case of metagenome data obtained from closely sampled longitudinal study designs. Around 1,000 MAGs were candidates for being considered high quality, based on single-copy marker gene occurrence statistics, however only 58 MAG formally meet the MIMAG criteria for being high quality draft genomes. These findings carry broader broader implications for performing genome-resolved metagenomics on highly complex communities, the design and implementation of genome recoverability strategies, MAG decontamination and the search for better binning methodology. Ministry of Education (MOE) National Research Foundation (NRF) National Supercomputing Centre (NSCC) Singapore Published version This research was supported by the Singapore National Research Foundation and Ministry of Education under the Research Centre of Excellence Programme and by program grants 1102- IRIS-10-02 (to SW, RW, and SS) from the National Research Foundation (NRF). The computational work was performed in part on resources of the National Supercomputing Centre (NSCC, Singapore) supported by Project 11000984. 2022-12-19T03:06:48Z 2022-12-19T03:06:48Z 2022 Journal Article Haryono, M. A. S., Law, Y. Y., Arumugam, K., Liew, L. C. W., Nguyen, T. Q. N., Drautz-Moses, D. I., Schuster, S. C., Wuertz, S. & Williams, R. B. H. (2022). Recovery of high quality metagenome-assembled genomes from full-scale activated sludge microbial communities in a tropical climate using longitudinal metagenome sampling. Frontiers in Microbiology, 13, 869135-. https://dx.doi.org/10.3389/fmicb.2022.869135 1664-302X https://hdl.handle.net/10356/163813 10.3389/fmicb.2022.869135 35756038 2-s2.0-85133450919 13 869135 en 1102-IRIS-10-02 11000984 Frontiers in Microbiology © 2022 Haryono, Law, Arumugam, Liew, Nguyen, Drautz-Moses, Schuster, Wuertz and Williams. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Biological sciences
Genome-Resolved Metagenomics
Environmental Microbiomes
spellingShingle Science::Biological sciences
Genome-Resolved Metagenomics
Environmental Microbiomes
Haryono, Mindia A. S.
Law, Ying Yu
Arumugam, Krithika
Liew, Larry C. W.
Nguyen, Thi Quynh Ngoc
Drautz-Moses, Daniela Isabel
Schuster, Stephan Christoph
Wuertz, Stefan
Williams, Rohan B H
Recovery of high quality metagenome-assembled genomes from full-scale activated sludge microbial communities in a tropical climate using longitudinal metagenome sampling
description The analysis of metagenome data based on the recovery of draft genomes (so called metagenome-assembled genomes, or MAG) has assumed an increasingly central role in microbiome research in recent years. Microbial communities underpinning the operation of wastewater treatment plants are particularly challenging targets for MAG analysis due to their high ecological complexity, and remain important, albeit understudied, microbial communities that play ssa key role in mediating interactions between human and natural ecosystems. Here we consider strategies for recovery of MAG sequence from time series metagenome surveys of full-scale activated sludge microbial communities. We generate MAG catalogs from this set of data using several different strategies, including the use of multiple individual sample assemblies, two variations on multi-sample co-assembly and a recently published MAG recovery workflow using deep learning. We obtain a total of just under 9,100 draft genomes, which collapse to around 3,100 non-redundant genomic clusters. We examine the strengths and weaknesses of these approaches in relation to MAG yield and quality, showing that co-assembly may offer advantages over single-sample assembly in the case of metagenome data obtained from closely sampled longitudinal study designs. Around 1,000 MAGs were candidates for being considered high quality, based on single-copy marker gene occurrence statistics, however only 58 MAG formally meet the MIMAG criteria for being high quality draft genomes. These findings carry broader broader implications for performing genome-resolved metagenomics on highly complex communities, the design and implementation of genome recoverability strategies, MAG decontamination and the search for better binning methodology.
author2 School of Biological Sciences
author_facet School of Biological Sciences
Haryono, Mindia A. S.
Law, Ying Yu
Arumugam, Krithika
Liew, Larry C. W.
Nguyen, Thi Quynh Ngoc
Drautz-Moses, Daniela Isabel
Schuster, Stephan Christoph
Wuertz, Stefan
Williams, Rohan B H
format Article
author Haryono, Mindia A. S.
Law, Ying Yu
Arumugam, Krithika
Liew, Larry C. W.
Nguyen, Thi Quynh Ngoc
Drautz-Moses, Daniela Isabel
Schuster, Stephan Christoph
Wuertz, Stefan
Williams, Rohan B H
author_sort Haryono, Mindia A. S.
title Recovery of high quality metagenome-assembled genomes from full-scale activated sludge microbial communities in a tropical climate using longitudinal metagenome sampling
title_short Recovery of high quality metagenome-assembled genomes from full-scale activated sludge microbial communities in a tropical climate using longitudinal metagenome sampling
title_full Recovery of high quality metagenome-assembled genomes from full-scale activated sludge microbial communities in a tropical climate using longitudinal metagenome sampling
title_fullStr Recovery of high quality metagenome-assembled genomes from full-scale activated sludge microbial communities in a tropical climate using longitudinal metagenome sampling
title_full_unstemmed Recovery of high quality metagenome-assembled genomes from full-scale activated sludge microbial communities in a tropical climate using longitudinal metagenome sampling
title_sort recovery of high quality metagenome-assembled genomes from full-scale activated sludge microbial communities in a tropical climate using longitudinal metagenome sampling
publishDate 2022
url https://hdl.handle.net/10356/163813
_version_ 1753801089029767168