MiDAS 4: a global catalogue of full-length 16S rRNA gene sequences and taxonomy for studies of bacterial communities in wastewater treatment plants

Microbial communities are responsible for biological wastewater treatment, but our knowledge of their diversity and function is still poor. Here, we sequence more than 5 million high-quality, full-length 16S rRNA gene sequences from 740 wastewater treatment plants (WWTPs) across the world and use th...

Full description

Saved in:
Bibliographic Details
Main Authors: Dueholm, Morten Kam Dahl, Nierychlo, Marta, Andersen, Kasper Skytte, Rudkjøbing, Vibeke, Knutsson, Simon, Arriaga, Sonia, Bakke, Rune, Boon, Nico, Bux, Faizal, Christensson, Magnus, Chua, Adeline Seak May, Curtis, Thomas P., Cytryn, Eddie, Erijman, Leonardo, Etchebehere, Claudia, Fatta-Kassinos, Despo, Frigon, Dominic, Garcia-Chaves, Maria Carolina, Gu, April Z., Horn, Harald, Jenkins, David, Kreuzinger, Norbert, Kumari, Sheena, Lanham, Ana, Law, Yingyu, Leiknes, TorOve, Morgenroth, Eberhard, Muszyński, Adam, Petrovski, Steve, Pijuan, Maite, Pillai, Suraj Babu, Reis, Maria A. M., Rong, Qi, Rossetti, Simona, Seviour, Robert, Tooker, Nick, Vainio, Pirjo, van Loosdrecht, Mark, Vikraman, R., Wanner, Jiří, Weissbrodt, David, Wen, Xianghua, Zhang, Tong, Nielsen, Per H., Albertsen, Mads, Nielsen, Per Halkjær
Other Authors: Singapore Centre for Environmental Life Sciences and Engineering
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/163237
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Microbial communities are responsible for biological wastewater treatment, but our knowledge of their diversity and function is still poor. Here, we sequence more than 5 million high-quality, full-length 16S rRNA gene sequences from 740 wastewater treatment plants (WWTPs) across the world and use the sequences to construct the 'MiDAS 4' database. MiDAS 4 is an amplicon sequence variant resolved, full-length 16S rRNA gene reference database with a comprehensive taxonomy from domain to species level for all sequences. We use an independent dataset (269 WWTPs) to show that MiDAS 4, compared to commonly used universal reference databases, provides a better coverage for WWTP bacteria and an improved rate of genus and species level classification. Taking advantage of MiDAS 4, we carry out an amplicon-based, global-scale microbial community profiling of activated sludge plants using two common sets of primers targeting regions of the 16S rRNA gene, revealing how environmental conditions and biogeography shape the activated sludge microbiota. We also identify core and conditionally rare or abundant taxa, encompassing 966 genera and 1530 species that represent approximately 80% and 50% of the accumulated read abundance, respectively. Finally, we show that for well-studied functional guilds, such as nitrifiers or polyphosphate-accumulating organisms, the same genera are prevalent worldwide, with only a few abundant species in each genus.