Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis

© 2017 Skwark et al. Recent advances in the scale and diversity of population genomic datasets for bacteria now provide the potential for genome-wide patterns of co-evolution to be studied at the resolution of individual bases. Here we describe a new statistical method, genomeDCA, which uses recent...

Full description

Saved in:
Bibliographic Details
Main Authors: Marcin J. Skwark, Nicholas J. Croucher, Santeri Puranen, Claire Chewapreecha, Maiju Pesonen, Ying Ying Xu, Paul Turner, Simon R. Harris, Stephen B. Beres, James M. Musser, Julian Parkhill, Stephen D. Bentley, Erik Aurell, Jukka Corander
Other Authors: Vanderbilt University
Format: Article
Published: 2018
Subjects:
Online Access:https://repository.li.mahidol.ac.th/handle/123456789/41534
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Mahidol University
id th-mahidol.41534
record_format dspace
spelling th-mahidol.415342019-03-14T15:02:30Z Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis Marcin J. Skwark Nicholas J. Croucher Santeri Puranen Claire Chewapreecha Maiju Pesonen Ying Ying Xu Paul Turner Simon R. Harris Stephen B. Beres James M. Musser Julian Parkhill Stephen D. Bentley Erik Aurell Jukka Corander Vanderbilt University Imperial College London Aalto University University of Cambridge Mahidol University Nuffield Department of Clinical Medicine Wellcome Trust Sanger Institute Methodist Hospital Houston Weill Cornell Medical College The Royal Institute of Technology (KTH) Institute of Theoretical Physics Chinese Academy of Sciences Helsingin Yliopisto Universitetet i Oslo Agricultural and Biological Sciences Biochemistry, Genetics and Molecular Biology © 2017 Skwark et al. Recent advances in the scale and diversity of population genomic datasets for bacteria now provide the potential for genome-wide patterns of co-evolution to be studied at the resolution of individual bases. Here we describe a new statistical method, genomeDCA, which uses recent advances in computational structural biology to identify the polymorphic loci under the strongest co-evolutionary pressures. We apply genomeDCA to two large population data sets representing the major human pathogens Streptococcus pneumoniae (pneumococcus) and Streptococcus pyogenes (group A Streptococcus). For pneumococcus we identified 5,199 putative epistatic interactions between 1,936 sites. Over three-quarters of the links were between sites within the pbp2x, pbp1a and pbp2b genes, the sequences of which are critical in determining non-susceptibility to beta-lactam antibiotics. A network-based analysis found these genes were also coupled to that encoding dihydrofolate reductase, changes to which underlie trimethoprim resistance. Distinct from these antibiotic resistance genes, a large network component of 384 protein coding sequences encompassed many genes critical in basic cellular functions, while another distinct component included genes associated with virulence. The group A Streptococcus (GAS) data set population represents a clonal population with relatively little genetic variation and a high level of linkage disequilibrium across the genome. Despite this, we were able to pinpoint two RNA pseudouridine synthases, which were each strongly linked to a separate set of loci across the chromosome, representing biologically plausible targets of co-selection. The population genomic analysis method applied here identifies statistically significantly co-evolving locus pairs, potentially arising from fitness selection interdependence reflecting underlying protein-protein interactions, or genes whose product activities contribute to the same phenotype. This discovery approach greatly enhances the future potential of epistasis analysis for systems biology, and can complement genome-wide association studies as a means of formulating hypotheses for targeted experimental work. 2018-12-21T06:33:21Z 2019-03-14T08:02:30Z 2018-12-21T06:33:21Z 2019-03-14T08:02:30Z 2017-02-01 Article PLoS Genetics. Vol.13, No.2 (2017) 10.1371/journal.pgen.1006508 15537404 15537390 2-s2.0-85014119857 https://repository.li.mahidol.ac.th/handle/123456789/41534 Mahidol University SCOPUS https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85014119857&origin=inward
institution Mahidol University
building Mahidol University Library
continent Asia
country Thailand
Thailand
content_provider Mahidol University Library
collection Mahidol University Institutional Repository
topic Agricultural and Biological Sciences
Biochemistry, Genetics and Molecular Biology
spellingShingle Agricultural and Biological Sciences
Biochemistry, Genetics and Molecular Biology
Marcin J. Skwark
Nicholas J. Croucher
Santeri Puranen
Claire Chewapreecha
Maiju Pesonen
Ying Ying Xu
Paul Turner
Simon R. Harris
Stephen B. Beres
James M. Musser
Julian Parkhill
Stephen D. Bentley
Erik Aurell
Jukka Corander
Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis
description © 2017 Skwark et al. Recent advances in the scale and diversity of population genomic datasets for bacteria now provide the potential for genome-wide patterns of co-evolution to be studied at the resolution of individual bases. Here we describe a new statistical method, genomeDCA, which uses recent advances in computational structural biology to identify the polymorphic loci under the strongest co-evolutionary pressures. We apply genomeDCA to two large population data sets representing the major human pathogens Streptococcus pneumoniae (pneumococcus) and Streptococcus pyogenes (group A Streptococcus). For pneumococcus we identified 5,199 putative epistatic interactions between 1,936 sites. Over three-quarters of the links were between sites within the pbp2x, pbp1a and pbp2b genes, the sequences of which are critical in determining non-susceptibility to beta-lactam antibiotics. A network-based analysis found these genes were also coupled to that encoding dihydrofolate reductase, changes to which underlie trimethoprim resistance. Distinct from these antibiotic resistance genes, a large network component of 384 protein coding sequences encompassed many genes critical in basic cellular functions, while another distinct component included genes associated with virulence. The group A Streptococcus (GAS) data set population represents a clonal population with relatively little genetic variation and a high level of linkage disequilibrium across the genome. Despite this, we were able to pinpoint two RNA pseudouridine synthases, which were each strongly linked to a separate set of loci across the chromosome, representing biologically plausible targets of co-selection. The population genomic analysis method applied here identifies statistically significantly co-evolving locus pairs, potentially arising from fitness selection interdependence reflecting underlying protein-protein interactions, or genes whose product activities contribute to the same phenotype. This discovery approach greatly enhances the future potential of epistasis analysis for systems biology, and can complement genome-wide association studies as a means of formulating hypotheses for targeted experimental work.
author2 Vanderbilt University
author_facet Vanderbilt University
Marcin J. Skwark
Nicholas J. Croucher
Santeri Puranen
Claire Chewapreecha
Maiju Pesonen
Ying Ying Xu
Paul Turner
Simon R. Harris
Stephen B. Beres
James M. Musser
Julian Parkhill
Stephen D. Bentley
Erik Aurell
Jukka Corander
format Article
author Marcin J. Skwark
Nicholas J. Croucher
Santeri Puranen
Claire Chewapreecha
Maiju Pesonen
Ying Ying Xu
Paul Turner
Simon R. Harris
Stephen B. Beres
James M. Musser
Julian Parkhill
Stephen D. Bentley
Erik Aurell
Jukka Corander
author_sort Marcin J. Skwark
title Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis
title_short Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis
title_full Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis
title_fullStr Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis
title_full_unstemmed Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis
title_sort interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis
publishDate 2018
url https://repository.li.mahidol.ac.th/handle/123456789/41534
_version_ 1763497185940340736