Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
Background: Dengue is a mosquito-borne viral disease caused by one of four serotypes (DENV1-4). Infection provides long-term homologous immunity against reinfection with the same serotype. Plaque reduction neutralization test (PRNT) is the gold standard to assess serotype-specific antibody levels. W...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/146995 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-146995 |
---|---|
record_format |
dspace |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Science::Biological sciences Dengue Exposures Cluster Analysis |
spellingShingle |
Science::Biological sciences Dengue Exposures Cluster Analysis Sangkaew, Sorawat Tan, Li Kiang Ng, Lee Ching Ferguson, Neil M. Dorigatti, Ilaria Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore |
description |
Background: Dengue is a mosquito-borne viral disease caused by one of four serotypes (DENV1-4). Infection provides long-term homologous immunity against reinfection with the same serotype. Plaque reduction neutralization test (PRNT) is the gold standard to assess serotype-specific antibody levels. We analysed serotype-specific antibody levels obtained by PRNT in two serological surveys conducted in Singapore in 2009 and 2013 using cluster analysis, a machine learning technique that was used to identify the most common histories of DENV exposure. Methods: We explored the use of five distinct clustering methods (i.e. agglomerative hierarchical, divisive hierarchi-cal, K-means, K-medoids and model-based clustering) with varying number (from 4 to 10) of clusters for each method. Weighted rank aggregation, an evaluating technique for a set of internal validity metrics, was adopted to determine the optimal algorithm, comprising the optimal clustering method and the optimal number of clusters. Results: The K-means algorithm with six clusters was selected as the algorithm with the highest weighted rank aggregation. The six clusters were characterised by (i) dominant DENV2 PRNT titres; (ii) co-dominant DENV1 and DENV2 titres with average DENV2 titre > average DENV1 titre; (iii) co-dominant DENV1 and DENV2 titres with average DENV1 titre > average DENV2 titre; (iv) low PRNT titres against DENV1-4; (v) intermediate PRNT titres against DENV1-4; and (vi) dominant DENV1-3 titres. Analyses of the relative size and age-stratification of the clusters by year of sample collection and the application of cluster analysis to the 2009 and 2013 datasets considered separately revealed the epidemic circulation of DENV2 and DENV3 between 2009 and 2013. Conclusion: Cluster analysis is an unsupervised machine learning technique that can be applied to analyse PRNT antibody titres (without pre-established cut-off thresholds to indicate protection) to explore common patterns of DENV infection and infer the likely history of dengue exposure in a population. |
author2 |
School of Biological Sciences |
author_facet |
School of Biological Sciences Sangkaew, Sorawat Tan, Li Kiang Ng, Lee Ching Ferguson, Neil M. Dorigatti, Ilaria |
format |
Article |
author |
Sangkaew, Sorawat Tan, Li Kiang Ng, Lee Ching Ferguson, Neil M. Dorigatti, Ilaria |
author_sort |
Sangkaew, Sorawat |
title |
Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore |
title_short |
Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore |
title_full |
Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore |
title_fullStr |
Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore |
title_full_unstemmed |
Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore |
title_sort |
using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in singapore |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/146995 |
_version_ |
1759857642766860288 |
spelling |
sg-ntu-dr.10356-1469952023-02-28T17:07:29Z Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore Sangkaew, Sorawat Tan, Li Kiang Ng, Lee Ching Ferguson, Neil M. Dorigatti, Ilaria School of Biological Sciences Science::Biological sciences Dengue Exposures Cluster Analysis Background: Dengue is a mosquito-borne viral disease caused by one of four serotypes (DENV1-4). Infection provides long-term homologous immunity against reinfection with the same serotype. Plaque reduction neutralization test (PRNT) is the gold standard to assess serotype-specific antibody levels. We analysed serotype-specific antibody levels obtained by PRNT in two serological surveys conducted in Singapore in 2009 and 2013 using cluster analysis, a machine learning technique that was used to identify the most common histories of DENV exposure. Methods: We explored the use of five distinct clustering methods (i.e. agglomerative hierarchical, divisive hierarchi-cal, K-means, K-medoids and model-based clustering) with varying number (from 4 to 10) of clusters for each method. Weighted rank aggregation, an evaluating technique for a set of internal validity metrics, was adopted to determine the optimal algorithm, comprising the optimal clustering method and the optimal number of clusters. Results: The K-means algorithm with six clusters was selected as the algorithm with the highest weighted rank aggregation. The six clusters were characterised by (i) dominant DENV2 PRNT titres; (ii) co-dominant DENV1 and DENV2 titres with average DENV2 titre > average DENV1 titre; (iii) co-dominant DENV1 and DENV2 titres with average DENV1 titre > average DENV2 titre; (iv) low PRNT titres against DENV1-4; (v) intermediate PRNT titres against DENV1-4; and (vi) dominant DENV1-3 titres. Analyses of the relative size and age-stratification of the clusters by year of sample collection and the application of cluster analysis to the 2009 and 2013 datasets considered separately revealed the epidemic circulation of DENV2 and DENV3 between 2009 and 2013. Conclusion: Cluster analysis is an unsupervised machine learning technique that can be applied to analyse PRNT antibody titres (without pre-established cut-off thresholds to indicate protection) to explore common patterns of DENV infection and infer the likely history of dengue exposure in a population. National Environmental Agency (NEA) Published version This work is jointly funded by the UK Medical Research Council (MRC) and the UK Department for International Development (DFID) under the MRC/DFID Concordat agreement and is also part of the EDCTP2 programme supported by the European Union. SS acknowledges funding from the Collaborative Project to Increase Production of Rural Doctor and Royal Thai Government Scholarship. ID also acknowledges research funding from an Imperial College Junior Research Fellowship and a Sir Henry Dale Fellowship funded by Wellcome Trust and the Royal Society [grant 213494/Z/18/Z]. NLC and TLK acknowledge support from the National Environment Agency, Singapore. The funding bodies had no role in the design of the study, analysis of the data, interpretation of the results and in the manuscript preparation. 2021-03-18T08:40:37Z 2021-03-18T08:40:37Z 2020 Journal Article Sangkaew, S., Tan, L. K., Ng, L. C., Ferguson, N. M. & Dorigatti, I. (2020). Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore. Parasites & Vectors, 13(1). https://dx.doi.org/10.1186/s13071-020-3898-5 1756-3305 https://hdl.handle.net/10356/146995 10.1186/s13071-020-3898-5 31952539 2-s2.0-85077979546 1 13 en Parasites & Vectors © 2020 The Author(s). This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. application/pdf |