Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore

Background: Dengue is a mosquito-borne viral disease caused by one of four serotypes (DENV1-4). Infection provides long-term homologous immunity against reinfection with the same serotype. Plaque reduction neutralization test (PRNT) is the gold standard to assess serotype-specific antibody levels. W...

Full description

Saved in:
Bibliographic Details
Main Authors: Sangkaew, Sorawat, Tan, Li Kiang, Ng, Lee Ching, Ferguson, Neil M., Dorigatti, Ilaria
Other Authors: School of Biological Sciences
Format: Article
Language:English
Published: 2021
Subjects:
Online Access:https://hdl.handle.net/10356/146995
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-146995
record_format dspace
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Biological sciences
Dengue Exposures
Cluster Analysis
spellingShingle Science::Biological sciences
Dengue Exposures
Cluster Analysis
Sangkaew, Sorawat
Tan, Li Kiang
Ng, Lee Ching
Ferguson, Neil M.
Dorigatti, Ilaria
Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
description Background: Dengue is a mosquito-borne viral disease caused by one of four serotypes (DENV1-4). Infection provides long-term homologous immunity against reinfection with the same serotype. Plaque reduction neutralization test (PRNT) is the gold standard to assess serotype-specific antibody levels. We analysed serotype-specific antibody levels obtained by PRNT in two serological surveys conducted in Singapore in 2009 and 2013 using cluster analysis, a machine learning technique that was used to identify the most common histories of DENV exposure. Methods: We explored the use of five distinct clustering methods (i.e. agglomerative hierarchical, divisive hierarchi-cal, K-means, K-medoids and model-based clustering) with varying number (from 4 to 10) of clusters for each method. Weighted rank aggregation, an evaluating technique for a set of internal validity metrics, was adopted to determine the optimal algorithm, comprising the optimal clustering method and the optimal number of clusters. Results: The K-means algorithm with six clusters was selected as the algorithm with the highest weighted rank aggregation. The six clusters were characterised by (i) dominant DENV2 PRNT titres; (ii) co-dominant DENV1 and DENV2 titres with average DENV2 titre > average DENV1 titre; (iii) co-dominant DENV1 and DENV2 titres with average DENV1 titre > average DENV2 titre; (iv) low PRNT titres against DENV1-4; (v) intermediate PRNT titres against DENV1-4; and (vi) dominant DENV1-3 titres. Analyses of the relative size and age-stratification of the clusters by year of sample collection and the application of cluster analysis to the 2009 and 2013 datasets considered separately revealed the epidemic circulation of DENV2 and DENV3 between 2009 and 2013. Conclusion: Cluster analysis is an unsupervised machine learning technique that can be applied to analyse PRNT antibody titres (without pre-established cut-off thresholds to indicate protection) to explore common patterns of DENV infection and infer the likely history of dengue exposure in a population.
author2 School of Biological Sciences
author_facet School of Biological Sciences
Sangkaew, Sorawat
Tan, Li Kiang
Ng, Lee Ching
Ferguson, Neil M.
Dorigatti, Ilaria
format Article
author Sangkaew, Sorawat
Tan, Li Kiang
Ng, Lee Ching
Ferguson, Neil M.
Dorigatti, Ilaria
author_sort Sangkaew, Sorawat
title Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
title_short Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
title_full Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
title_fullStr Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
title_full_unstemmed Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
title_sort using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in singapore
publishDate 2021
url https://hdl.handle.net/10356/146995
_version_ 1759857642766860288
spelling sg-ntu-dr.10356-1469952023-02-28T17:07:29Z Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore Sangkaew, Sorawat Tan, Li Kiang Ng, Lee Ching Ferguson, Neil M. Dorigatti, Ilaria School of Biological Sciences Science::Biological sciences Dengue Exposures Cluster Analysis Background: Dengue is a mosquito-borne viral disease caused by one of four serotypes (DENV1-4). Infection provides long-term homologous immunity against reinfection with the same serotype. Plaque reduction neutralization test (PRNT) is the gold standard to assess serotype-specific antibody levels. We analysed serotype-specific antibody levels obtained by PRNT in two serological surveys conducted in Singapore in 2009 and 2013 using cluster analysis, a machine learning technique that was used to identify the most common histories of DENV exposure. Methods: We explored the use of five distinct clustering methods (i.e. agglomerative hierarchical, divisive hierarchi-cal, K-means, K-medoids and model-based clustering) with varying number (from 4 to 10) of clusters for each method. Weighted rank aggregation, an evaluating technique for a set of internal validity metrics, was adopted to determine the optimal algorithm, comprising the optimal clustering method and the optimal number of clusters. Results: The K-means algorithm with six clusters was selected as the algorithm with the highest weighted rank aggregation. The six clusters were characterised by (i) dominant DENV2 PRNT titres; (ii) co-dominant DENV1 and DENV2 titres with average DENV2 titre > average DENV1 titre; (iii) co-dominant DENV1 and DENV2 titres with average DENV1 titre > average DENV2 titre; (iv) low PRNT titres against DENV1-4; (v) intermediate PRNT titres against DENV1-4; and (vi) dominant DENV1-3 titres. Analyses of the relative size and age-stratification of the clusters by year of sample collection and the application of cluster analysis to the 2009 and 2013 datasets considered separately revealed the epidemic circulation of DENV2 and DENV3 between 2009 and 2013. Conclusion: Cluster analysis is an unsupervised machine learning technique that can be applied to analyse PRNT antibody titres (without pre-established cut-off thresholds to indicate protection) to explore common patterns of DENV infection and infer the likely history of dengue exposure in a population. National Environmental Agency (NEA) Published version This work is jointly funded by the UK Medical Research Council (MRC) and the UK Department for International Development (DFID) under the MRC/DFID Concordat agreement and is also part of the EDCTP2 programme supported by the European Union. SS acknowledges funding from the Collaborative Project to Increase Production of Rural Doctor and Royal Thai Government Scholarship. ID also acknowledges research funding from an Imperial College Junior Research Fellowship and a Sir Henry Dale Fellowship funded by Wellcome Trust and the Royal Society [grant 213494/Z/18/Z]. NLC and TLK acknowledge support from the National Environment Agency, Singapore. The funding bodies had no role in the design of the study, analysis of the data, interpretation of the results and in the manuscript preparation. 2021-03-18T08:40:37Z 2021-03-18T08:40:37Z 2020 Journal Article Sangkaew, S., Tan, L. K., Ng, L. C., Ferguson, N. M. & Dorigatti, I. (2020). Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore. Parasites & Vectors, 13(1). https://dx.doi.org/10.1186/s13071-020-3898-5 1756-3305 https://hdl.handle.net/10356/146995 10.1186/s13071-020-3898-5 31952539 2-s2.0-85077979546 1 13 en Parasites & Vectors © 2020 The Author(s). This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. application/pdf