Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission

In this study, a hybrid deep-learning model termed as ODANN, built upon neural networks (NN) coupled with data assimilation and natural language processing (NLP) features extraction methods, has been constructed to concurrently process daily COVID-19 time-series records and large volumes of COVID-19...

Full description

Saved in:
Bibliographic Details
Main Authors: Chew, Alvin Wei Ze, Pan, Yue, Wang, Ying, Zhang, Limao
Other Authors: School of Civil and Environmental Engineering
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/160694
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-160694
record_format dspace
spelling sg-ntu-dr.10356-1606942022-08-01T03:43:13Z Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission Chew, Alvin Wei Ze Pan, Yue Wang, Ying Zhang, Limao School of Civil and Environmental Engineering Engineering::Civil engineering Natural Language Processing Time-Series Prediction In this study, a hybrid deep-learning model termed as ODANN, built upon neural networks (NN) coupled with data assimilation and natural language processing (NLP) features extraction methods, has been constructed to concurrently process daily COVID-19 time-series records and large volumes of COVID-19 related Twitter data, as representative of the global community's aggregated emotional responses towards the current pandemic, to model the growth rate in the number of confirmed COVID-19 cases globally via a proposed G parameter. Overall, there were 3 key components to ODANN's development phase, namely: (i) data hydration and pre-processing were performed on COVID-19 related Twitter data ranging between 23 January 2020 and 10 May 2020, which amounted to over 100 million Tweets written in English language; (ii) multiple NLP features extraction methods were subsequently leveraged to encode the hydrated Twitter data into useful semantic word vectors for training ODANN under an optimal set of hyperparameters; and (iii) historical time-series data of defined characteristics were also assimilated into ODANN's selected hidden layer(s) to model the G parameter daily with a lead-time of 1 day. By far, our experimental results demonstrated that by adopting a rolling time-window size of 5 days, with respect to the number of historical time-series records for assimilating different data features, enabled ODANN to outperform other traditional time-series models and recent studies, in terms of the computed RMSE and MAE scores attained from the model's testing step. Overall, the summarized results from ODANN demonstrated its competitive edge in modelling and forecasting the growth rate in the number of COVID-19 cases globally. Nanyang Technological University This study was supported in part by Microsoft Corporation for the AI for Health COVID-19 Azure Compute Grant of ID:00011000272 and the Start-Up Grant at Nanyang Technological University, Singapore (No. 04INS000423C120). 2022-08-01T03:43:13Z 2022-08-01T03:43:13Z 2021 Journal Article Chew, A. W. Z., Pan, Y., Wang, Y. & Zhang, L. (2021). Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission. Knowledge-Based Systems, 233, 107417-. https://dx.doi.org/10.1016/j.knosys.2021.107417 0950-7051 https://hdl.handle.net/10356/160694 10.1016/j.knosys.2021.107417 34690447 2-s2.0-85116318005 233 107417 en 04INS000423C120 Knowledge-Based Systems © 2021 Elsevier B.V. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Civil engineering
Natural Language Processing
Time-Series Prediction
spellingShingle Engineering::Civil engineering
Natural Language Processing
Time-Series Prediction
Chew, Alvin Wei Ze
Pan, Yue
Wang, Ying
Zhang, Limao
Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission
description In this study, a hybrid deep-learning model termed as ODANN, built upon neural networks (NN) coupled with data assimilation and natural language processing (NLP) features extraction methods, has been constructed to concurrently process daily COVID-19 time-series records and large volumes of COVID-19 related Twitter data, as representative of the global community's aggregated emotional responses towards the current pandemic, to model the growth rate in the number of confirmed COVID-19 cases globally via a proposed G parameter. Overall, there were 3 key components to ODANN's development phase, namely: (i) data hydration and pre-processing were performed on COVID-19 related Twitter data ranging between 23 January 2020 and 10 May 2020, which amounted to over 100 million Tweets written in English language; (ii) multiple NLP features extraction methods were subsequently leveraged to encode the hydrated Twitter data into useful semantic word vectors for training ODANN under an optimal set of hyperparameters; and (iii) historical time-series data of defined characteristics were also assimilated into ODANN's selected hidden layer(s) to model the G parameter daily with a lead-time of 1 day. By far, our experimental results demonstrated that by adopting a rolling time-window size of 5 days, with respect to the number of historical time-series records for assimilating different data features, enabled ODANN to outperform other traditional time-series models and recent studies, in terms of the computed RMSE and MAE scores attained from the model's testing step. Overall, the summarized results from ODANN demonstrated its competitive edge in modelling and forecasting the growth rate in the number of COVID-19 cases globally.
author2 School of Civil and Environmental Engineering
author_facet School of Civil and Environmental Engineering
Chew, Alvin Wei Ze
Pan, Yue
Wang, Ying
Zhang, Limao
format Article
author Chew, Alvin Wei Ze
Pan, Yue
Wang, Ying
Zhang, Limao
author_sort Chew, Alvin Wei Ze
title Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission
title_short Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission
title_full Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission
title_fullStr Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission
title_full_unstemmed Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission
title_sort hybrid deep learning of social media big data for predicting the evolution of covid-19 transmission
publishDate 2022
url https://hdl.handle.net/10356/160694
_version_ 1743119514571112448