Real-time data-processing framework with model updating for digital twins of water treatment facilities

Machine learning (ML) models are now widely used in digital twins of water treatment facilities. These models are commonly trained based on historical datasets, and their predictions serve various important objectives, such as anomaly detection and optimization. While predictions from the trained mo...

Full description

Saved in:
Bibliographic Details
Main Authors: Wei, Yuying, Law, Adrian Wing-Keung, Yang, Chun
Other Authors: School of Civil and Environmental Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/165234
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-165234
record_format dspace
spelling sg-ntu-dr.10356-1652342023-03-22T15:34:29Z Real-time data-processing framework with model updating for digital twins of water treatment facilities Wei, Yuying Law, Adrian Wing-Keung Yang, Chun School of Civil and Environmental Engineering School of Mechanical and Aerospace Engineering Nanyang Environment and Water Research Institute Environmental Process Modelling Centre Engineering::Environmental engineering Real-Time Model Updating Machine Learning Machine learning (ML) models are now widely used in digital twins of water treatment facilities. These models are commonly trained based on historical datasets, and their predictions serve various important objectives, such as anomaly detection and optimization. While predictions from the trained models are being made continuously for the digital twin, model updating using newly available real-time data is also necessary so that the twin can mimic the changes in the physical system dynamically. Thus, a synchronicity framework needs to be established in the digital twin, which has not been addressed in the literature so far. In this study, a novel framework with new coverage-based algorithms is proposed to determine the necessity and timing for model updating during real-time data transfers to improve the ML performance over time. The framework is tested in a prototype water treatment facility called the secure water treatment (SWaT) system. The results show that the framework performs well in general to synchronize the model updates and predictions, with a significant reduction in errors of up to 97%. The good performance can be attributed particularly to the coverage-based updating algorithms which control the size of training datasets to accelerate the ML model updating during synchronization. National Research Foundation (NRF) Published version This research was funded by the National Research Foundation (NRF), Prime Minister’s Office, Singapore, under its National Cybersecurity R&D Program and administered by the National Satellite of Excellence in Design Science and Technology for Secure Critical Infrastructure, Award No. NSoE_DeST-SCI2019-0011. 2023-03-21T02:51:06Z 2023-03-21T02:51:06Z 2022 Journal Article Wei, Y., Law, A. W. & Yang, C. (2022). Real-time data-processing framework with model updating for digital twins of water treatment facilities. Water, 14(22), 3591-. https://dx.doi.org/10.3390/w14223591 2073-4441 https://hdl.handle.net/10356/165234 10.3390/w14223591 2-s2.0-85142417105 22 14 3591 en NSoE_DeST-SCI2019-0011 Water © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/). application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Environmental engineering
Real-Time Model Updating
Machine Learning
spellingShingle Engineering::Environmental engineering
Real-Time Model Updating
Machine Learning
Wei, Yuying
Law, Adrian Wing-Keung
Yang, Chun
Real-time data-processing framework with model updating for digital twins of water treatment facilities
description Machine learning (ML) models are now widely used in digital twins of water treatment facilities. These models are commonly trained based on historical datasets, and their predictions serve various important objectives, such as anomaly detection and optimization. While predictions from the trained models are being made continuously for the digital twin, model updating using newly available real-time data is also necessary so that the twin can mimic the changes in the physical system dynamically. Thus, a synchronicity framework needs to be established in the digital twin, which has not been addressed in the literature so far. In this study, a novel framework with new coverage-based algorithms is proposed to determine the necessity and timing for model updating during real-time data transfers to improve the ML performance over time. The framework is tested in a prototype water treatment facility called the secure water treatment (SWaT) system. The results show that the framework performs well in general to synchronize the model updates and predictions, with a significant reduction in errors of up to 97%. The good performance can be attributed particularly to the coverage-based updating algorithms which control the size of training datasets to accelerate the ML model updating during synchronization.
author2 School of Civil and Environmental Engineering
author_facet School of Civil and Environmental Engineering
Wei, Yuying
Law, Adrian Wing-Keung
Yang, Chun
format Article
author Wei, Yuying
Law, Adrian Wing-Keung
Yang, Chun
author_sort Wei, Yuying
title Real-time data-processing framework with model updating for digital twins of water treatment facilities
title_short Real-time data-processing framework with model updating for digital twins of water treatment facilities
title_full Real-time data-processing framework with model updating for digital twins of water treatment facilities
title_fullStr Real-time data-processing framework with model updating for digital twins of water treatment facilities
title_full_unstemmed Real-time data-processing framework with model updating for digital twins of water treatment facilities
title_sort real-time data-processing framework with model updating for digital twins of water treatment facilities
publishDate 2023
url https://hdl.handle.net/10356/165234
_version_ 1761781565517463552