AIR QUALITY FORECASTING ON TIME SERIES DATA WITH ANOMALIES USING THE LSTM-XGBOOST APPROACH
Air pollution is a global issue that significantly impacts human health and the environment. The COVID-19 pandemic introduced unexpected changes in air quality patterns, necessitating an approach capable of handling data pattern shifts and anomalies. This study aims to demonstrate that LSTM-XGBoo...
Saved in:
Main Author: | |
---|---|
Format: | Theses |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/87887 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:87887 |
---|---|
spelling |
id-itb.:878872025-02-03T22:10:42ZAIR QUALITY FORECASTING ON TIME SERIES DATA WITH ANOMALIES USING THE LSTM-XGBOOST APPROACH Layalia S.A.G., Aurell Indonesia Theses Anomaly Detection, LSTM, LSTM-XGBoost, COVID-19 Pandemic, Air Quality Forecasting, PM10, XGBoost. INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/87887 Air pollution is a global issue that significantly impacts human health and the environment. The COVID-19 pandemic introduced unexpected changes in air quality patterns, necessitating an approach capable of handling data pattern shifts and anomalies. This study aims to demonstrate that LSTM-XGBoost can achieve the best performance in air quality prediction for time series data containing anomalies during the COVID-19 pandemic, compared to XGBoost and LSTM individually. LSTM-XGBoost integrates LSTM to capture temporal patterns and XGBoost to model non-linear relationships. The dataset used in this study includes air pollutant concentrations and meteorological factors from 2020 to 2024, focusing on case studies in Jakarta and Bandung. This approach incorporates temporal event indicators to mark the COVID-19 pandemic, PSBB, and PPKM periods, as well as anomaly detection using Isolation Forest and LSTM Autoencoder to identify and mitigate anomalies in the data. The results show that LSTM-XGBoost achieves the best MAPE performance, with 8.56% for Jakarta and 8.74% for Bandung, outperforming both XGBoost and LSTM individually. The addition of temporal event indicators enhances the model’s ability to recognize changes in data patterns, while anomaly detection allows the model to identify and reduce the impact of anomalies. LSTM-XGBoost proves to be the most effective approach for predicting air quality in time series data containing anomalies. The findings of this study are expected to contribute to air quality management and support the development of predictive models that are responsive to environmental changes. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Air pollution is a global issue that significantly impacts human health and the
environment. The COVID-19 pandemic introduced unexpected changes in air
quality patterns, necessitating an approach capable of handling data pattern shifts
and anomalies. This study aims to demonstrate that LSTM-XGBoost can achieve
the best performance in air quality prediction for time series data containing
anomalies during the COVID-19 pandemic, compared to XGBoost and LSTM
individually.
LSTM-XGBoost integrates LSTM to capture temporal patterns and XGBoost to
model non-linear relationships. The dataset used in this study includes air pollutant
concentrations and meteorological factors from 2020 to 2024, focusing on case
studies in Jakarta and Bandung. This approach incorporates temporal event
indicators to mark the COVID-19 pandemic, PSBB, and PPKM periods, as well as
anomaly detection using Isolation Forest and LSTM Autoencoder to identify and
mitigate anomalies in the data.
The results show that LSTM-XGBoost achieves the best MAPE performance, with
8.56% for Jakarta and 8.74% for Bandung, outperforming both XGBoost and
LSTM individually. The addition of temporal event indicators enhances the model’s
ability to recognize changes in data patterns, while anomaly detection allows the
model to identify and reduce the impact of anomalies. LSTM-XGBoost proves to be
the most effective approach for predicting air quality in time series data containing
anomalies. The findings of this study are expected to contribute to air quality
management and support the development of predictive models that are responsive
to environmental changes. |
format |
Theses |
author |
Layalia S.A.G., Aurell |
spellingShingle |
Layalia S.A.G., Aurell AIR QUALITY FORECASTING ON TIME SERIES DATA WITH ANOMALIES USING THE LSTM-XGBOOST APPROACH |
author_facet |
Layalia S.A.G., Aurell |
author_sort |
Layalia S.A.G., Aurell |
title |
AIR QUALITY FORECASTING ON TIME SERIES DATA WITH ANOMALIES USING THE LSTM-XGBOOST APPROACH |
title_short |
AIR QUALITY FORECASTING ON TIME SERIES DATA WITH ANOMALIES USING THE LSTM-XGBOOST APPROACH |
title_full |
AIR QUALITY FORECASTING ON TIME SERIES DATA WITH ANOMALIES USING THE LSTM-XGBOOST APPROACH |
title_fullStr |
AIR QUALITY FORECASTING ON TIME SERIES DATA WITH ANOMALIES USING THE LSTM-XGBOOST APPROACH |
title_full_unstemmed |
AIR QUALITY FORECASTING ON TIME SERIES DATA WITH ANOMALIES USING THE LSTM-XGBOOST APPROACH |
title_sort |
air quality forecasting on time series data with anomalies using the lstm-xgboost approach |
url |
https://digilib.itb.ac.id/gdl/view/87887 |
_version_ |
1823658307853746176 |