GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms

Rapid urbanization has caused severe deterioration of air quality globally, leading to increased hospitalization and premature deaths. Therefore, accurate prediction of air quality is crucial for mitigation planning to support urban sustainability and resilience. Although some studies have predicted...

Full description

Saved in:
Bibliographic Details
Main Authors: Tella, A., Balogun, A.-L.
Format: Article
Published: Springer Science and Business Media Deutschland GmbH 2021
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85115059955&doi=10.1007%2fs11356-021-16150-0&partnerID=40&md5=d9f30264f43f221194853254359149db
http://eprints.utp.edu.my/29428/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Petronas
id my.utp.eprints.29428
record_format eprints
spelling my.utp.eprints.294282022-03-25T01:52:16Z GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms Tella, A. Balogun, A.-L. Rapid urbanization has caused severe deterioration of air quality globally, leading to increased hospitalization and premature deaths. Therefore, accurate prediction of air quality is crucial for mitigation planning to support urban sustainability and resilience. Although some studies have predicted air pollutants such as particulate matter (PM) using machine learning algorithms (MLAs), there is a paucity of studies on spatial hazard assessment with respect to the air quality index (AQI). Incorporating PM in AQI studies is crucial because of its easily inhalable micro-size which has adverse impacts on ecology, environment, and human health. Accurate and timely prediction of the air quality index can ensure adequate intervention to aid air quality management. Therefore, this study undertakes a spatial hazard assessment of the air quality index using particulate matter with a diameter of 10 μm or lesser (PM10) in Selangor, Malaysia, by developing four machine learning models: eXtreme Gradient Boosting (XGBoost), random forest (RF), K-nearest neighbour (KNN), and Naive Bayes (NB). Spatially processed data such as NDVI, SAVI, BU, LST, Ws, slope, elevation, and road density was used for the modelling. The model was trained with 70 of the dataset, while 30 was used for cross-validation. Results showed that XGBoost has the highest overall accuracy and precision of 0.989 and 0.995, followed by random forest (0.989, 0.993), K-nearest neighbour (0.987, 0.984), and Naive Bayes (0.917, 0.922), respectively. The spatial air quality maps were generated by integrating the geographical information system (GIS) with the four MLAs, which correlated with Malaysia�s air pollution index. The maps indicate that air quality in Selangor is satisfactory and posed no threats to health. Nevertheless, the two algorithms with the best performance (XGBoost and RF) indicate that a high percentage of the air quality is moderate. The study concludes that successful air pollution management policies such as green infrastructure practice, improvement of energy efficiency, and restrictions on heavy-duty vehicles can be adopted in Selangor and other Southeast Asian cities to prevent deterioration of air quality in the future. Graphical abstract: Figure not available: see fulltext. © 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature. Springer Science and Business Media Deutschland GmbH 2021 Article NonPeerReviewed https://www.scopus.com/inward/record.uri?eid=2-s2.0-85115059955&doi=10.1007%2fs11356-021-16150-0&partnerID=40&md5=d9f30264f43f221194853254359149db Tella, A. and Balogun, A.-L. (2021) GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms. Environmental Science and Pollution Research . http://eprints.utp.edu.my/29428/
institution Universiti Teknologi Petronas
building UTP Resource Centre
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Petronas
content_source UTP Institutional Repository
url_provider http://eprints.utp.edu.my/
description Rapid urbanization has caused severe deterioration of air quality globally, leading to increased hospitalization and premature deaths. Therefore, accurate prediction of air quality is crucial for mitigation planning to support urban sustainability and resilience. Although some studies have predicted air pollutants such as particulate matter (PM) using machine learning algorithms (MLAs), there is a paucity of studies on spatial hazard assessment with respect to the air quality index (AQI). Incorporating PM in AQI studies is crucial because of its easily inhalable micro-size which has adverse impacts on ecology, environment, and human health. Accurate and timely prediction of the air quality index can ensure adequate intervention to aid air quality management. Therefore, this study undertakes a spatial hazard assessment of the air quality index using particulate matter with a diameter of 10 μm or lesser (PM10) in Selangor, Malaysia, by developing four machine learning models: eXtreme Gradient Boosting (XGBoost), random forest (RF), K-nearest neighbour (KNN), and Naive Bayes (NB). Spatially processed data such as NDVI, SAVI, BU, LST, Ws, slope, elevation, and road density was used for the modelling. The model was trained with 70 of the dataset, while 30 was used for cross-validation. Results showed that XGBoost has the highest overall accuracy and precision of 0.989 and 0.995, followed by random forest (0.989, 0.993), K-nearest neighbour (0.987, 0.984), and Naive Bayes (0.917, 0.922), respectively. The spatial air quality maps were generated by integrating the geographical information system (GIS) with the four MLAs, which correlated with Malaysia�s air pollution index. The maps indicate that air quality in Selangor is satisfactory and posed no threats to health. Nevertheless, the two algorithms with the best performance (XGBoost and RF) indicate that a high percentage of the air quality is moderate. The study concludes that successful air pollution management policies such as green infrastructure practice, improvement of energy efficiency, and restrictions on heavy-duty vehicles can be adopted in Selangor and other Southeast Asian cities to prevent deterioration of air quality in the future. Graphical abstract: Figure not available: see fulltext. © 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
format Article
author Tella, A.
Balogun, A.-L.
spellingShingle Tella, A.
Balogun, A.-L.
GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms
author_facet Tella, A.
Balogun, A.-L.
author_sort Tella, A.
title GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms
title_short GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms
title_full GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms
title_fullStr GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms
title_full_unstemmed GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms
title_sort gis-based air quality modelling: spatial prediction of pm10 for selangor state, malaysia using machine learning algorithms
publisher Springer Science and Business Media Deutschland GmbH
publishDate 2021
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85115059955&doi=10.1007%2fs11356-021-16150-0&partnerID=40&md5=d9f30264f43f221194853254359149db
http://eprints.utp.edu.my/29428/
_version_ 1738656964411392000