Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles

Flooding is a very common natural hazard that causes catastrophic effects worldwide. Recently, ensemble-based techniques have become popular in flood susceptibility modelling due to their greater strength and efficiency in the prediction of flood locations. Thus, the aim of this study was to employ...

Full description

Saved in:
Bibliographic Details
Main Authors: Chen, Wei, Hong, Haoyuan, Li, Shaojun, Shahabi, Himan, Wang, Yi, Wang, Xiaojing, Ahmad, Baharin
Format: Article
Published: Elsevier B. V. 2019
Subjects:
Online Access:http://eprints.utm.my/id/eprint/87726/
http://dx.doi.org/10.1016/j.jhydrol.2019.05.089
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.87726
record_format eprints
spelling my.utm.877262020-11-30T13:15:07Z http://eprints.utm.my/id/eprint/87726/ Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles Chen, Wei Hong, Haoyuan Li, Shaojun Shahabi, Himan Wang, Yi Wang, Xiaojing Ahmad, Baharin TD Environmental technology. Sanitary engineering Flooding is a very common natural hazard that causes catastrophic effects worldwide. Recently, ensemble-based techniques have become popular in flood susceptibility modelling due to their greater strength and efficiency in the prediction of flood locations. Thus, the aim of this study was to employ machine learning-based Reduced-error pruning trees (REPTree) with Bagging (Bag-REPTree) and Random subspace (RS-REPTree) ensemble frameworks for spatial prediction of flood susceptibility using a geographic information system (GIS). First, a flood spatial database was constructed with 363 flood locations and thirteen flood influencing factors, namely altitude, slope angle, slope aspect, curvature, stream power index (SPI), sediment transport index (STI), topographic wetness index (TWI), distance to rivers, normalized difference vegetation index (NDVI), soil, land use, lithology, and rainfall. Subsequently, correlation attribute evaluation (CAE) was used as the factor selection method for optimization of input factors. Finally, the receiver operating characteristic (ROC) curve, standard error (SE), confidence interval (CI) at 95%, and Wilcoxon signed-rank test were used to validate and compare the performance of the models. Results show that the RS-REPTree model has the highest prediction capability for flood susceptibility assessment, with the highest area under (the ROC) curve (AUC) value (0.949, 0.907), the smallest SE (0.011, 0.023), and the narrowest CI (95%) (0.928–0.970, 0.863–0.952) for the training and validation datasets. It was followed by the Bag-REPTree and REPTree models, respectively. The results also proved the superiority of the ensemble method over using these methods individually. Elsevier B. V. 2019 Article PeerReviewed Chen, Wei and Hong, Haoyuan and Li, Shaojun and Shahabi, Himan and Wang, Yi and Wang, Xiaojing and Ahmad, Baharin (2019) Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles. Journal of Hydrology, 575 . pp. 864-873. ISSN 0022-1694 http://dx.doi.org/10.1016/j.jhydrol.2019.05.089
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic TD Environmental technology. Sanitary engineering
spellingShingle TD Environmental technology. Sanitary engineering
Chen, Wei
Hong, Haoyuan
Li, Shaojun
Shahabi, Himan
Wang, Yi
Wang, Xiaojing
Ahmad, Baharin
Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles
description Flooding is a very common natural hazard that causes catastrophic effects worldwide. Recently, ensemble-based techniques have become popular in flood susceptibility modelling due to their greater strength and efficiency in the prediction of flood locations. Thus, the aim of this study was to employ machine learning-based Reduced-error pruning trees (REPTree) with Bagging (Bag-REPTree) and Random subspace (RS-REPTree) ensemble frameworks for spatial prediction of flood susceptibility using a geographic information system (GIS). First, a flood spatial database was constructed with 363 flood locations and thirteen flood influencing factors, namely altitude, slope angle, slope aspect, curvature, stream power index (SPI), sediment transport index (STI), topographic wetness index (TWI), distance to rivers, normalized difference vegetation index (NDVI), soil, land use, lithology, and rainfall. Subsequently, correlation attribute evaluation (CAE) was used as the factor selection method for optimization of input factors. Finally, the receiver operating characteristic (ROC) curve, standard error (SE), confidence interval (CI) at 95%, and Wilcoxon signed-rank test were used to validate and compare the performance of the models. Results show that the RS-REPTree model has the highest prediction capability for flood susceptibility assessment, with the highest area under (the ROC) curve (AUC) value (0.949, 0.907), the smallest SE (0.011, 0.023), and the narrowest CI (95%) (0.928–0.970, 0.863–0.952) for the training and validation datasets. It was followed by the Bag-REPTree and REPTree models, respectively. The results also proved the superiority of the ensemble method over using these methods individually.
format Article
author Chen, Wei
Hong, Haoyuan
Li, Shaojun
Shahabi, Himan
Wang, Yi
Wang, Xiaojing
Ahmad, Baharin
author_facet Chen, Wei
Hong, Haoyuan
Li, Shaojun
Shahabi, Himan
Wang, Yi
Wang, Xiaojing
Ahmad, Baharin
author_sort Chen, Wei
title Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles
title_short Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles
title_full Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles
title_fullStr Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles
title_full_unstemmed Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles
title_sort flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles
publisher Elsevier B. V.
publishDate 2019
url http://eprints.utm.my/id/eprint/87726/
http://dx.doi.org/10.1016/j.jhydrol.2019.05.089
_version_ 1685578979988209664