Optimized conditioning factors using machine learning techniques for groundwater potential mapping

Assessment of the most appropriate groundwater conditioning factors (GCFs) is essential when performing analyses for groundwater potential mapping. For this reason, in this work, we look at three statistical factor analysis methods—Variance Inflation Factor (VIF), Chi-Square Factor Optimization, and...

Full description

Saved in:
Bibliographic Details
Main Authors: Kalantar, Bahareh, Al-Najjar, Husam A. H., Pradhan, Biswajeet, Saeidi, Vahideh, Abdul Halin, Alfian, Ueda, Naonori, Naghibi, Seyed Amir
Format: Article
Language:English
Published: MDPI 2019
Online Access:http://psasir.upm.edu.my/id/eprint/38259/1/38259.pdf
http://psasir.upm.edu.my/id/eprint/38259/
https://www.mdpi.com/2073-4441/11/9/1909
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Putra Malaysia
Language: English
id my.upm.eprints.38259
record_format eprints
spelling my.upm.eprints.382592020-05-04T16:08:41Z http://psasir.upm.edu.my/id/eprint/38259/ Optimized conditioning factors using machine learning techniques for groundwater potential mapping Kalantar, Bahareh Al-Najjar, Husam A. H. Pradhan, Biswajeet Saeidi, Vahideh Abdul Halin, Alfian Ueda, Naonori Naghibi, Seyed Amir Assessment of the most appropriate groundwater conditioning factors (GCFs) is essential when performing analyses for groundwater potential mapping. For this reason, in this work, we look at three statistical factor analysis methods—Variance Inflation Factor (VIF), Chi-Square Factor Optimization, and Gini Importance—to measure the significance of GCFs. From a total of 15 frequently used GCFs, 11 most effective ones (i.e., altitude, slope angle, plan curvature, profile curvature, topographic wetness index, distance from river, distance from fault, river density, fault density, land use, and lithology) were finally selected. In addition, 917 spring locations were identified and used to train and test three machine learning algorithms, namely Mixture Discriminant Analysis (MDA), Linear Discriminant Analysis (LDA) and Random Forest (RF). The resultant trained models were then applied for groundwater potential prediction and mapping in the Haraz basin of Mazandaran province, Iran. MDA has been successfully applied for soil erosion and landslide mapping, but has not yet been fully explored for groundwater potential mapping (GPM). Although other discriminant methods, such as LDA, exist, MDA is worth exploring due to its capability to model multivariate nonlinear relationships between variables; it also undertakes a mixture of unobserved subclasses with regularization of non-linear decision boundaries, which could potentially provide more accurate classification. For the validation, areas under Receiver Operating Characteristics (ROC) curves (AUC) were calculated for the three algorithms. RF performed better with AUC value of 84.4%, while MDA and LDA yielded 75.2% and 74.9%, respectively. Although MDA performance is lower than RF, the result is satisfactory, because it is within the acceptable standard of environmental modeling. The outcome of factor analysis and groundwater maps emphasizes on optimization of multicolinearity factors for faster spatial modeling and provides valuable information for government agencies and private sectors to effectively manage groundwater in the region. MDPI 2019 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/38259/1/38259.pdf Kalantar, Bahareh and Al-Najjar, Husam A. H. and Pradhan, Biswajeet and Saeidi, Vahideh and Abdul Halin, Alfian and Ueda, Naonori and Naghibi, Seyed Amir (2019) Optimized conditioning factors using machine learning techniques for groundwater potential mapping. Water, 11 (9). art. no. 1909. pp. 1-21. ISSN 2073-4441 https://www.mdpi.com/2073-4441/11/9/1909 10.3390/w11091909
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
language English
description Assessment of the most appropriate groundwater conditioning factors (GCFs) is essential when performing analyses for groundwater potential mapping. For this reason, in this work, we look at three statistical factor analysis methods—Variance Inflation Factor (VIF), Chi-Square Factor Optimization, and Gini Importance—to measure the significance of GCFs. From a total of 15 frequently used GCFs, 11 most effective ones (i.e., altitude, slope angle, plan curvature, profile curvature, topographic wetness index, distance from river, distance from fault, river density, fault density, land use, and lithology) were finally selected. In addition, 917 spring locations were identified and used to train and test three machine learning algorithms, namely Mixture Discriminant Analysis (MDA), Linear Discriminant Analysis (LDA) and Random Forest (RF). The resultant trained models were then applied for groundwater potential prediction and mapping in the Haraz basin of Mazandaran province, Iran. MDA has been successfully applied for soil erosion and landslide mapping, but has not yet been fully explored for groundwater potential mapping (GPM). Although other discriminant methods, such as LDA, exist, MDA is worth exploring due to its capability to model multivariate nonlinear relationships between variables; it also undertakes a mixture of unobserved subclasses with regularization of non-linear decision boundaries, which could potentially provide more accurate classification. For the validation, areas under Receiver Operating Characteristics (ROC) curves (AUC) were calculated for the three algorithms. RF performed better with AUC value of 84.4%, while MDA and LDA yielded 75.2% and 74.9%, respectively. Although MDA performance is lower than RF, the result is satisfactory, because it is within the acceptable standard of environmental modeling. The outcome of factor analysis and groundwater maps emphasizes on optimization of multicolinearity factors for faster spatial modeling and provides valuable information for government agencies and private sectors to effectively manage groundwater in the region.
format Article
author Kalantar, Bahareh
Al-Najjar, Husam A. H.
Pradhan, Biswajeet
Saeidi, Vahideh
Abdul Halin, Alfian
Ueda, Naonori
Naghibi, Seyed Amir
spellingShingle Kalantar, Bahareh
Al-Najjar, Husam A. H.
Pradhan, Biswajeet
Saeidi, Vahideh
Abdul Halin, Alfian
Ueda, Naonori
Naghibi, Seyed Amir
Optimized conditioning factors using machine learning techniques for groundwater potential mapping
author_facet Kalantar, Bahareh
Al-Najjar, Husam A. H.
Pradhan, Biswajeet
Saeidi, Vahideh
Abdul Halin, Alfian
Ueda, Naonori
Naghibi, Seyed Amir
author_sort Kalantar, Bahareh
title Optimized conditioning factors using machine learning techniques for groundwater potential mapping
title_short Optimized conditioning factors using machine learning techniques for groundwater potential mapping
title_full Optimized conditioning factors using machine learning techniques for groundwater potential mapping
title_fullStr Optimized conditioning factors using machine learning techniques for groundwater potential mapping
title_full_unstemmed Optimized conditioning factors using machine learning techniques for groundwater potential mapping
title_sort optimized conditioning factors using machine learning techniques for groundwater potential mapping
publisher MDPI
publishDate 2019
url http://psasir.upm.edu.my/id/eprint/38259/1/38259.pdf
http://psasir.upm.edu.my/id/eprint/38259/
https://www.mdpi.com/2073-4441/11/9/1909
_version_ 1665895969048231936