Optimized conditioning factors using machine learning techniques for groundwater potential mapping
Assessment of the most appropriate groundwater conditioning factors (GCFs) is essential when performing analyses for groundwater potential mapping. For this reason, in this work, we look at three statistical factor analysis methods—Variance Inflation Factor (VIF), Chi-Square Factor Optimization, and...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI
2019
|
Online Access: | http://psasir.upm.edu.my/id/eprint/38259/1/38259.pdf http://psasir.upm.edu.my/id/eprint/38259/ https://www.mdpi.com/2073-4441/11/9/1909 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Putra Malaysia |
Language: | English |
id |
my.upm.eprints.38259 |
---|---|
record_format |
eprints |
spelling |
my.upm.eprints.382592020-05-04T16:08:41Z http://psasir.upm.edu.my/id/eprint/38259/ Optimized conditioning factors using machine learning techniques for groundwater potential mapping Kalantar, Bahareh Al-Najjar, Husam A. H. Pradhan, Biswajeet Saeidi, Vahideh Abdul Halin, Alfian Ueda, Naonori Naghibi, Seyed Amir Assessment of the most appropriate groundwater conditioning factors (GCFs) is essential when performing analyses for groundwater potential mapping. For this reason, in this work, we look at three statistical factor analysis methods—Variance Inflation Factor (VIF), Chi-Square Factor Optimization, and Gini Importance—to measure the significance of GCFs. From a total of 15 frequently used GCFs, 11 most effective ones (i.e., altitude, slope angle, plan curvature, profile curvature, topographic wetness index, distance from river, distance from fault, river density, fault density, land use, and lithology) were finally selected. In addition, 917 spring locations were identified and used to train and test three machine learning algorithms, namely Mixture Discriminant Analysis (MDA), Linear Discriminant Analysis (LDA) and Random Forest (RF). The resultant trained models were then applied for groundwater potential prediction and mapping in the Haraz basin of Mazandaran province, Iran. MDA has been successfully applied for soil erosion and landslide mapping, but has not yet been fully explored for groundwater potential mapping (GPM). Although other discriminant methods, such as LDA, exist, MDA is worth exploring due to its capability to model multivariate nonlinear relationships between variables; it also undertakes a mixture of unobserved subclasses with regularization of non-linear decision boundaries, which could potentially provide more accurate classification. For the validation, areas under Receiver Operating Characteristics (ROC) curves (AUC) were calculated for the three algorithms. RF performed better with AUC value of 84.4%, while MDA and LDA yielded 75.2% and 74.9%, respectively. Although MDA performance is lower than RF, the result is satisfactory, because it is within the acceptable standard of environmental modeling. The outcome of factor analysis and groundwater maps emphasizes on optimization of multicolinearity factors for faster spatial modeling and provides valuable information for government agencies and private sectors to effectively manage groundwater in the region. MDPI 2019 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/38259/1/38259.pdf Kalantar, Bahareh and Al-Najjar, Husam A. H. and Pradhan, Biswajeet and Saeidi, Vahideh and Abdul Halin, Alfian and Ueda, Naonori and Naghibi, Seyed Amir (2019) Optimized conditioning factors using machine learning techniques for groundwater potential mapping. Water, 11 (9). art. no. 1909. pp. 1-21. ISSN 2073-4441 https://www.mdpi.com/2073-4441/11/9/1909 10.3390/w11091909 |
institution |
Universiti Putra Malaysia |
building |
UPM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Putra Malaysia |
content_source |
UPM Institutional Repository |
url_provider |
http://psasir.upm.edu.my/ |
language |
English |
description |
Assessment of the most appropriate groundwater conditioning factors (GCFs) is essential when performing analyses for groundwater potential mapping. For this reason, in this work, we look at three statistical factor analysis methods—Variance Inflation Factor (VIF), Chi-Square Factor Optimization, and Gini Importance—to measure the significance of GCFs. From a total of 15 frequently used GCFs, 11 most effective ones (i.e., altitude, slope angle, plan curvature, profile curvature, topographic wetness index, distance from river, distance from fault, river density, fault density, land use, and lithology) were finally selected. In addition, 917 spring locations were identified and used to train and test three machine learning algorithms, namely Mixture Discriminant Analysis (MDA), Linear Discriminant Analysis (LDA) and Random Forest (RF). The resultant trained models were then applied for groundwater potential prediction and mapping in the Haraz basin of Mazandaran province, Iran. MDA has been successfully applied for soil erosion and landslide mapping, but has not yet been fully explored for groundwater potential mapping (GPM). Although other discriminant methods, such as LDA, exist, MDA is worth exploring due to its capability to model multivariate nonlinear relationships between variables; it also undertakes a mixture of unobserved subclasses with regularization of non-linear decision boundaries, which could potentially provide more accurate classification. For the validation, areas under Receiver Operating Characteristics (ROC) curves (AUC) were calculated for the three algorithms. RF performed better with AUC value of 84.4%, while MDA and LDA yielded 75.2% and 74.9%, respectively. Although MDA performance is lower than RF, the result is satisfactory, because it is within the acceptable standard of environmental modeling. The outcome of factor analysis and groundwater maps emphasizes on optimization of multicolinearity factors for faster spatial modeling and provides valuable information for government agencies and private sectors to effectively manage groundwater in the region. |
format |
Article |
author |
Kalantar, Bahareh Al-Najjar, Husam A. H. Pradhan, Biswajeet Saeidi, Vahideh Abdul Halin, Alfian Ueda, Naonori Naghibi, Seyed Amir |
spellingShingle |
Kalantar, Bahareh Al-Najjar, Husam A. H. Pradhan, Biswajeet Saeidi, Vahideh Abdul Halin, Alfian Ueda, Naonori Naghibi, Seyed Amir Optimized conditioning factors using machine learning techniques for groundwater potential mapping |
author_facet |
Kalantar, Bahareh Al-Najjar, Husam A. H. Pradhan, Biswajeet Saeidi, Vahideh Abdul Halin, Alfian Ueda, Naonori Naghibi, Seyed Amir |
author_sort |
Kalantar, Bahareh |
title |
Optimized conditioning factors using machine learning techniques for groundwater potential mapping |
title_short |
Optimized conditioning factors using machine learning techniques for groundwater potential mapping |
title_full |
Optimized conditioning factors using machine learning techniques for groundwater potential mapping |
title_fullStr |
Optimized conditioning factors using machine learning techniques for groundwater potential mapping |
title_full_unstemmed |
Optimized conditioning factors using machine learning techniques for groundwater potential mapping |
title_sort |
optimized conditioning factors using machine learning techniques for groundwater potential mapping |
publisher |
MDPI |
publishDate |
2019 |
url |
http://psasir.upm.edu.my/id/eprint/38259/1/38259.pdf http://psasir.upm.edu.my/id/eprint/38259/ https://www.mdpi.com/2073-4441/11/9/1909 |
_version_ |
1665895969048231936 |