Soil database development with the application of machine learning methods in soil properties prediction
Excessive rainwater infiltration can be an important causal agent of both slope and whole tree uprooting failures. Early warnings or stabilization measures on high-risk slopes or trees are critically important. To identify the high-risk areas, it is necessary to conduct seepage, slope and tree stabi...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/169077 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-169077 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1690772023-06-28T05:32:06Z Soil database development with the application of machine learning methods in soil properties prediction Li, Yangyang Rahardjo, Harianto Satyanaga, Alfrendo Rangarajan, Saranya Lee, Daryl Tsen-Tieng School of Civil and Environmental Engineering Engineering::Civil engineering Soil Database Unsaturated Soil Excessive rainwater infiltration can be an important causal agent of both slope and whole tree uprooting failures. Early warnings or stabilization measures on high-risk slopes or trees are critically important. To identify the high-risk areas, it is necessary to conduct seepage, slope and tree stability analyses over a large region. Given the spatial variability of soil properties, a soil database is therefore required before performing distributed or Geographical Information System (GIS) -based water balance and stability analyses. Considering that the unsaturated soil properties could be very different from saturated soil properties, in this study, a soil database containing both saturated and unsaturated hydraulic and mechanical soil properties was developed for the first time. Machine learning methods were used to predict the unknown soil properties. Based on the predicted soil properties, spatial distributions of different saturated and unsaturated soil properties were generated using the ordinary kriging method. Then the soil database was developed with Singapore island being divided into 97 zones, with each zone having similar soil properties. In this study, the importance of different input variables in soil properties prediction was also investigated. In addition to soil plasticity (i.e., Liquid Limit (LL), Plastic Limit (PL) and Plasticity Index (PI)) and grain size distribution (i.e., gravel, sand, and fines fractions), location (i.e., longitude and latitude) was found to be of high importance as well and are recommended to be used as input variables to predict soil properties, especially when data volume is relatively limited. For those soil properties that cover a large range of values, model performance is better when logarithm values were used as the outputs. Moreover, given the possible correlation between some output parameters, the prediction of the Soil-water Characteristic Curve (SWCC) from a multi-output model is recommended after comparing its performance with a single output model. Furthermore, the performance of two commonly used machine learning methods (i.e., random forest regression and artificial neural network) in soil properties prediction were compared and the prediction error resulting from the random forest regression method was generally smaller. The developed database includes the mean values of saturated permeability, saturated and unsaturated shear strength parameters, and SWCC in each zone. The database can be applied in regional GIS-based water balance and slope stability analyses to account for the spatial heterogeneity instead of assuming constant soil properties. National Parks Board The authors would like to acknowledge the support from National Parks Board, Singapore through the “Eye-on-Trees” project. 2023-06-28T05:32:05Z 2023-06-28T05:32:05Z 2022 Journal Article Li, Y., Rahardjo, H., Satyanaga, A., Rangarajan, S. & Lee, D. T. (2022). Soil database development with the application of machine learning methods in soil properties prediction. Engineering Geology, 306, 106769-. https://dx.doi.org/10.1016/j.enggeo.2022.106769 0013-7952 https://hdl.handle.net/10356/169077 10.1016/j.enggeo.2022.106769 2-s2.0-85145807261 306 106769 en Engineering Geology © 2022 Elsevier B.V. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Civil engineering Soil Database Unsaturated Soil |
spellingShingle |
Engineering::Civil engineering Soil Database Unsaturated Soil Li, Yangyang Rahardjo, Harianto Satyanaga, Alfrendo Rangarajan, Saranya Lee, Daryl Tsen-Tieng Soil database development with the application of machine learning methods in soil properties prediction |
description |
Excessive rainwater infiltration can be an important causal agent of both slope and whole tree uprooting failures. Early warnings or stabilization measures on high-risk slopes or trees are critically important. To identify the high-risk areas, it is necessary to conduct seepage, slope and tree stability analyses over a large region. Given the spatial variability of soil properties, a soil database is therefore required before performing distributed or Geographical Information System (GIS) -based water balance and stability analyses. Considering that the unsaturated soil properties could be very different from saturated soil properties, in this study, a soil database containing both saturated and unsaturated hydraulic and mechanical soil properties was developed for the first time. Machine learning methods were used to predict the unknown soil properties. Based on the predicted soil properties, spatial distributions of different saturated and unsaturated soil properties were generated using the ordinary kriging method. Then the soil database was developed with Singapore island being divided into 97 zones, with each zone having similar soil properties. In this study, the importance of different input variables in soil properties prediction was also investigated. In addition to soil plasticity (i.e., Liquid Limit (LL), Plastic Limit (PL) and Plasticity Index (PI)) and grain size distribution (i.e., gravel, sand, and fines fractions), location (i.e., longitude and latitude) was found to be of high importance as well and are recommended to be used as input variables to predict soil properties, especially when data volume is relatively limited. For those soil properties that cover a large range of values, model performance is better when logarithm values were used as the outputs. Moreover, given the possible correlation between some output parameters, the prediction of the Soil-water Characteristic Curve (SWCC) from a multi-output model is recommended after comparing its performance with a single output model. Furthermore, the performance of two commonly used machine learning methods (i.e., random forest regression and artificial neural network) in soil properties prediction were compared and the prediction error resulting from the random forest regression method was generally smaller. The developed database includes the mean values of saturated permeability, saturated and unsaturated shear strength parameters, and SWCC in each zone. The database can be applied in regional GIS-based water balance and slope stability analyses to account for the spatial heterogeneity instead of assuming constant soil properties. |
author2 |
School of Civil and Environmental Engineering |
author_facet |
School of Civil and Environmental Engineering Li, Yangyang Rahardjo, Harianto Satyanaga, Alfrendo Rangarajan, Saranya Lee, Daryl Tsen-Tieng |
format |
Article |
author |
Li, Yangyang Rahardjo, Harianto Satyanaga, Alfrendo Rangarajan, Saranya Lee, Daryl Tsen-Tieng |
author_sort |
Li, Yangyang |
title |
Soil database development with the application of machine learning methods in soil properties prediction |
title_short |
Soil database development with the application of machine learning methods in soil properties prediction |
title_full |
Soil database development with the application of machine learning methods in soil properties prediction |
title_fullStr |
Soil database development with the application of machine learning methods in soil properties prediction |
title_full_unstemmed |
Soil database development with the application of machine learning methods in soil properties prediction |
title_sort |
soil database development with the application of machine learning methods in soil properties prediction |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/169077 |
_version_ |
1772825208786255872 |