LANDSLIDE SUSCEPTIBILITY MODELING USING WOE-RF-SBS (WEIGHT OF EVIDENCE-RANDOM FOREST-WITH SPATIALLY BALANCED SAMPLING) METHOD BASED ON MACHINE LEARNING AND BIVARIATE STATISTICAL ANALYSIS (CASE STUDY: CISANGKUY SUB-WATERSHED)
Landslide susceptibility modeling (LSM) is necessary as an initial effort in disaster mitigation. State of the art in study of LSM is an integrated method of bivariate statistics and machine learning. In machine learning-based modeling, the response variable has two classes, i.e., landslide presence...
Saved in:
Main Author: | |
---|---|
Format: | Dissertations |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/70973 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:70973 |
---|---|
spelling |
id-itb.:709732023-01-25T13:19:09ZLANDSLIDE SUSCEPTIBILITY MODELING USING WOE-RF-SBS (WEIGHT OF EVIDENCE-RANDOM FOREST-WITH SPATIALLY BALANCED SAMPLING) METHOD BASED ON MACHINE LEARNING AND BIVARIATE STATISTICAL ANALYSIS (CASE STUDY: CISANGKUY SUB-WATERSHED) Sukristiyanti Indonesia Dissertations bivariate statistics, landslide susceptibility modeling, machine learning, non-landslide data. INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/70973 Landslide susceptibility modeling (LSM) is necessary as an initial effort in disaster mitigation. State of the art in study of LSM is an integrated method of bivariate statistics and machine learning. In machine learning-based modeling, the response variable has two classes, i.e., landslide presence data and landslide absence data (non-landslide). A representative approach to determining non-landslide data does not yet exist and has not been widely discussed, and then this research was conducted to answer these problems. The novelty of this research is to develop a new approach to obtain more representative non-landslide data. The new approach is to take non-landslide samples randomly in the lowest zone of the LSM bivariate statistic. The non-landslide samples of the new approach were compared with the non-landslide samples of the previous approaches by testing them on machine learning-based LSM modeling integrated with bivariate statistics. Two machine learning methods, namely random forest (RF) and support vector machine (SVM), were integrated with the weight of evidence (WoE). The WoE was chosen from the other three bivariate statistical methods used here (frequency ratio, information value model, and Shannon entropy) then it was chosen as an input in this integrated modeling. Therefore, WoE values were used not only to convert predictor variables which are nominal ones, into numeric ones but also to obtain the non-landslide samples. The results showed that using non-landslide samples from the new approach could improve the LSM model compared to using non-landslide samples from the previous approaches. The model's accuracy by using the non-landslide samples from the new approach increased significantly (±10% – 40%) on balanced data both in number and location (spatial). text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Landslide susceptibility modeling (LSM) is necessary as an initial effort in disaster mitigation. State of the art in study of LSM is an integrated method of bivariate statistics and machine learning. In machine learning-based modeling, the response variable has two classes, i.e., landslide presence data and landslide absence data (non-landslide). A representative approach to determining non-landslide data does not yet exist and has not been widely discussed, and then this research was conducted to answer these problems.
The novelty of this research is to develop a new approach to obtain more representative non-landslide data. The new approach is to take non-landslide samples randomly in the lowest zone of the LSM bivariate statistic. The non-landslide samples of the new approach were compared with the non-landslide samples of the previous approaches by testing them on machine learning-based LSM modeling integrated with bivariate statistics. Two machine learning methods, namely random forest (RF) and support vector machine (SVM), were integrated with the weight of evidence (WoE). The WoE was chosen from the other three bivariate statistical methods used here (frequency ratio, information value model, and Shannon entropy) then it was chosen as an input in this integrated modeling. Therefore, WoE values were used not only to convert predictor variables which are nominal ones, into numeric ones but also to obtain the non-landslide samples.
The results showed that using non-landslide samples from the new approach could improve the LSM model compared to using non-landslide samples from the previous approaches. The model's accuracy by using the non-landslide samples from the new approach increased significantly (±10% – 40%) on balanced data both in number and location (spatial). |
format |
Dissertations |
author |
Sukristiyanti |
spellingShingle |
Sukristiyanti LANDSLIDE SUSCEPTIBILITY MODELING USING WOE-RF-SBS (WEIGHT OF EVIDENCE-RANDOM FOREST-WITH SPATIALLY BALANCED SAMPLING) METHOD BASED ON MACHINE LEARNING AND BIVARIATE STATISTICAL ANALYSIS (CASE STUDY: CISANGKUY SUB-WATERSHED) |
author_facet |
Sukristiyanti |
author_sort |
Sukristiyanti |
title |
LANDSLIDE SUSCEPTIBILITY MODELING USING WOE-RF-SBS (WEIGHT OF EVIDENCE-RANDOM FOREST-WITH SPATIALLY BALANCED SAMPLING) METHOD BASED ON MACHINE LEARNING AND BIVARIATE STATISTICAL ANALYSIS (CASE STUDY: CISANGKUY SUB-WATERSHED) |
title_short |
LANDSLIDE SUSCEPTIBILITY MODELING USING WOE-RF-SBS (WEIGHT OF EVIDENCE-RANDOM FOREST-WITH SPATIALLY BALANCED SAMPLING) METHOD BASED ON MACHINE LEARNING AND BIVARIATE STATISTICAL ANALYSIS (CASE STUDY: CISANGKUY SUB-WATERSHED) |
title_full |
LANDSLIDE SUSCEPTIBILITY MODELING USING WOE-RF-SBS (WEIGHT OF EVIDENCE-RANDOM FOREST-WITH SPATIALLY BALANCED SAMPLING) METHOD BASED ON MACHINE LEARNING AND BIVARIATE STATISTICAL ANALYSIS (CASE STUDY: CISANGKUY SUB-WATERSHED) |
title_fullStr |
LANDSLIDE SUSCEPTIBILITY MODELING USING WOE-RF-SBS (WEIGHT OF EVIDENCE-RANDOM FOREST-WITH SPATIALLY BALANCED SAMPLING) METHOD BASED ON MACHINE LEARNING AND BIVARIATE STATISTICAL ANALYSIS (CASE STUDY: CISANGKUY SUB-WATERSHED) |
title_full_unstemmed |
LANDSLIDE SUSCEPTIBILITY MODELING USING WOE-RF-SBS (WEIGHT OF EVIDENCE-RANDOM FOREST-WITH SPATIALLY BALANCED SAMPLING) METHOD BASED ON MACHINE LEARNING AND BIVARIATE STATISTICAL ANALYSIS (CASE STUDY: CISANGKUY SUB-WATERSHED) |
title_sort |
landslide susceptibility modeling using woe-rf-sbs (weight of evidence-random forest-with spatially balanced sampling) method based on machine learning and bivariate statistical analysis (case study: cisangkuy sub-watershed) |
url |
https://digilib.itb.ac.id/gdl/view/70973 |
_version_ |
1822278918322782208 |