DEVELOPMENT OF MODIFIED CART ALGORITHM FOR CLASSIFYING PADDY GROWTH STAGES WITH GRID SEARCH HYPERPARAMETER TUNING METHOD IN KARAWANG REGENCY, WEST JAVA
Indonesia is an agricultural country with one of the main agricultural commodities being rice. One of the areas that is the center of rice cultivation in Indonesia, especially in Karawang Regency, West Java. There are several methods of measuring rice production data, one of which is the ASF stat...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/75525 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:75525 |
---|---|
spelling |
id-itb.:755252023-08-02T13:15:07ZDEVELOPMENT OF MODIFIED CART ALGORITHM FOR CLASSIFYING PADDY GROWTH STAGES WITH GRID SEARCH HYPERPARAMETER TUNING METHOD IN KARAWANG REGENCY, WEST JAVA Fawziyya Masnur, Nadira Indonesia Final Project Paddy Growth Stages, Machine Learning, KSA, Modified CART algorithm, Hyperparameter Tuning, ensemble stacking, oversampling. INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/75525 Indonesia is an agricultural country with one of the main agricultural commodities being rice. One of the areas that is the center of rice cultivation in Indonesia, especially in Karawang Regency, West Java. There are several methods of measuring rice production data, one of which is the ASF statistical method. Apart from using these statistical methods, several studies are showing that machine learning models can be developed to predict the rice growing phase precisely and accurately. In this final project, the development of the CART algorithm which is being researched at BRIN is carried out as a method of measuring rice production data. The development of a modified CART algorithm is aimed at improving the performance of the CART model in classifying rice growing phases in the study area of Karawang district, West Java in 2020–2021. The development was carried out using KSA sample data extrapolated with attribute data from the Sentinel-1A SAR satellite imagery. The CART model is modified using the grid search hyperparameter tuning method and ensemble stacking with a random forest model, support vector machine, and gradient boosting as the final estimator. The ensemble stacking method uses a passthrough parameter with a value of True which indicates that the training attribute data is also used to train the final estimator. In addition, several experiments were carried out to modify the data in the form of adding NDVI and NDBI attributes obtained from Landsat-8 OLI imagery and oversampling the rice growing phase class samples (1, 2, 3, and 4). Model development is carried out by fitting the 2020 ASF sample data in the study area. Furthermore, model testing is carried out by comparing the model predictions to the KSA sample data in 2021. The evaluation metrics used are overall accuracy, f-1 score in the rice growing phase class, and kappa score. The result of the best modified CART algorithm obtained from this series of final assignments is a stacking CART model with gradient boosting that uses sentinel-1A SAR image data that has been oversampled on its training data. The CART base classifier model used in stacking CART with gradient boosting is a tuned CART hyperparameter model with optimal hyperparameter results being maxnodes of 1000 and minsamplesleaf of 2. The best modified CART algorithm model appears to increase the OA of the baseline CART model with the lowest difference of 0.70% in training data September 2020 and the highest difference was 6.50% in the training data January 2020. In testing the model using NDVI and NDBI attribute data, it was found that in months with image data that had a low level of cloud cover it succeeded in increasing the OA model, but for months with a high level of cloud cover is seen to reduce the OA of the model. In addition, the application of oversampling also shows an increase in model performance in estimating the area of the rice growing phase class area when compared to the estimated area based on the calculation of the ASF statistical method. Therefore, it is found that stacking CART with gradient boosting has better performance than other models. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Indonesia is an agricultural country with one of the main agricultural commodities being rice.
One of the areas that is the center of rice cultivation in Indonesia, especially in Karawang
Regency, West Java. There are several methods of measuring rice production data, one of
which is the ASF statistical method. Apart from using these statistical methods, several studies
are showing that machine learning models can be developed to predict the rice growing phase
precisely and accurately. In this final project, the development of the CART algorithm which
is being researched at BRIN is carried out as a method of measuring rice production data. The
development of a modified CART algorithm is aimed at improving the performance of the
CART model in classifying rice growing phases in the study area of Karawang district, West
Java in 2020–2021. The development was carried out using KSA sample data extrapolated with
attribute data from the Sentinel-1A SAR satellite imagery. The CART model is modified using
the grid search hyperparameter tuning method and ensemble stacking with a random forest
model, support vector machine, and gradient boosting as the final estimator. The ensemble
stacking method uses a passthrough parameter with a value of True which indicates that the
training attribute data is also used to train the final estimator. In addition, several experiments
were carried out to modify the data in the form of adding NDVI and NDBI attributes obtained
from Landsat-8 OLI imagery and oversampling the rice growing phase class samples (1, 2, 3,
and 4). Model development is carried out by fitting the 2020 ASF sample data in the study area.
Furthermore, model testing is carried out by comparing the model predictions to the KSA
sample data in 2021. The evaluation metrics used are overall accuracy, f-1 score in the rice
growing phase class, and kappa score. The result of the best modified CART algorithm
obtained from this series of final assignments is a stacking CART model with gradient boosting
that uses sentinel-1A SAR image data that has been oversampled on its training data. The
CART base classifier model used in stacking CART with gradient boosting is a tuned CART
hyperparameter model with optimal hyperparameter results being maxnodes of 1000 and
minsamplesleaf of 2. The best modified CART algorithm model appears to increase the OA of
the baseline CART model with the lowest difference of 0.70% in training data September 2020
and the highest difference was 6.50% in the training data January 2020. In testing the model
using NDVI and NDBI attribute data, it was found that in months with image data that had a
low level of cloud cover it succeeded in increasing the OA model, but for months with a high
level of cloud cover is seen to reduce the OA of the model. In addition, the application of
oversampling also shows an increase in model performance in estimating the area of the rice
growing phase class area when compared to the estimated area based on the calculation of the
ASF statistical method. Therefore, it is found that stacking CART with gradient boosting has
better performance than other models. |
format |
Final Project |
author |
Fawziyya Masnur, Nadira |
spellingShingle |
Fawziyya Masnur, Nadira DEVELOPMENT OF MODIFIED CART ALGORITHM FOR CLASSIFYING PADDY GROWTH STAGES WITH GRID SEARCH HYPERPARAMETER TUNING METHOD IN KARAWANG REGENCY, WEST JAVA |
author_facet |
Fawziyya Masnur, Nadira |
author_sort |
Fawziyya Masnur, Nadira |
title |
DEVELOPMENT OF MODIFIED CART ALGORITHM FOR CLASSIFYING PADDY GROWTH STAGES WITH GRID SEARCH HYPERPARAMETER TUNING METHOD IN KARAWANG REGENCY, WEST JAVA |
title_short |
DEVELOPMENT OF MODIFIED CART ALGORITHM FOR CLASSIFYING PADDY GROWTH STAGES WITH GRID SEARCH HYPERPARAMETER TUNING METHOD IN KARAWANG REGENCY, WEST JAVA |
title_full |
DEVELOPMENT OF MODIFIED CART ALGORITHM FOR CLASSIFYING PADDY GROWTH STAGES WITH GRID SEARCH HYPERPARAMETER TUNING METHOD IN KARAWANG REGENCY, WEST JAVA |
title_fullStr |
DEVELOPMENT OF MODIFIED CART ALGORITHM FOR CLASSIFYING PADDY GROWTH STAGES WITH GRID SEARCH HYPERPARAMETER TUNING METHOD IN KARAWANG REGENCY, WEST JAVA |
title_full_unstemmed |
DEVELOPMENT OF MODIFIED CART ALGORITHM FOR CLASSIFYING PADDY GROWTH STAGES WITH GRID SEARCH HYPERPARAMETER TUNING METHOD IN KARAWANG REGENCY, WEST JAVA |
title_sort |
development of modified cart algorithm for classifying paddy growth stages with grid search hyperparameter tuning method in karawang regency, west java |
url |
https://digilib.itb.ac.id/gdl/view/75525 |
_version_ |
1822007709865607168 |