ADAPTIVE RIDGE REGRESSION-FS WITH WEIGHTED NORMALIZATION OF INDONESIA'S GDP GROWTH PREDICTION USING GOOGLE TRENDS

Adaptive Ridge Regression Selection Operator (ARSO) is an elaboration of the Ridge Regression by adjusting the variable selection. In ARSO the weighting is calculated using the Ordinary Least Squares (OLS) and the weighting formulation is still using conventional methods. Therefore, in this re...

Full description

Saved in:
Bibliographic Details
Main Author: Suwardiman
Format: Theses
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/55677
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:55677
spelling id-itb.:556772021-06-18T13:52:12ZADAPTIVE RIDGE REGRESSION-FS WITH WEIGHTED NORMALIZATION OF INDONESIA'S GDP GROWTH PREDICTION USING GOOGLE TRENDS Suwardiman Indonesia Theses Ridge Regression, GDP, Google Trends, Lasso, RReliefF, CFS, Variable Selection INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/55677 Adaptive Ridge Regression Selection Operator (ARSO) is an elaboration of the Ridge Regression by adjusting the variable selection. In ARSO the weighting is calculated using the Ordinary Least Squares (OLS) and the weighting formulation is still using conventional methods. Therefore, in this research, changes were made on the weighting technique with expectations the accuracy and ability of ARSO in selecting variables are increased. In this research, the weighting technique was done by conducting to use several feature selection methods such as the lasso, RReliefF and Correlation Feature Selection (CFS) and the weighting formulation was was done by normalizing the value of the selection feature’s results. Experiments were performed by indicating the case studies on the prediction of Indonesia's Gross Domestic Product (GDP) by utilizing Google Trends. This case research was chosen because the need for GDP information is indispensable for better policy planning. The current condition in estimating Indonesia's GDP using economic variables, the data is incomplete and the release frequency of economic variables varies are monthly and quarterly. Therefore, convenient data were really needed,, and also abundant and available in real-time, one of the solutions is to use Google Trends as a predictor variable of Indonesia's GDP predictions. The use of selection variables on Google trends is intended to extract the most significant variables from the large number of Google Trends data. One of the selections variable used is the lasso in which lasso penalizes variables that do not significantly take effect in Indonesia's GDP. In penalizing the variable lambda value (?), it greatly affects the results of the selection variable. The determination of the lambda value on the lasso usually uses cross-validation where the value of each ? in a certain range will be evaluated entirely and then the ? value which has the lowest error is taken. Determination of ? in this way is less efficient so that an algorithm is developed that determines the lambda value based on the number of variables selected from each iteration as well as changes in the lasso convergence criterion which is when there are variables selected iteration for the ? value is terminated. So it is expected that a faster search for ? could increase the execution time and its accuracy value. The result of this experiment was showed 14 scenarios which were a combination of the methods and features of the selection to be tested. The test results showed that from the 14 scenarios performed, the results showed that the ARSO method using normalized weighting is the method with the best predictive value with a RMSE value of 0.74 which when compared to ARSO using vi conventional weighting reaches 1.19. In addition, ARSO with normalized weighting succeeded in capturing the pattern of economic crises that occurred during pandemic in 2020. The results of normalized ARSO were obtained using the lasso selection feature as weighting where the ? value was obtained from the algorithm proposed in this study. When it’s compared with the cross-validation algorithm proposed conducting the search ? managed to outperform both the execution time and the final accuracy of the method, namely for the execution time reaching 0.047 seconds and the RMSE value at 0.74 when compared to cross-validation is 3.9 seconds and the value RMSE 1.16. In addition, from this research it can be seen that data from Google Trends is able to reflect events that are happening in Indonesia, in this study the search keywords that indirectly reflect that the state of the Indonesian economy is declining based on the best prediction methods are dana_blt and pandemic. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Adaptive Ridge Regression Selection Operator (ARSO) is an elaboration of the Ridge Regression by adjusting the variable selection. In ARSO the weighting is calculated using the Ordinary Least Squares (OLS) and the weighting formulation is still using conventional methods. Therefore, in this research, changes were made on the weighting technique with expectations the accuracy and ability of ARSO in selecting variables are increased. In this research, the weighting technique was done by conducting to use several feature selection methods such as the lasso, RReliefF and Correlation Feature Selection (CFS) and the weighting formulation was was done by normalizing the value of the selection feature’s results. Experiments were performed by indicating the case studies on the prediction of Indonesia's Gross Domestic Product (GDP) by utilizing Google Trends. This case research was chosen because the need for GDP information is indispensable for better policy planning. The current condition in estimating Indonesia's GDP using economic variables, the data is incomplete and the release frequency of economic variables varies are monthly and quarterly. Therefore, convenient data were really needed,, and also abundant and available in real-time, one of the solutions is to use Google Trends as a predictor variable of Indonesia's GDP predictions. The use of selection variables on Google trends is intended to extract the most significant variables from the large number of Google Trends data. One of the selections variable used is the lasso in which lasso penalizes variables that do not significantly take effect in Indonesia's GDP. In penalizing the variable lambda value (?), it greatly affects the results of the selection variable. The determination of the lambda value on the lasso usually uses cross-validation where the value of each ? in a certain range will be evaluated entirely and then the ? value which has the lowest error is taken. Determination of ? in this way is less efficient so that an algorithm is developed that determines the lambda value based on the number of variables selected from each iteration as well as changes in the lasso convergence criterion which is when there are variables selected iteration for the ? value is terminated. So it is expected that a faster search for ? could increase the execution time and its accuracy value. The result of this experiment was showed 14 scenarios which were a combination of the methods and features of the selection to be tested. The test results showed that from the 14 scenarios performed, the results showed that the ARSO method using normalized weighting is the method with the best predictive value with a RMSE value of 0.74 which when compared to ARSO using vi conventional weighting reaches 1.19. In addition, ARSO with normalized weighting succeeded in capturing the pattern of economic crises that occurred during pandemic in 2020. The results of normalized ARSO were obtained using the lasso selection feature as weighting where the ? value was obtained from the algorithm proposed in this study. When it’s compared with the cross-validation algorithm proposed conducting the search ? managed to outperform both the execution time and the final accuracy of the method, namely for the execution time reaching 0.047 seconds and the RMSE value at 0.74 when compared to cross-validation is 3.9 seconds and the value RMSE 1.16. In addition, from this research it can be seen that data from Google Trends is able to reflect events that are happening in Indonesia, in this study the search keywords that indirectly reflect that the state of the Indonesian economy is declining based on the best prediction methods are dana_blt and pandemic.
format Theses
author Suwardiman
spellingShingle Suwardiman
ADAPTIVE RIDGE REGRESSION-FS WITH WEIGHTED NORMALIZATION OF INDONESIA'S GDP GROWTH PREDICTION USING GOOGLE TRENDS
author_facet Suwardiman
author_sort Suwardiman
title ADAPTIVE RIDGE REGRESSION-FS WITH WEIGHTED NORMALIZATION OF INDONESIA'S GDP GROWTH PREDICTION USING GOOGLE TRENDS
title_short ADAPTIVE RIDGE REGRESSION-FS WITH WEIGHTED NORMALIZATION OF INDONESIA'S GDP GROWTH PREDICTION USING GOOGLE TRENDS
title_full ADAPTIVE RIDGE REGRESSION-FS WITH WEIGHTED NORMALIZATION OF INDONESIA'S GDP GROWTH PREDICTION USING GOOGLE TRENDS
title_fullStr ADAPTIVE RIDGE REGRESSION-FS WITH WEIGHTED NORMALIZATION OF INDONESIA'S GDP GROWTH PREDICTION USING GOOGLE TRENDS
title_full_unstemmed ADAPTIVE RIDGE REGRESSION-FS WITH WEIGHTED NORMALIZATION OF INDONESIA'S GDP GROWTH PREDICTION USING GOOGLE TRENDS
title_sort adaptive ridge regression-fs with weighted normalization of indonesia's gdp growth prediction using google trends
url https://digilib.itb.ac.id/gdl/view/55677
_version_ 1822929972846657536