Predictive Modelling of Binary Classification Task on Imbalanced Data Using Supervised Learning Algorithm (Case Study: The House of Representatives of the Republic of Indonesia Member Electability in Election)

General election to select members of the House of Representatives (DPR) in Indonesia is done once in every 5 years. In every election, only a few of candidates nominated by political parties is elected and serves as legislative member in the House of Representatives. Ananda, Arifin & Suryadinat...

Full description

Saved in:
Bibliographic Details
Main Author: Yudha Pranata, Anugrah
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/43458
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:43458
spelling id-itb.:434582019-09-27T09:43:07ZPredictive Modelling of Binary Classification Task on Imbalanced Data Using Supervised Learning Algorithm (Case Study: The House of Representatives of the Republic of Indonesia Member Electability in Election) Yudha Pranata, Anugrah Indonesia Final Project election, The House of Representatives (DPR), imbalanced data, binary classification prediction modelling, supervised learning algorithm INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/43458 General election to select members of the House of Representatives (DPR) in Indonesia is done once in every 5 years. In every election, only a few of candidates nominated by political parties is elected and serves as legislative member in the House of Representatives. Ananda, Arifin & Suryadinata (2005) documented that there were 7,756 candidates for 550 seats in the House of Representatives in 2004. Then, General Election Comission documented that there were 11,301 candidates for 560 seats in the House of Representatives in 2009, 6,606 candidates for 560 seats in the House of Representatives in 2014 and 7,968 candidates for 575 seats in the House of Representatives in 2019. Since the early beginning of the election process for every election period, there are many people/institutions trying to predict the election results, either using historical data or state affairs at the time. The condition where proportion of classes that are not equal in predictive modelling e.g. in this case of Indonesian legislative election is usually called as imbalanced data. One method that can be used to model prediction of election results is learning algorithm or usually called as machine learning. This research’s goal is to create predictive modelling for the House of Representatives members’ electability with using supervised learning algorithm. Predictive models are created using data of factors influencing election candidates’ electability from previous research about estimation model in case study of legislative election in 2004 with methods Ordinary Least Square (OLS) and Logistic Regression by Darawijaya (2014). This research is improving previous model with adding Neural Network implementation based on research reference of predictive modelling for case of president election in United States by Zolghadr, Niaki, & Niaki (2018). This predictive modelling of election candidates’ electability is created for binary classification task because there are only 2 classes in the output variable: ‘Elected’ or ‘Not Elected’. To evaluate the results of created predictive models, metrics that are used are accuracy and Receiver Operating Characteristic (ROC)/Area Under Curve (AUC). Result of this research is the predictive model created that gives the best value for all evaluation metrics is Neural Network model with data of combined candidates’ public personal information and campaign fund which results accuracy of 91.48% and AUC of 91.59%. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description General election to select members of the House of Representatives (DPR) in Indonesia is done once in every 5 years. In every election, only a few of candidates nominated by political parties is elected and serves as legislative member in the House of Representatives. Ananda, Arifin & Suryadinata (2005) documented that there were 7,756 candidates for 550 seats in the House of Representatives in 2004. Then, General Election Comission documented that there were 11,301 candidates for 560 seats in the House of Representatives in 2009, 6,606 candidates for 560 seats in the House of Representatives in 2014 and 7,968 candidates for 575 seats in the House of Representatives in 2019. Since the early beginning of the election process for every election period, there are many people/institutions trying to predict the election results, either using historical data or state affairs at the time. The condition where proportion of classes that are not equal in predictive modelling e.g. in this case of Indonesian legislative election is usually called as imbalanced data. One method that can be used to model prediction of election results is learning algorithm or usually called as machine learning. This research’s goal is to create predictive modelling for the House of Representatives members’ electability with using supervised learning algorithm. Predictive models are created using data of factors influencing election candidates’ electability from previous research about estimation model in case study of legislative election in 2004 with methods Ordinary Least Square (OLS) and Logistic Regression by Darawijaya (2014). This research is improving previous model with adding Neural Network implementation based on research reference of predictive modelling for case of president election in United States by Zolghadr, Niaki, & Niaki (2018). This predictive modelling of election candidates’ electability is created for binary classification task because there are only 2 classes in the output variable: ‘Elected’ or ‘Not Elected’. To evaluate the results of created predictive models, metrics that are used are accuracy and Receiver Operating Characteristic (ROC)/Area Under Curve (AUC). Result of this research is the predictive model created that gives the best value for all evaluation metrics is Neural Network model with data of combined candidates’ public personal information and campaign fund which results accuracy of 91.48% and AUC of 91.59%.
format Final Project
author Yudha Pranata, Anugrah
spellingShingle Yudha Pranata, Anugrah
Predictive Modelling of Binary Classification Task on Imbalanced Data Using Supervised Learning Algorithm (Case Study: The House of Representatives of the Republic of Indonesia Member Electability in Election)
author_facet Yudha Pranata, Anugrah
author_sort Yudha Pranata, Anugrah
title Predictive Modelling of Binary Classification Task on Imbalanced Data Using Supervised Learning Algorithm (Case Study: The House of Representatives of the Republic of Indonesia Member Electability in Election)
title_short Predictive Modelling of Binary Classification Task on Imbalanced Data Using Supervised Learning Algorithm (Case Study: The House of Representatives of the Republic of Indonesia Member Electability in Election)
title_full Predictive Modelling of Binary Classification Task on Imbalanced Data Using Supervised Learning Algorithm (Case Study: The House of Representatives of the Republic of Indonesia Member Electability in Election)
title_fullStr Predictive Modelling of Binary Classification Task on Imbalanced Data Using Supervised Learning Algorithm (Case Study: The House of Representatives of the Republic of Indonesia Member Electability in Election)
title_full_unstemmed Predictive Modelling of Binary Classification Task on Imbalanced Data Using Supervised Learning Algorithm (Case Study: The House of Representatives of the Republic of Indonesia Member Electability in Election)
title_sort predictive modelling of binary classification task on imbalanced data using supervised learning algorithm (case study: the house of representatives of the republic of indonesia member electability in election)
url https://digilib.itb.ac.id/gdl/view/43458
_version_ 1822926583868948480