APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MENâS BASKETBALL TOURNAMENT
NCAA Men’s Basketball Tournament is a competition that bring the best men basketball university team in United States to compete each year. This tournament use a knockout format which mean that one lose can shatter the dream to win the championship, even the favorite one. For this reason each team m...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/49831 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:49831 |
---|---|
spelling |
id-itb.:498312020-09-21T08:29:54ZAPPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MENâS BASKETBALL TOURNAMENT Susanto, Marcello Indonesia Final Project tournament, lasso, feature importance, generalized linear model, gradient boosting machine, logloss, cross-validation INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/49831 NCAA Men’s Basketball Tournament is a competition that bring the best men basketball university team in United States to compete each year. This tournament use a knockout format which mean that one lose can shatter the dream to win the championship, even the favorite one. For this reason each team must give their best effort on every match, which makes this competition even more competitive than regular season. The favorite team lose in the first match, the underdog can go deep to even win the tournament, all these scenario is very possible. This final paper will explore the ability of predictive model such as generalized linear model (GLM) and gradient boosting machine (GBM), and also find the most predictive variable using method such as GBM feature importance and GLM LASSO. The data use in this final paper consist of Las Vegas odds, Kenpom possession-based metrics, team statistics and team quality dataset which popularized by Kaggle user Darius Barusauskas (radar). Based on the logloss of the test dataset it can be concluded that GBM outperform GLM by small margin. The best model which have the lowest test logloss is GBM trained using regular season matchup on Las Vegas odds dataset that achieve test logloss of 0.52485 text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
NCAA Men’s Basketball Tournament is a competition that bring the best men basketball university team in United States to compete each year. This tournament use a knockout format which mean that one lose can shatter the dream to win the championship, even the favorite one. For this reason each team must give their best effort on every match, which makes this competition even more competitive than regular season. The favorite team lose in the first match, the underdog can go deep to even win the tournament, all these scenario is very possible. This final paper will explore the ability of predictive model such as generalized linear model (GLM) and gradient boosting machine (GBM), and also find the most predictive variable using method such as GBM feature importance and GLM LASSO. The data use in this final paper consist of Las Vegas odds, Kenpom possession-based metrics, team statistics and team quality dataset which popularized by Kaggle user Darius Barusauskas (radar). Based on the logloss of the test dataset it can be concluded that GBM outperform GLM by small margin. The best model which have the lowest test logloss is GBM trained using regular season matchup on Las Vegas odds dataset that achieve test logloss of 0.52485 |
format |
Final Project |
author |
Susanto, Marcello |
spellingShingle |
Susanto, Marcello APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MENâS BASKETBALL TOURNAMENT |
author_facet |
Susanto, Marcello |
author_sort |
Susanto, Marcello |
title |
APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MENâS BASKETBALL TOURNAMENT |
title_short |
APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MENâS BASKETBALL TOURNAMENT |
title_full |
APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MENâS BASKETBALL TOURNAMENT |
title_fullStr |
APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MENâS BASKETBALL TOURNAMENT |
title_full_unstemmed |
APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MENâS BASKETBALL TOURNAMENT |
title_sort |
application of generalized linear model and gradient boosting machine to predict match outcome on ncaa menâs basketball tournament |
url |
https://digilib.itb.ac.id/gdl/view/49831 |
_version_ |
1822928278611034112 |