APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MEN’S BASKETBALL TOURNAMENT

NCAA Men’s Basketball Tournament is a competition that bring the best men basketball university team in United States to compete each year. This tournament use a knockout format which mean that one lose can shatter the dream to win the championship, even the favorite one. For this reason each team m...

Full description

Saved in:
Bibliographic Details
Main Author: Susanto, Marcello
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/49831
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:49831
spelling id-itb.:498312020-09-21T08:29:54ZAPPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MEN’S BASKETBALL TOURNAMENT Susanto, Marcello Indonesia Final Project tournament, lasso, feature importance, generalized linear model, gradient boosting machine, logloss, cross-validation INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/49831 NCAA Men’s Basketball Tournament is a competition that bring the best men basketball university team in United States to compete each year. This tournament use a knockout format which mean that one lose can shatter the dream to win the championship, even the favorite one. For this reason each team must give their best effort on every match, which makes this competition even more competitive than regular season. The favorite team lose in the first match, the underdog can go deep to even win the tournament, all these scenario is very possible. This final paper will explore the ability of predictive model such as generalized linear model (GLM) and gradient boosting machine (GBM), and also find the most predictive variable using method such as GBM feature importance and GLM LASSO. The data use in this final paper consist of Las Vegas odds, Kenpom possession-based metrics, team statistics and team quality dataset which popularized by Kaggle user Darius Barusauskas (radar). Based on the logloss of the test dataset it can be concluded that GBM outperform GLM by small margin. The best model which have the lowest test logloss is GBM trained using regular season matchup on Las Vegas odds dataset that achieve test logloss of 0.52485 text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description NCAA Men’s Basketball Tournament is a competition that bring the best men basketball university team in United States to compete each year. This tournament use a knockout format which mean that one lose can shatter the dream to win the championship, even the favorite one. For this reason each team must give their best effort on every match, which makes this competition even more competitive than regular season. The favorite team lose in the first match, the underdog can go deep to even win the tournament, all these scenario is very possible. This final paper will explore the ability of predictive model such as generalized linear model (GLM) and gradient boosting machine (GBM), and also find the most predictive variable using method such as GBM feature importance and GLM LASSO. The data use in this final paper consist of Las Vegas odds, Kenpom possession-based metrics, team statistics and team quality dataset which popularized by Kaggle user Darius Barusauskas (radar). Based on the logloss of the test dataset it can be concluded that GBM outperform GLM by small margin. The best model which have the lowest test logloss is GBM trained using regular season matchup on Las Vegas odds dataset that achieve test logloss of 0.52485
format Final Project
author Susanto, Marcello
spellingShingle Susanto, Marcello
APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MEN’S BASKETBALL TOURNAMENT
author_facet Susanto, Marcello
author_sort Susanto, Marcello
title APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MEN’S BASKETBALL TOURNAMENT
title_short APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MEN’S BASKETBALL TOURNAMENT
title_full APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MEN’S BASKETBALL TOURNAMENT
title_fullStr APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MEN’S BASKETBALL TOURNAMENT
title_full_unstemmed APPLICATION OF GENERALIZED LINEAR MODEL AND GRADIENT BOOSTING MACHINE TO PREDICT MATCH OUTCOME ON NCAA MEN’S BASKETBALL TOURNAMENT
title_sort application of generalized linear model and gradient boosting machine to predict match outcome on ncaa men’s basketball tournament
url https://digilib.itb.ac.id/gdl/view/49831
_version_ 1822928278611034112