Recommender system for events with hybrid filtering and ensemble machine learning

This work aims to build a recommender system for local events that uses hybrid information filtering methods and ensemble machine learning. The motivation behind the project is the lack of a universal event aggregator that recommends personalized local events to its users. The sources currently used...

Full description

Saved in:
Bibliographic Details
Main Author: Malaviya, Chaitanya
Other Authors: Kong Wai-Kin Adams
Format: Final Year Project
Language:English
Published: 2016
Subjects:
Online Access:http://hdl.handle.net/10356/66955
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-66955
record_format dspace
spelling sg-ntu-dr.10356-669552023-03-03T20:27:04Z Recommender system for events with hybrid filtering and ensemble machine learning Malaviya, Chaitanya Kong Wai-Kin Adams School of Computer Engineering DRNTU::Engineering This work aims to build a recommender system for local events that uses hybrid information filtering methods and ensemble machine learning. The motivation behind the project is the lack of a universal event aggregator that recommends personalized local events to its users. The sources currently used by people to find local events do not cater to their individual preferences and sometimes they contain incomplete details. The objective through the project is to make it easier to discover local events for a user. The data for events was scraped from the platform of Facebook Events, and included around 1.5 million events and 38000 users. This data was stored in a database organized into tables. The data was then preprocessed: missing values were inferred, irrelevant data was removed, data was subsetted to use a smaller training set, and values were normalized. We then performed model-based information filtering to calculate similarity metrics, including user-user collaborative, event-event collaborative, content-based and demographic filtering. Next, we performed feature engineering by building a set of features from the above data and calculating their values from our preprocessed data. We used our feature vector and our response variable to fit three different classifiers and evaluated the cross-validation accuracy for each. The classifiers used were: logistic regression, decision tree and random forests. After performing an 11-fold cross validation for each classifier, random forests gave the best performance with a mean accuracy of 90.1% followed by decision trees and logistic regression. We investigated the reasons why random forests classifier was successful in building an improved model from our training data. We also attribute the good performance of the recommender system to its architecture, that calculates the similarity metrics and feature values first, and then performs machine learning. Finally, we suggested some improvements for the recommender system, which include using more sophisticated natural language processing techniques to extract important keywords from event descriptions and using an ensemble of ensemble classifiers to improve accuracy. Bachelor of Engineering (Computer Engineering) 2016-05-06T07:10:40Z 2016-05-06T07:10:40Z 2016 Final Year Project (FYP) http://hdl.handle.net/10356/66955 en Nanyang Technological University 58 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering
spellingShingle DRNTU::Engineering
Malaviya, Chaitanya
Recommender system for events with hybrid filtering and ensemble machine learning
description This work aims to build a recommender system for local events that uses hybrid information filtering methods and ensemble machine learning. The motivation behind the project is the lack of a universal event aggregator that recommends personalized local events to its users. The sources currently used by people to find local events do not cater to their individual preferences and sometimes they contain incomplete details. The objective through the project is to make it easier to discover local events for a user. The data for events was scraped from the platform of Facebook Events, and included around 1.5 million events and 38000 users. This data was stored in a database organized into tables. The data was then preprocessed: missing values were inferred, irrelevant data was removed, data was subsetted to use a smaller training set, and values were normalized. We then performed model-based information filtering to calculate similarity metrics, including user-user collaborative, event-event collaborative, content-based and demographic filtering. Next, we performed feature engineering by building a set of features from the above data and calculating their values from our preprocessed data. We used our feature vector and our response variable to fit three different classifiers and evaluated the cross-validation accuracy for each. The classifiers used were: logistic regression, decision tree and random forests. After performing an 11-fold cross validation for each classifier, random forests gave the best performance with a mean accuracy of 90.1% followed by decision trees and logistic regression. We investigated the reasons why random forests classifier was successful in building an improved model from our training data. We also attribute the good performance of the recommender system to its architecture, that calculates the similarity metrics and feature values first, and then performs machine learning. Finally, we suggested some improvements for the recommender system, which include using more sophisticated natural language processing techniques to extract important keywords from event descriptions and using an ensemble of ensemble classifiers to improve accuracy.
author2 Kong Wai-Kin Adams
author_facet Kong Wai-Kin Adams
Malaviya, Chaitanya
format Final Year Project
author Malaviya, Chaitanya
author_sort Malaviya, Chaitanya
title Recommender system for events with hybrid filtering and ensemble machine learning
title_short Recommender system for events with hybrid filtering and ensemble machine learning
title_full Recommender system for events with hybrid filtering and ensemble machine learning
title_fullStr Recommender system for events with hybrid filtering and ensemble machine learning
title_full_unstemmed Recommender system for events with hybrid filtering and ensemble machine learning
title_sort recommender system for events with hybrid filtering and ensemble machine learning
publishDate 2016
url http://hdl.handle.net/10356/66955
_version_ 1759856657224957952