Recommender system for events with hybrid filtering and ensemble machine learning
This work aims to build a recommender system for local events that uses hybrid information filtering methods and ensemble machine learning. The motivation behind the project is the lack of a universal event aggregator that recommends personalized local events to its users. The sources currently used...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2016
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/66955 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-66955 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-669552023-03-03T20:27:04Z Recommender system for events with hybrid filtering and ensemble machine learning Malaviya, Chaitanya Kong Wai-Kin Adams School of Computer Engineering DRNTU::Engineering This work aims to build a recommender system for local events that uses hybrid information filtering methods and ensemble machine learning. The motivation behind the project is the lack of a universal event aggregator that recommends personalized local events to its users. The sources currently used by people to find local events do not cater to their individual preferences and sometimes they contain incomplete details. The objective through the project is to make it easier to discover local events for a user. The data for events was scraped from the platform of Facebook Events, and included around 1.5 million events and 38000 users. This data was stored in a database organized into tables. The data was then preprocessed: missing values were inferred, irrelevant data was removed, data was subsetted to use a smaller training set, and values were normalized. We then performed model-based information filtering to calculate similarity metrics, including user-user collaborative, event-event collaborative, content-based and demographic filtering. Next, we performed feature engineering by building a set of features from the above data and calculating their values from our preprocessed data. We used our feature vector and our response variable to fit three different classifiers and evaluated the cross-validation accuracy for each. The classifiers used were: logistic regression, decision tree and random forests. After performing an 11-fold cross validation for each classifier, random forests gave the best performance with a mean accuracy of 90.1% followed by decision trees and logistic regression. We investigated the reasons why random forests classifier was successful in building an improved model from our training data. We also attribute the good performance of the recommender system to its architecture, that calculates the similarity metrics and feature values first, and then performs machine learning. Finally, we suggested some improvements for the recommender system, which include using more sophisticated natural language processing techniques to extract important keywords from event descriptions and using an ensemble of ensemble classifiers to improve accuracy. Bachelor of Engineering (Computer Engineering) 2016-05-06T07:10:40Z 2016-05-06T07:10:40Z 2016 Final Year Project (FYP) http://hdl.handle.net/10356/66955 en Nanyang Technological University 58 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering |
spellingShingle |
DRNTU::Engineering Malaviya, Chaitanya Recommender system for events with hybrid filtering and ensemble machine learning |
description |
This work aims to build a recommender system for local events that uses hybrid information filtering methods and ensemble machine learning. The motivation behind the project is the lack of a universal event aggregator that recommends personalized local events to its users. The sources currently used by people to find local events do not cater to their individual preferences and sometimes they contain incomplete details. The objective through the project is to make it easier to discover local events for a user. The data for events was scraped from the platform of Facebook Events, and included around 1.5 million events and 38000 users. This data was stored in a database organized into tables. The data was then preprocessed: missing values were inferred, irrelevant data was removed, data was subsetted to use a smaller training set, and values were normalized. We then performed model-based information filtering to calculate similarity metrics, including user-user collaborative, event-event collaborative, content-based and demographic filtering. Next, we performed feature engineering by building a set of features from the above data and calculating their values from our preprocessed data. We used our feature vector and our response variable to fit three different classifiers and evaluated the cross-validation accuracy for each. The classifiers used were: logistic regression, decision tree and random forests. After performing an 11-fold cross validation for each classifier, random forests gave the best performance with a mean accuracy of 90.1% followed by decision trees and logistic regression. We investigated the reasons why random forests classifier was successful in building an improved model from our training data. We also attribute the good performance of the recommender system to its architecture, that calculates the similarity metrics and feature values first, and then performs machine learning. Finally, we suggested some improvements for the recommender system, which include using more sophisticated natural language processing techniques to extract important keywords from event descriptions and using an ensemble of ensemble classifiers to improve accuracy. |
author2 |
Kong Wai-Kin Adams |
author_facet |
Kong Wai-Kin Adams Malaviya, Chaitanya |
format |
Final Year Project |
author |
Malaviya, Chaitanya |
author_sort |
Malaviya, Chaitanya |
title |
Recommender system for events with hybrid filtering and ensemble machine learning |
title_short |
Recommender system for events with hybrid filtering and ensemble machine learning |
title_full |
Recommender system for events with hybrid filtering and ensemble machine learning |
title_fullStr |
Recommender system for events with hybrid filtering and ensemble machine learning |
title_full_unstemmed |
Recommender system for events with hybrid filtering and ensemble machine learning |
title_sort |
recommender system for events with hybrid filtering and ensemble machine learning |
publishDate |
2016 |
url |
http://hdl.handle.net/10356/66955 |
_version_ |
1759856657224957952 |