EMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM
A person could listen to different music when sad or happy. Listening to favorite music stimulates the brain to release dopamine hormone to the corpus striatum, which manages human feelings such as addiction, satisfaction, and motivation. So human emotion could be an opportunity to enhance the mu...
Main Author: | Surya Angkasa, Hengky |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/65784 |
Institution: | Institut Teknologi Bandung |
id |
id-itb.:65784 |
---|---|
spelling |
id-itb.:65784 2022-06-24T16:06:20Z EMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM Surya Angkasa, Hengky Indonesia Final Project classification, emotion, ConvXGB, CNN, music recommendation system, incomplete cold start INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/65784 text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
A person may listen to different music when sad than when happy. Listening to favorite
music stimulates the brain to release dopamine in the corpus striatum, which governs
human feelings such as addiction, satisfaction, and motivation. Human emotion is
therefore an opportunity to enhance a music recommendation system.
Gilda et al. (2017) and Krupa et al. (2020) used a Convolutional Neural Network
(CNN) to classify emotion from face images with the FER2013 dataset. The
performance of Krupa’s CNN is not yet good and is uneven across emotions. The
performance of Gilda’s CNN is good, but it uses many convolution layers (9
layers) with a large number of filters (256). In a recommender system, an incomplete
cold start can occur when a user has few ratings, so the system needs extra
information to give better recommendations.
Thongsuwan et al. (2021) designed ConvXGB, in which two convolution layers extract
features from the input and XGBoost performs the learning task. The model performed
better than CNN on the DrivFace dataset, so ConvXGB was implemented in this
research for emotion classification. Emotion is mapped to the mood attribute in
the music dataset (Emotify). The mood serves as additional information about the user
for solving the incomplete cold start. The system recommends music that a user liked
in a particular mood, sorted in descending order. The recall metric is used to evaluate
the recommendations.
In emotion classification, ConvXGB with 128 filters and one max-pooling layer performed
better than the two CNNs on the oversampled dataset. ConvXGB achieved
78.64% accuracy on 4 emotions and 80.99% on 7 emotions. The evaluation for the
incomplete cold start uses 10% to 90% of each user’s ratings in a mood as training
data. The average recall results indicate that the incomplete cold start can be solved
with mood information using 10% of each user’s ratings. The system tends to perform
better as the amount of user rating data increases. |
format |
Final Project |
author |
Surya Angkasa, Hengky |
title |
EMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM |
url |
https://digilib.itb.ac.id/gdl/view/65784 |
_version_ |
1822932851072434176 |
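
The description above says that songs liked in a particular mood are recommended in descending order and that the result is scored with recall. The sketch below illustrates that idea only; it is not the thesis implementation. The record layout `(user, song, mood, liked)`, the function names `recommend_for_mood` and `recall`, ranking by like count across users, and the toy data are all assumptions made for illustration, since the abstract does not give the exact ranking key or data schema.

```python
from collections import Counter


def recommend_for_mood(train_ratings, mood, top_k=5):
    """Rank songs by how many 'liked' ratings they received in the given mood.

    Assumption: ranking by like count is illustrative; the abstract only says
    the recommendations are sorted in descending order.
    """
    likes = Counter(
        song
        for (_user, song, song_mood, liked) in train_ratings
        if song_mood == mood and liked
    )
    # Sort by like count descending, then by song id for a deterministic order.
    ranked = sorted(likes.items(), key=lambda item: (-item[1], item[0]))
    return [song for song, _count in ranked[:top_k]]


def recall(recommended, relevant):
    """Fraction of relevant items that appear in the recommendation list."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    return len(relevant & set(recommended)) / len(relevant)


# Hypothetical toy data: (user, song, mood, liked)
train_ratings = [
    ("u1", "s1", "happy", True),
    ("u2", "s1", "happy", True),
    ("u2", "s2", "happy", True),
    ("u3", "s3", "happy", False),
    ("u1", "s4", "sad", True),
]
# Songs a user liked in the "happy" mood that were held out of training.
held_out_liked = ["s1", "s5"]

recommended = recommend_for_mood(train_ratings, "happy", top_k=2)
score = recall(recommended, held_out_liked)
print(recommended, score)
```

In the evaluation described above, a loop of this kind would be repeated with 10% to 90% of each user's ratings in a mood used as training data, and the recall values averaged across users.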