EMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM

Bibliographic Details
Main Author: Surya Angkasa, Hengky
Format: Final Project
Language: Indonesian
Online Access: https://digilib.itb.ac.id/gdl/view/65784
Institution: Institut Teknologi Bandung
id id-itb.:65784
spelling id-itb.:65784 2022-06-24T16:06:20Z
keywords classification, emotion, ConvXGB, CNN, music recommendation system, incomplete cold start
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
description People may listen to different music when they are sad than when they are happy. Listening to favorite music stimulates the brain to release dopamine into the corpus striatum, which regulates feelings such as addiction, satisfaction, and motivation. Human emotion is therefore an opportunity to improve music recommendation systems. Gilda et al. (2017) and Krupa et al. (2020) used Convolutional Neural Networks (CNNs) to classify emotion from face images in the FER2013 dataset. Krupa's CNN does not yet perform well, and its performance is uneven across emotions. Gilda's CNN performs well but uses many convolution layers (9) with a large number of filters (256). In a recommender system, an incomplete cold start occurs when a user has too few ratings, so the system needs additional information to give better recommendations. Thongsuwan et al. (2021) designed ConvXGB, in which two convolution layers extract features from the input and XGBoost performs the learning task; the model outperformed a CNN on the DrivFace dataset. ConvXGB was therefore implemented in this research for emotion classification. The classified emotion is mapped to the mood attribute in the music dataset (Emotify), and the mood serves as the additional user information for addressing the incomplete cold start. The system recommends music that is liked in a particular mood by a user, sorted in descending order. Recall is used to evaluate the recommendations. For emotion classification on the oversampled dataset, ConvXGB with 128 filters and one max-pooling layer outperforms the two CNNs. ConvXGB achieved 78.64% accuracy on 4 emotions and 80.99% on 7 emotions. The incomplete cold start was evaluated using 10% to 90% of each user's ratings in a mood as training data. Based on the average recall, the incomplete cold start can be addressed with mood information using only 10% of each user's ratings. The system tends to perform better as the amount of user rating data increases.
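The mood-based recommendation and recall evaluation described above can be illustrated with a minimal Python sketch. It is not the thesis implementation: the emotion-to-mood mapping, the toy ratings, and the function names `recommend` and `recall_at_k` are all hypothetical, and ranking tracks by mean rating within a mood is only one plausible reading of "liked in a particular mood, sorted in descending order".

```python
from collections import defaultdict

# Hypothetical emotion-to-mood mapping; the abstract does not give the real one.
EMOTION_TO_MOOD = {"happy": "joy", "sad": "sadness"}

# Toy ratings as (user, mood, track, rating). All values are invented.
RATINGS = [
    ("u1", "joy", "t1", 5), ("u1", "joy", "t2", 4), ("u2", "joy", "t2", 5),
    ("u2", "joy", "t3", 3), ("u2", "sadness", "t4", 4), ("u1", "sadness", "t5", 2),
]


def recommend(ratings, emotion, k=3):
    """Return up to k tracks for the mood mapped from `emotion`.

    Assumption: tracks are scored by their mean rating within that mood.
    """
    mood = EMOTION_TO_MOOD[emotion]
    by_track = defaultdict(list)
    for _user, rated_mood, track, rating in ratings:
        if rated_mood == mood:
            by_track[track].append(rating)
    means = {track: sum(r) / len(r) for track, r in by_track.items()}
    # Sort by mean rating, descending; ties are broken by track id.
    ranked = sorted(means, key=lambda track: (-means[track], track))
    return ranked[:k]


def recall_at_k(recommended, relevant):
    """Fraction of the relevant (held-out, liked) tracks that were recommended."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    return len(set(recommended) & relevant) / len(relevant)


if __name__ == "__main__":
    top = recommend(RATINGS, "happy", k=2)
    print(top, recall_at_k(top, {"t2", "t3"}))
```

In an evaluation like the one in the abstract, `ratings` would hold only the training share (10% to 90%) of each user's ratings in a mood, and `relevant` would be the tracks that user liked in the held-out remainder.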