EMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM

A person could listen to different music when sad or happy. Listening to favorite music stimulates the brain to release dopamine hormone to the corpus striatum, which manages human feelings such as addiction, satisfaction, and motivation. So human emotion could be an opportunity to enhance the mu...

Full description

Saved in:

Bibliographic Details
Main Author:	Surya Angkasa, Hengky
Format:	Final Project
Language:	Indonesia
Online Access:	https://digilib.itb.ac.id/gdl/view/65784
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Institut Teknologi Bandung
Language:	Indonesia

id	id-itb.:65784
spelling	id-itb.:657842022-06-24T16:06:20ZEMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM Surya Angkasa, Hengky Indonesia Final Project classification, emotion, ConvXGB, CNN, music recommendation system, incomplete cold start INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/65784 A person could listen to different music when sad or happy. Listening to favorite music stimulates the brain to release dopamine hormone to the corpus striatum, which manages human feelings such as addiction, satisfaction, and motivation. So human emotion could be an opportunity to enhance the music recommendation system. Gilda et al. (2017) and Krupa et al. (2020) used Convolutional Neural Network (CNN) to classify emotion from face image with FER2013 dataset. The performance of Krupa’s CNN is not good yet and unequally to each emotion. While performance of Gilda’s CNN is good, but the convolution layers quite a lot (9 layers) with big filters (256). In a recommender system, the incomplete cold start could happen if a user has lack of rating. So the system needs extra information to give better recommendations. Thongsuwan et al. (2021) designed ConvXGB. Two convolution layers are used for extracting input and XGBoost does the learning task. The model has better performance than CNN in DrivFace dataset. So ConvXGB was implemented in this research for emotion classification. Emotion is mapped with the mood attribute in the music dataset (Emotify). The mood is the additional information of the user for solving incomplete cold start. The system recommends musics that are liked in particular mood by a user and sorted in descending. Recall metric is used to evaluate the recommendation. The performance of ConvXGB in emotion classification with 128 filters and a maxpooling is better than the two CNN in the oversampled dataset. ConvXGB gained 78.64% accuracy on 4 emotions and 80.99% on 7 emotions. Evaluation for incomplete cold start is using 10% to 90% of each user’s ratings in a mood as train data. From the average recall result, incomplete cold start can be solved with mood information using 10% of each user’s ratings. The system tends to have better performance with the increment of user rating data. text
institution	Institut Teknologi Bandung
building	Institut Teknologi Bandung Library
continent	Asia
country	Indonesia Indonesia
content_provider	Institut Teknologi Bandung
collection	Digital ITB
language	Indonesia
description	A person could listen to different music when sad or happy. Listening to favorite music stimulates the brain to release dopamine hormone to the corpus striatum, which manages human feelings such as addiction, satisfaction, and motivation. So human emotion could be an opportunity to enhance the music recommendation system. Gilda et al. (2017) and Krupa et al. (2020) used Convolutional Neural Network (CNN) to classify emotion from face image with FER2013 dataset. The performance of Krupa’s CNN is not good yet and unequally to each emotion. While performance of Gilda’s CNN is good, but the convolution layers quite a lot (9 layers) with big filters (256). In a recommender system, the incomplete cold start could happen if a user has lack of rating. So the system needs extra information to give better recommendations. Thongsuwan et al. (2021) designed ConvXGB. Two convolution layers are used for extracting input and XGBoost does the learning task. The model has better performance than CNN in DrivFace dataset. So ConvXGB was implemented in this research for emotion classification. Emotion is mapped with the mood attribute in the music dataset (Emotify). The mood is the additional information of the user for solving incomplete cold start. The system recommends musics that are liked in particular mood by a user and sorted in descending. Recall metric is used to evaluate the recommendation. The performance of ConvXGB in emotion classification with 128 filters and a maxpooling is better than the two CNN in the oversampled dataset. ConvXGB gained 78.64% accuracy on 4 emotions and 80.99% on 7 emotions. Evaluation for incomplete cold start is using 10% to 90% of each user’s ratings in a mood as train data. From the average recall result, incomplete cold start can be solved with mood information using 10% of each user’s ratings. The system tends to have better performance with the increment of user rating data.
format	Final Project
author	Surya Angkasa, Hengky
spellingShingle	Surya Angkasa, Hengky EMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM
author_facet	Surya Angkasa, Hengky
author_sort	Surya Angkasa, Hengky
title	EMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM
title_short	EMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM
title_full	EMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM
title_fullStr	EMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM
title_full_unstemmed	EMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM
title_sort	emotion classification of user face image in music recommendation system
url	https://digilib.itb.ac.id/gdl/view/65784
_version_	1822932851072434176

EMOTION CLASSIFICATION OF USER FACE IMAGE IN MUSIC RECOMMENDATION SYSTEM

Similar Items