Facial expression recognition using deep learning

Recognizing facial expressions is one of the fundamental computer vision applications. Many prior esearch studies have been conducted for more robust recognition performance. With the success of Vision Transformer (ViT) in many other areas, we found it remains challenging to apply it in the task of...

Full description

Saved in:

Bibliographic Details
Main Author:	Wang, Xiao Yi
Other Authors:	Yap Kim Hui
Format:	Thesis-Master by Coursework
Language:	English
Published:	Nanyang Technological University 2024
Subjects:	Computer and Information Science Deep learning Computer vision
Online Access:	https://hdl.handle.net/10356/178999
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-178999
record_format	dspace
spelling	sg-ntu-dr.10356-1789992024-07-19T15:43:38Z Facial expression recognition using deep learning Wang, Xiao Yi Yap Kim Hui School of Electrical and Electronic Engineering EKHYap@ntu.edu.sg Computer and Information Science Deep learning Computer vision Recognizing facial expressions is one of the fundamental computer vision applications. Many prior esearch studies have been conducted for more robust recognition performance. With the success of Vision Transformer (ViT) in many other areas, we found it remains challenging to apply it in the task of facial expression recognition. It limits the development of this research area due to the issue that the existing dataset is insufficient for the requirement of training a robust Vision Transformer. In this dissertation, in order to train a high-performance Vision Transformer for the facial expression recognition problem, three existing public datasets are merged into a new standard dataset with unified samples, and the sample size under each label reaches 20,000 by using data augmentation and other methods. We also implement a Vision Transformer and it is trained on our augmented dataset. Under the same parameter setting, we compare ViT with the other four baseline models and demonstrate its superiority. The optimal ViT configuration parameters are obtained by analyzing and comparing the training statistics with different configurations on our dataset and the testing results in a noisy test set. In addition, a real-time facial expression recognition prototype using the web camera and Single Shot Multibox Detector (SSD) face detection module is implemented for real-world evaluation. Master's degree 2024-07-15T09:00:53Z 2024-07-15T09:00:53Z 2024 Thesis-Master by Coursework Wang, X. Y. (2024). Facial expression recognition using deep learning. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/178999 https://hdl.handle.net/10356/178999 en application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Computer and Information Science Deep learning Computer vision
spellingShingle	Computer and Information Science Deep learning Computer vision Wang, Xiao Yi Facial expression recognition using deep learning
description	Recognizing facial expressions is one of the fundamental computer vision applications. Many prior esearch studies have been conducted for more robust recognition performance. With the success of Vision Transformer (ViT) in many other areas, we found it remains challenging to apply it in the task of facial expression recognition. It limits the development of this research area due to the issue that the existing dataset is insufficient for the requirement of training a robust Vision Transformer. In this dissertation, in order to train a high-performance Vision Transformer for the facial expression recognition problem, three existing public datasets are merged into a new standard dataset with unified samples, and the sample size under each label reaches 20,000 by using data augmentation and other methods. We also implement a Vision Transformer and it is trained on our augmented dataset. Under the same parameter setting, we compare ViT with the other four baseline models and demonstrate its superiority. The optimal ViT configuration parameters are obtained by analyzing and comparing the training statistics with different configurations on our dataset and the testing results in a noisy test set. In addition, a real-time facial expression recognition prototype using the web camera and Single Shot Multibox Detector (SSD) face detection module is implemented for real-world evaluation.
author2	Yap Kim Hui
author_facet	Yap Kim Hui Wang, Xiao Yi
format	Thesis-Master by Coursework
author	Wang, Xiao Yi
author_sort	Wang, Xiao Yi
title	Facial expression recognition using deep learning
title_short	Facial expression recognition using deep learning
title_full	Facial expression recognition using deep learning
title_fullStr	Facial expression recognition using deep learning
title_full_unstemmed	Facial expression recognition using deep learning
title_sort	facial expression recognition using deep learning
publisher	Nanyang Technological University
publishDate	2024
url	https://hdl.handle.net/10356/178999
_version_	1806059757672333312

Facial expression recognition using deep learning

Similar Items