Deep learning based detector for real-time facial expression recognition
Automated Facial Expression Recognition (AFER) technology has become a hot topic in the field of pattern recognition. An accurate and fast real-time detection of facial expression could significantly bring benefits to important applications in many areas such as HumanComputer Interaction and Compute...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2019
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/77018 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-77018 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-770182023-03-03T20:39:00Z Deep learning based detector for real-time facial expression recognition Han, Wei Jie Lu Shijian School of Computer Science and Engineering DRNTU::Engineering::Computer science and engineering Automated Facial Expression Recognition (AFER) technology has become a hot topic in the field of pattern recognition. An accurate and fast real-time detection of facial expression could significantly bring benefits to important applications in many areas such as HumanComputer Interaction and Computer Vision. Leveraging on the recent developments in deep learning techniques (Convolutional Neural Networks) in computer vision, enabling researchers to drastically improve the accuracy and performance of objection detection and recognition systems. In this paper, we use TensorFlow Object Detection (TFOD) API, an open source framework to train and test our end-to-end deep learning-based object detector on Compound Facial Expression of Emotion Database (CFEED). The aim is to develop a robust real-time facial expression detector for detecting and classifying the key seven human emotions: neutrality, happiness, sadness, fear, anger, surprise, and disgust. We employed two different types of meta architectures which are Faster R-CNN and SSD for object detection. Each of these meta-architectures is combined with deep feature extractors such as InceptionNet and MobileNet respectively to extract high-level representation automatically direct from raw images. Furthermore, we focus on reducing the needed amount of training data drastically by exploring transfer learning and fine-tuning the model parameters, while still maintaining high average precision. To aid generalization, data augmentation and dropout techniques were used to avoid overfitting. Our experiments show that with more fine-tuning and depth, the accuracy performance of “SSD_MobileNet_V1_COCO” and “Faster_RCNN_InceptionNet_V2_COCO” achieves 84.85% and 86.42% respectively on the CFEED testing set. Bachelor of Engineering (Computer Science) 2019-04-30T14:02:40Z 2019-04-30T14:02:40Z 2019 Final Year Project (FYP) http://hdl.handle.net/10356/77018 en Nanyang Technological University 63 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering |
spellingShingle |
DRNTU::Engineering::Computer science and engineering Han, Wei Jie Deep learning based detector for real-time facial expression recognition |
description |
Automated Facial Expression Recognition (AFER) technology has become a hot topic in the field of pattern recognition. An accurate and fast real-time detection of facial expression could significantly bring benefits to important applications in many areas such as HumanComputer Interaction and Computer Vision. Leveraging on the recent developments in deep learning techniques (Convolutional Neural Networks) in computer vision, enabling researchers to drastically improve the accuracy and performance of objection detection and recognition systems. In this paper, we use TensorFlow Object Detection (TFOD) API, an open source framework to train and test our end-to-end deep learning-based object detector on Compound Facial Expression of Emotion Database (CFEED). The aim is to develop a robust real-time facial expression detector for detecting and classifying the key seven human emotions: neutrality, happiness, sadness, fear, anger, surprise, and disgust. We employed two different types of meta architectures which are Faster R-CNN and SSD for object detection. Each of these meta-architectures is combined with deep feature extractors such as InceptionNet and MobileNet respectively to extract high-level representation automatically direct from raw images. Furthermore, we focus on reducing the needed amount of training data drastically by exploring transfer learning and fine-tuning the model parameters, while still maintaining high average precision. To aid generalization, data augmentation and dropout techniques were used to avoid overfitting. Our experiments show that with more fine-tuning and depth, the accuracy performance of “SSD_MobileNet_V1_COCO” and “Faster_RCNN_InceptionNet_V2_COCO” achieves 84.85% and 86.42% respectively on the CFEED testing set. |
author2 |
Lu Shijian |
author_facet |
Lu Shijian Han, Wei Jie |
format |
Final Year Project |
author |
Han, Wei Jie |
author_sort |
Han, Wei Jie |
title |
Deep learning based detector for real-time facial expression recognition |
title_short |
Deep learning based detector for real-time facial expression recognition |
title_full |
Deep learning based detector for real-time facial expression recognition |
title_fullStr |
Deep learning based detector for real-time facial expression recognition |
title_full_unstemmed |
Deep learning based detector for real-time facial expression recognition |
title_sort |
deep learning based detector for real-time facial expression recognition |
publishDate |
2019 |
url |
http://hdl.handle.net/10356/77018 |
_version_ |
1759855530439868416 |