PARTIAL FACE IDENTIFICATION ON A LIMITED POSE TYPE AND SAMPLE FACE DATASET
Since the early 1970s, face recognition has been a well-known research topic in computer vision, and it has advanced rapidly in the deep learning era. Face recognition has two primary tasks: face verification and face identification. Face recognition remains difficult due to four fundamental issu...
Saved in:
Main Author: | |
---|---|
Format: | Dissertations |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/79984 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:79984 |
---|---|
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Since the early 1970s, face recognition has been a well-known research topic in
computer vision, and it has advanced rapidly in the deep learning era. Face
recognition has two primary tasks: face verification and face identification. Face
recognition remains difficult due to four fundamental issues: pose, occlusion, facial
expressions, and illumination. And three out of four of these real difficulties are
part of the partial face problem, which makes facial recognition harder. One form
of partial face is a profile face with a counterclockwise angle of 90, while a
standard face perpendicular to the front of the camera is called a frontal face.
Frontal face recognition is a closed problem that can be used as a reference for
individual identity.
The primary objective of this study is to investigate the difficulty of identifying a
person's frontal face from his profile face when trained and tested on a database
containing a very small number of facial images per subject and pose. For this
research, the ITB Frontal Profile Limited Dataset (IFPLD) was created, which
contained only one frontal face and one profile face for each subject. Many
institutions still preserve individual data along with the individual's frontal face
and profile faces in such personal databases.
This study focused on addressing the challenge of face identification using a dataset
with few samples. Two distinct methodologies were employed: a supervised
learning approach and a meta-learning technique. The supervised learning
strategy involves the extraction of features using two distinct methods: the classical
method, namely Scale-Invariant Feature Transform (SIFT), and the deep learning
method, specifically FaceNet. The input to the face verification system consists of
concatenated FaceNet embedding vectors of frontal faces and profile faces.
However, there is a significant imbalance in the proportions of data between the
same face (minority) and various faces (majority). The classification system is
developed using standard training methods and a method we call 2k-balanced
training to ensure that the percentage of identical and different face class data is
balanced. The experiments concluded that the face verification system has high
accuracy but low precision and sensitivity. However, the supervised learning
iv
approach for face verification has low accuracy when used for face identification
tasks, which are the main research topic.
Utilizing a meta-learning technique, the second strategy looks at face identification
in the IFPLD dataset as a few-shot learning problem, or more specifically, N-way-
1-shot learning, which is further studied with the help of Siamese and prototypical
networks. The quality of the embedding vector features in both metric-based meta-
learning techniques mostly determines the effectiveness of the final model. Fine-
tuning was done on the ResNet-18 model, which has been pre-trained with
ImageNet using a variety of training methods to obtain representative embedding
vectors, such as using the siamese network with contrastive loss and triplet loss and
SimCLR training method, which is a self-supervised learning (SSL) method, in
order to produce representative features on the IFPLD dataset..
Two approaches to determining the distance between query data and prototypes
are suggested in the prototypical network method, and they impact the accuracy
level. By transforming 1-shot learning problems into k-shot learning, where there
is k-1 augmentation data in training and testing episodes to gain superior
performance, the prototypical network outperforms the siamese network in
accuracy. It has been demonstrated that using augmentation data in prototype
networks enhances model performance. The augmentation data comes from five
basic image processing procedures. On the IFPLD dataset, the ResNet-18_IN*
model produced the best results with 97%, 89%, and 82% accuracy in identifying
5, 20, and 40 individuals, respectively, in the N-way-1-shot learning schema. |
format |
Dissertations |
author |
Djamaluddin, Muhammad |
spellingShingle |
Djamaluddin, Muhammad PARTIAL FACE IDENTIFICATION ON A LIMITED POSE TYPE AND SAMPLE FACE DATASET |
author_facet |
Djamaluddin, Muhammad |
author_sort |
Djamaluddin, Muhammad |
title |
PARTIAL FACE IDENTIFICATION ON A LIMITED POSE TYPE AND SAMPLE FACE DATASET |
title_short |
PARTIAL FACE IDENTIFICATION ON A LIMITED POSE TYPE AND SAMPLE FACE DATASET |
title_full |
PARTIAL FACE IDENTIFICATION ON A LIMITED POSE TYPE AND SAMPLE FACE DATASET |
title_fullStr |
PARTIAL FACE IDENTIFICATION ON A LIMITED POSE TYPE AND SAMPLE FACE DATASET |
title_full_unstemmed |
PARTIAL FACE IDENTIFICATION ON A LIMITED POSE TYPE AND SAMPLE FACE DATASET |
title_sort |
partial face identification on a limited pose type and sample face dataset |
url |
https://digilib.itb.ac.id/gdl/view/79984 |
_version_ |
1822281471905234944 |
spelling |
id-itb.:799842024-01-17T10:00:38ZPARTIAL FACE IDENTIFICATION ON A LIMITED POSE TYPE AND SAMPLE FACE DATASET Djamaluddin, Muhammad Indonesia Dissertations Face Identification, IFPLD, Siamese Network, Few-shot learning, Prototypical Network INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/79984 Since the early 1970s, face recognition has been a well-known research topic in computer vision, and it has advanced rapidly in the deep learning era. Face recognition has two primary tasks: face verification and face identification. Face recognition remains difficult due to four fundamental issues: pose, occlusion, facial expressions, and illumination. And three out of four of these real difficulties are part of the partial face problem, which makes facial recognition harder. One form of partial face is a profile face with a counterclockwise angle of 90, while a standard face perpendicular to the front of the camera is called a frontal face. Frontal face recognition is a closed problem that can be used as a reference for individual identity. The primary objective of this study is to investigate the difficulty of identifying a person's frontal face from his profile face when trained and tested on a database containing a very small number of facial images per subject and pose. For this research, the ITB Frontal Profile Limited Dataset (IFPLD) was created, which contained only one frontal face and one profile face for each subject. Many institutions still preserve individual data along with the individual's frontal face and profile faces in such personal databases. This study focused on addressing the challenge of face identification using a dataset with few samples. Two distinct methodologies were employed: a supervised learning approach and a meta-learning technique. The supervised learning strategy involves the extraction of features using two distinct methods: the classical method, namely Scale-Invariant Feature Transform (SIFT), and the deep learning method, specifically FaceNet. The input to the face verification system consists of concatenated FaceNet embedding vectors of frontal faces and profile faces. However, there is a significant imbalance in the proportions of data between the same face (minority) and various faces (majority). The classification system is developed using standard training methods and a method we call 2k-balanced training to ensure that the percentage of identical and different face class data is balanced. The experiments concluded that the face verification system has high accuracy but low precision and sensitivity. However, the supervised learning iv approach for face verification has low accuracy when used for face identification tasks, which are the main research topic. Utilizing a meta-learning technique, the second strategy looks at face identification in the IFPLD dataset as a few-shot learning problem, or more specifically, N-way- 1-shot learning, which is further studied with the help of Siamese and prototypical networks. The quality of the embedding vector features in both metric-based meta- learning techniques mostly determines the effectiveness of the final model. Fine- tuning was done on the ResNet-18 model, which has been pre-trained with ImageNet using a variety of training methods to obtain representative embedding vectors, such as using the siamese network with contrastive loss and triplet loss and SimCLR training method, which is a self-supervised learning (SSL) method, in order to produce representative features on the IFPLD dataset.. Two approaches to determining the distance between query data and prototypes are suggested in the prototypical network method, and they impact the accuracy level. By transforming 1-shot learning problems into k-shot learning, where there is k-1 augmentation data in training and testing episodes to gain superior performance, the prototypical network outperforms the siamese network in accuracy. It has been demonstrated that using augmentation data in prototype networks enhances model performance. The augmentation data comes from five basic image processing procedures. On the IFPLD dataset, the ResNet-18_IN* model produced the best results with 97%, 89%, and 82% accuracy in identifying 5, 20, and 40 individuals, respectively, in the N-way-1-shot learning schema. text |