Camera domain transfer for video-based person re-identification

Person re-identification (Re-ID), also known as pedestrian re-identification, is a new technology that has emerged in the area of intelligent video analysis in recent years, and belongs to the category of image and video processing and analysis in complex environments. According to different researc...

Full description

Saved in:
Bibliographic Details
Main Author: Ding, Bangjie
Other Authors: Tan Yap Peng
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/159290
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-159290
record_format dspace
spelling sg-ntu-dr.10356-1592902022-06-14T04:57:23Z Camera domain transfer for video-based person re-identification Ding, Bangjie Tan Yap Peng School of Electrical and Electronic Engineering EYPTan@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Person re-identification (Re-ID), also known as pedestrian re-identification, is a new technology that has emerged in the area of intelligent video analysis in recent years, and belongs to the category of image and video processing and analysis in complex environments. According to different research objects, person Re-ID can be divided into two types: image-based and video-based. The object of this dissertation is video-based person Re-ID, which is based on surveillance videos. Compared to methods use single frame image methods use surveillance videos has two advantages:1) more information is available, including the appearance, posture and movement of the person, 2) it is closer to the real-world application scenario, as the person in surveillance cameras are also presented in the form of video. However, due to the environment and the limitations of the equipment, there are instances where videos from the same identity are less similar than the video from the different identities. This is a common problem in cross-camera person Re-ID. Besides, feature learning based on deep learning methods is prone to overfitting on the relatively small scale video dataset. To solve the problems mentioned above, a network based on StarGAN v2 is proposed in this dissertation. The proposed method, CaTSGAN, can address the video style variations problem caused by different cameras by transferring videos from one camera domain to others. In this way we can eliminate the video style variations. CaTSGAN can learn the relationship among the multiple camera domains with a single model and generate the cross-camera extra training samples for person Re-ID, which serves as a data augmentation approach that smooths the video style disparities and increase data diversity against overfitting. Compared to the baseline, the proposed method achieves a nearly 10-point improvement in the mean average precision (mAP), with Rank-5 and Rank-10 achieving 97.6% and 99.1% accuracy, respectively. This demonstrates the feasibility and improvement of the method. Master of Science (Communications Engineering) 2022-06-14T04:57:23Z 2022-06-14T04:57:23Z 2022 Thesis-Master by Coursework Ding, B. (2022). Camera domain transfer for video-based person re-identification. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/159290 https://hdl.handle.net/10356/159290 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Ding, Bangjie
Camera domain transfer for video-based person re-identification
description Person re-identification (Re-ID), also known as pedestrian re-identification, is a new technology that has emerged in the area of intelligent video analysis in recent years, and belongs to the category of image and video processing and analysis in complex environments. According to different research objects, person Re-ID can be divided into two types: image-based and video-based. The object of this dissertation is video-based person Re-ID, which is based on surveillance videos. Compared to methods use single frame image methods use surveillance videos has two advantages:1) more information is available, including the appearance, posture and movement of the person, 2) it is closer to the real-world application scenario, as the person in surveillance cameras are also presented in the form of video. However, due to the environment and the limitations of the equipment, there are instances where videos from the same identity are less similar than the video from the different identities. This is a common problem in cross-camera person Re-ID. Besides, feature learning based on deep learning methods is prone to overfitting on the relatively small scale video dataset. To solve the problems mentioned above, a network based on StarGAN v2 is proposed in this dissertation. The proposed method, CaTSGAN, can address the video style variations problem caused by different cameras by transferring videos from one camera domain to others. In this way we can eliminate the video style variations. CaTSGAN can learn the relationship among the multiple camera domains with a single model and generate the cross-camera extra training samples for person Re-ID, which serves as a data augmentation approach that smooths the video style disparities and increase data diversity against overfitting. Compared to the baseline, the proposed method achieves a nearly 10-point improvement in the mean average precision (mAP), with Rank-5 and Rank-10 achieving 97.6% and 99.1% accuracy, respectively. This demonstrates the feasibility and improvement of the method.
author2 Tan Yap Peng
author_facet Tan Yap Peng
Ding, Bangjie
format Thesis-Master by Coursework
author Ding, Bangjie
author_sort Ding, Bangjie
title Camera domain transfer for video-based person re-identification
title_short Camera domain transfer for video-based person re-identification
title_full Camera domain transfer for video-based person re-identification
title_fullStr Camera domain transfer for video-based person re-identification
title_full_unstemmed Camera domain transfer for video-based person re-identification
title_sort camera domain transfer for video-based person re-identification
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/159290
_version_ 1736856404209172480