Pedestrian attribute recognition

Pedestrian Attribute Recognition aims to develop a machine learning model to recognize person attributes including age, gender, clothing and accessories in a given person image. It is a challenging task due to high quality variance and inconsistent number of attributes in input image samples. Many m...

Full description

Saved in:
Bibliographic Details
Main Author: Liu, Yuan
Other Authors: Lin Zhiping
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/139139
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Pedestrian Attribute Recognition aims to develop a machine learning model to recognize person attributes including age, gender, clothing and accessories in a given person image. It is a challenging task due to high quality variance and inconsistent number of attributes in input image samples. Many models have been proposed to tackle this problem. However, there are two open problems that remain to be addressed. On the one hand, visually similar pedestrian images tend to share similar attributes, and the upper and lower attributes normally match in style. However, the existing methods normally either ignore these relations or capture them ineffectively. On the other hand, how to maximise the benefit from each input pedestrian sample based on their importance to the machine learning model has not been explored, which can be critical given the huge variety in training images. In this report, we propose two deep learning based neural network models to tackle the two problems separately. The “Pentadent-Net” (PD-Net) is introduced with two parallel branches, where the two branches capture the relations among different samples and different attributes with a graphical method. The “Reinforced Sample Re-weighting” (RSR) is also proposed to re-weight samples in a batch during back-propagation through reinforcement learning, which can facilitate model training based on each input sample’s importance. The proposed PD-Net and RSR achieve state-of-the-art performance against existing methods on two large scale pedestrian attribute benchmarks PETA and PA-100K, which validates their effectives and advantages. The research paper generated from RSR has been accepted by WCCI 2020. The research paper on PD-Net is pending review for ICIP 2020. Notably, this project has been shortlisted as one of the 8 finalists for the EEE FYP Challenge 2020.