Pedestrian tracking based on deep learning
The methodology of pedestrian tracking extensively incorporates techniques derived from the field of computer vision. This domain is primarily concerned with the processing and interpretation of visual data obtained from still images or video sequences. Tracking pedestrians simultaneously requires a...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Coursework |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/176521 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-176521 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1765212024-05-17T15:49:00Z Pedestrian tracking based on deep learning Xia, Yuqi Yap Kim Hui School of Electrical and Electronic Engineering EKHYap@ntu.edu.sg Computer and Information Science Deep learning Multiple object tracking The methodology of pedestrian tracking extensively incorporates techniques derived from the field of computer vision. This domain is primarily concerned with the processing and interpretation of visual data obtained from still images or video sequences. Tracking pedestrians simultaneously requires advanced algorithms capable of detecting, identifying, and continuously monitoring these subjects through successive video frames. The original FairMOT algorithm, conceived as an innovative model, is designed for precise multi-object tracking. It operates using a dual-branch structure: one branch predicts pixel-wise objectness scores, while the other captures features for re-identification. The primary goal of this model is to balance the tasks effectively, aiming for superior tracking and detection accuracy. Despite its strengths, the algorithm encounters challenges with inaccurate target detection, often exacerbated by frequent target occlusions. Our research is dedicated to optimizing the computational efficiency of the tracking algorithm and significantly enhancing its tracking accuracy. This dissertation presents two enhancements to the FairMOT framework, improving pedestrian tracking accuracy. It explores integrating the Cross-Stage-Partial (CSP) network structure into CenterNet’s backbone network and introduces a threshold-based method for more effective Intersection over Union (IoU) matching detection. These enhancements result in a 0.2% increase in detection accuracy over the standard FairMOT algorithm, significant improvements in HOTA (2.60%), MOTA (0.56%), and IDF1 (3.7%) scores, and a 0.64% accuracy gain over DeepSORT and Track-RCNN algorithms. Master's degree 2024-05-16T08:24:29Z 2024-05-16T08:24:29Z 2023 Thesis-Master by Coursework Xia, Y. (2023). Pedestrian tracking based on deep learning. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176521 https://hdl.handle.net/10356/176521 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science Deep learning Multiple object tracking |
spellingShingle |
Computer and Information Science Deep learning Multiple object tracking Xia, Yuqi Pedestrian tracking based on deep learning |
description |
The methodology of pedestrian tracking extensively incorporates techniques derived from the field of computer vision. This domain is primarily concerned with the processing and interpretation of visual data obtained from still images or video sequences. Tracking pedestrians simultaneously requires advanced algorithms capable of detecting, identifying, and continuously monitoring these subjects through successive video frames.
The original FairMOT algorithm, conceived as an innovative model, is designed for precise multi-object tracking. It operates using a dual-branch structure: one branch predicts pixel-wise objectness scores, while the other captures features for re-identification. The primary goal of this model is to balance the tasks effectively, aiming for superior tracking and detection accuracy. Despite its strengths, the algorithm encounters challenges with inaccurate target detection, often exacerbated by frequent target occlusions. Our research is dedicated to optimizing the computational efficiency of the tracking algorithm and significantly enhancing its tracking accuracy.
This dissertation presents two enhancements to the FairMOT framework, improving pedestrian tracking accuracy. It explores integrating the Cross-Stage-Partial (CSP) network structure into CenterNet’s backbone network and introduces a threshold-based method for more effective Intersection over Union (IoU) matching detection. These enhancements result in a 0.2% increase in detection accuracy over the standard FairMOT algorithm, significant improvements in HOTA (2.60%), MOTA (0.56%), and IDF1 (3.7%) scores, and a 0.64% accuracy gain over DeepSORT and Track-RCNN algorithms. |
author2 |
Yap Kim Hui |
author_facet |
Yap Kim Hui Xia, Yuqi |
format |
Thesis-Master by Coursework |
author |
Xia, Yuqi |
author_sort |
Xia, Yuqi |
title |
Pedestrian tracking based on deep learning |
title_short |
Pedestrian tracking based on deep learning |
title_full |
Pedestrian tracking based on deep learning |
title_fullStr |
Pedestrian tracking based on deep learning |
title_full_unstemmed |
Pedestrian tracking based on deep learning |
title_sort |
pedestrian tracking based on deep learning |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/176521 |
_version_ |
1800916142965915648 |