Comparative analysis of YOLO and transformers for pedestrian detection
This report aims to study and compare the performance of two state-of-the-art real-time object detectors – YOLOv8 (You Only Look Once, 8th version) and RT-DETR (Real-Time Detection Transformers) in tackling pedestrian detection. Throughout the report, both models were trained and evaluated on differ...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/175269 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-175269 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1752692024-04-26T15:42:19Z Comparative analysis of YOLO and transformers for pedestrian detection Wong, Ying Xuan Vidya Sudarshan School of Computer Science and Engineering vidya.sudarshan@ntu.edu.sg Computer and Information Science This report aims to study and compare the performance of two state-of-the-art real-time object detectors – YOLOv8 (You Only Look Once, 8th version) and RT-DETR (Real-Time Detection Transformers) in tackling pedestrian detection. Throughout the report, both models were trained and evaluated on different pedestrian datasets, including TJU-DHD-Traffic, Caltech Pedestrian, KITTI, INRIA Person and Cityscapes. Besides, the performance of the integrated models between YOLOv8 and RT-DETR was also investigated. Thorough analyses were conducted, and it was concluded that YOLOv8 achieved a faster inference speed than RT-DETR regarding limited GPU resources. Besides, the integrated achieved comparable speed with YOLOv8, with accuracies comparable to or surpassing the RT-DETR models, highlighting the feasibility of integrating both detectors. Future work can include alternating integrated models to attain optimal results. Besides, tuning and experimenting on larger batch sizes shall also be included to conduct a more comprehensive comparison. Bachelor's degree 2024-04-23T04:57:24Z 2024-04-23T04:57:24Z 2024 Final Year Project (FYP) Wong, Y. X. (2024). Comparative analysis of YOLO and transformers for pedestrian detection. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175269 https://hdl.handle.net/10356/175269 en SCSE23-0720 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science |
spellingShingle |
Computer and Information Science Wong, Ying Xuan Comparative analysis of YOLO and transformers for pedestrian detection |
description |
This report aims to study and compare the performance of two state-of-the-art real-time object detectors – YOLOv8 (You Only Look Once, 8th version) and RT-DETR (Real-Time Detection Transformers) in tackling pedestrian detection. Throughout the report, both models were trained and evaluated on different pedestrian datasets, including TJU-DHD-Traffic, Caltech Pedestrian, KITTI, INRIA Person and Cityscapes. Besides, the performance of the integrated models between YOLOv8 and RT-DETR was also investigated. Thorough analyses were conducted, and it was concluded that YOLOv8 achieved a faster inference speed than RT-DETR regarding limited GPU resources. Besides, the integrated achieved comparable speed with YOLOv8, with accuracies comparable to or surpassing the RT-DETR models, highlighting the feasibility of integrating both detectors. Future work can include alternating integrated models to attain optimal results. Besides, tuning and experimenting on larger batch sizes shall also be included to conduct a more comprehensive comparison. |
author2 |
Vidya Sudarshan |
author_facet |
Vidya Sudarshan Wong, Ying Xuan |
format |
Final Year Project |
author |
Wong, Ying Xuan |
author_sort |
Wong, Ying Xuan |
title |
Comparative analysis of YOLO and transformers for pedestrian detection |
title_short |
Comparative analysis of YOLO and transformers for pedestrian detection |
title_full |
Comparative analysis of YOLO and transformers for pedestrian detection |
title_fullStr |
Comparative analysis of YOLO and transformers for pedestrian detection |
title_full_unstemmed |
Comparative analysis of YOLO and transformers for pedestrian detection |
title_sort |
comparative analysis of yolo and transformers for pedestrian detection |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/175269 |
_version_ |
1800916097727201280 |