A benchmark of CNN backbones on DINO-DETR performance in object detection

Recent developments in DETR-based models have made significant improvements in training convergence but not small object detection. This paper combines the ConvNeXt and FocalNet backbones with DINO-DETR using timm and detrex, and presents a benchmark and analysis of the resulting model performances...

Full description

Saved in:
Bibliographic Details
Main Author: Liew, Zon Hur Zhen
Other Authors: Lu Shijian
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2023
Subjects:
Online Access:https://hdl.handle.net/10356/172020
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-172020
record_format dspace
spelling sg-ntu-dr.10356-1720202023-11-24T15:37:07Z A benchmark of CNN backbones on DINO-DETR performance in object detection Liew, Zon Hur Zhen Lu Shijian School of Computer Science and Engineering Shijian.Lu@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Recent developments in DETR-based models have made significant improvements in training convergence but not small object detection. This paper combines the ConvNeXt and FocalNet backbones with DINO-DETR using timm and detrex, and presents a benchmark and analysis of the resulting model performances on MS-COCO and SODA-D. The results affirm many conclusions from the ConvNeXt and FocalNet papers while exhibiting inconsistencies for FocalNets on SODA-D. Finally, the results show encouraging performance for DINO-DETR with recent backbones on general object detection and the need for further improvement on small object detection with DINO-DETR across all backbones. Further efforts should be made to integrate state-of-the-art features from concurrent developments to produce new benchmarks on small object detection datasets with accessible existing technology. Bachelor of Engineering (Computer Science) 2023-11-20T09:11:43Z 2023-11-20T09:11:43Z 2023 Final Year Project (FYP) Liew, Z. H. Z. (2023). A benchmark of CNN backbones on DINO-DETR performance in object detection. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/172020 https://hdl.handle.net/10356/172020 en SCSE22-0660 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Liew, Zon Hur Zhen
A benchmark of CNN backbones on DINO-DETR performance in object detection
description Recent developments in DETR-based models have made significant improvements in training convergence but not small object detection. This paper combines the ConvNeXt and FocalNet backbones with DINO-DETR using timm and detrex, and presents a benchmark and analysis of the resulting model performances on MS-COCO and SODA-D. The results affirm many conclusions from the ConvNeXt and FocalNet papers while exhibiting inconsistencies for FocalNets on SODA-D. Finally, the results show encouraging performance for DINO-DETR with recent backbones on general object detection and the need for further improvement on small object detection with DINO-DETR across all backbones. Further efforts should be made to integrate state-of-the-art features from concurrent developments to produce new benchmarks on small object detection datasets with accessible existing technology.
author2 Lu Shijian
author_facet Lu Shijian
Liew, Zon Hur Zhen
format Final Year Project
author Liew, Zon Hur Zhen
author_sort Liew, Zon Hur Zhen
title A benchmark of CNN backbones on DINO-DETR performance in object detection
title_short A benchmark of CNN backbones on DINO-DETR performance in object detection
title_full A benchmark of CNN backbones on DINO-DETR performance in object detection
title_fullStr A benchmark of CNN backbones on DINO-DETR performance in object detection
title_full_unstemmed A benchmark of CNN backbones on DINO-DETR performance in object detection
title_sort benchmark of cnn backbones on dino-detr performance in object detection
publisher Nanyang Technological University
publishDate 2023
url https://hdl.handle.net/10356/172020
_version_ 1783955525201821696