Single-fusion detector: Towards faster multi-scale object detection
Despite recent improvements, the arbitrary sizes of objects still impede the predictive ability of object detectors. Recent solutions combine feature maps of different receptive fields to detect multi-scale objects. However, these methods have large computational costs resulting to slower inference ti...
Saved in:
Main Author: | |
---|---|
Format: | text |
Language: | English |
Published: |
Animo Repository
2019
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/etd_masteral/6609 https://animorepository.dlsu.edu.ph/context/etd_masteral/article/13597/viewcontent/Antioquia__Arren_Matthew_Capuchino2___Single_Fusion_Detector_Towards_Faster_Multi_Scale_Object_Detection___ANTIOQUIA_Redacted.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
Language: | English |
id |
oai:animorepository.dlsu.edu.ph:etd_masteral-13597 |
---|---|
record_format |
eprints |
spelling |
oai:animorepository.dlsu.edu.ph:etd_masteral-135972023-03-01T07:18:50Z Single-fusion detector: Towards faster multi-scale object detection Antioquia, Arren Matthew C. Despite recent improvements, the arbitrary sizes of objects still impede the predictive ability of object detectors. Recent solutions combine feature maps of different receptive fields to detect multi-scale objects. However, these methods have large computational costs resulting to slower inference time, which is not practical for real-time applications. Contrarily, fusion methods depending on large networks with many skip connections require larger memory footprint, prohibiting usage in devices with limited memory. In this paper, we propose a simpler novel fusion method which integrates multiple feature maps using a single concatenation operation. Our method can flexibly adapt to any base network, allowing for tailored performance for different computational requirements. Our approach achieves 81.7% mAP at 41 FPS on the PASCAL VOC dataset using ResNet-50 as the base network, which is superior in terms of both speed and mAP as compared to several state-of-the-art baselines that uses larger base networks. 2019-03-01T08:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etd_masteral/6609 https://animorepository.dlsu.edu.ph/context/etd_masteral/article/13597/viewcontent/Antioquia__Arren_Matthew_Capuchino2___Single_Fusion_Detector_Towards_Faster_Multi_Scale_Object_Detection___ANTIOQUIA_Redacted.pdf Master's Theses English Animo Repository Image converters Neural networks (Computer science) Computer Sciences |
institution |
De La Salle University |
building |
De La Salle University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
De La Salle University Library |
collection |
DLSU Institutional Repository |
language |
English |
topic |
Image converters Neural networks (Computer science) Computer Sciences |
spellingShingle |
Image converters Neural networks (Computer science) Computer Sciences Antioquia, Arren Matthew C. Single-fusion detector: Towards faster multi-scale object detection |
description |
Despite recent improvements, the arbitrary sizes of objects still impede the predictive ability of object detectors. Recent solutions combine feature maps of different receptive fields to detect multi-scale objects. However, these methods have large computational costs resulting to slower inference time, which is not practical for real-time applications. Contrarily, fusion methods depending on large networks with many skip connections require larger memory footprint, prohibiting usage in devices with limited memory. In this paper, we propose a simpler novel fusion method which integrates multiple feature maps using a single concatenation operation. Our method can flexibly adapt to any base network, allowing for tailored performance for different computational requirements. Our approach achieves 81.7% mAP at 41 FPS on the PASCAL VOC dataset using ResNet-50 as the base network, which is superior in terms of both speed and mAP as compared to several state-of-the-art baselines that uses larger base networks. |
format |
text |
author |
Antioquia, Arren Matthew C. |
author_facet |
Antioquia, Arren Matthew C. |
author_sort |
Antioquia, Arren Matthew C. |
title |
Single-fusion detector: Towards faster multi-scale object detection |
title_short |
Single-fusion detector: Towards faster multi-scale object detection |
title_full |
Single-fusion detector: Towards faster multi-scale object detection |
title_fullStr |
Single-fusion detector: Towards faster multi-scale object detection |
title_full_unstemmed |
Single-fusion detector: Towards faster multi-scale object detection |
title_sort |
single-fusion detector: towards faster multi-scale object detection |
publisher |
Animo Repository |
publishDate |
2019 |
url |
https://animorepository.dlsu.edu.ph/etd_masteral/6609 https://animorepository.dlsu.edu.ph/context/etd_masteral/article/13597/viewcontent/Antioquia__Arren_Matthew_Capuchino2___Single_Fusion_Detector_Towards_Faster_Multi_Scale_Object_Detection___ANTIOQUIA_Redacted.pdf |
_version_ |
1767196930090205184 |