Single-fusion detector: Towards faster multi-scale object detection

Despite recent improvements, the arbitrary sizes of objects still impede the predictive ability of object detectors. Recent solutions combine feature maps of different receptive fields to detect multi-scale objects. However, these methods have large computational costs resulting to slower inference ti...

Full description

Saved in:
Bibliographic Details
Main Author: Antioquia, Arren Matthew C.
Format: text
Language:English
Published: Animo Repository 2019
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etd_masteral/6609
https://animorepository.dlsu.edu.ph/context/etd_masteral/article/13597/viewcontent/Antioquia__Arren_Matthew_Capuchino2___Single_Fusion_Detector_Towards_Faster_Multi_Scale_Object_Detection___ANTIOQUIA_Redacted.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etd_masteral-13597
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etd_masteral-135972023-03-01T07:18:50Z Single-fusion detector: Towards faster multi-scale object detection Antioquia, Arren Matthew C. Despite recent improvements, the arbitrary sizes of objects still impede the predictive ability of object detectors. Recent solutions combine feature maps of different receptive fields to detect multi-scale objects. However, these methods have large computational costs resulting to slower inference time, which is not practical for real-time applications. Contrarily, fusion methods depending on large networks with many skip connections require larger memory footprint, prohibiting usage in devices with limited memory. In this paper, we propose a simpler novel fusion method which integrates multiple feature maps using a single concatenation operation. Our method can flexibly adapt to any base network, allowing for tailored performance for different computational requirements. Our approach achieves 81.7% mAP at 41 FPS on the PASCAL VOC dataset using ResNet-50 as the base network, which is superior in terms of both speed and mAP as compared to several state-of-the-art baselines that uses larger base networks. 2019-03-01T08:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etd_masteral/6609 https://animorepository.dlsu.edu.ph/context/etd_masteral/article/13597/viewcontent/Antioquia__Arren_Matthew_Capuchino2___Single_Fusion_Detector_Towards_Faster_Multi_Scale_Object_Detection___ANTIOQUIA_Redacted.pdf Master's Theses English Animo Repository Image converters Neural networks (Computer science) Computer Sciences
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
topic Image converters
Neural networks (Computer science)
Computer Sciences
spellingShingle Image converters
Neural networks (Computer science)
Computer Sciences
Antioquia, Arren Matthew C.
Single-fusion detector: Towards faster multi-scale object detection
description Despite recent improvements, the arbitrary sizes of objects still impede the predictive ability of object detectors. Recent solutions combine feature maps of different receptive fields to detect multi-scale objects. However, these methods have large computational costs resulting to slower inference time, which is not practical for real-time applications. Contrarily, fusion methods depending on large networks with many skip connections require larger memory footprint, prohibiting usage in devices with limited memory. In this paper, we propose a simpler novel fusion method which integrates multiple feature maps using a single concatenation operation. Our method can flexibly adapt to any base network, allowing for tailored performance for different computational requirements. Our approach achieves 81.7% mAP at 41 FPS on the PASCAL VOC dataset using ResNet-50 as the base network, which is superior in terms of both speed and mAP as compared to several state-of-the-art baselines that uses larger base networks.
format text
author Antioquia, Arren Matthew C.
author_facet Antioquia, Arren Matthew C.
author_sort Antioquia, Arren Matthew C.
title Single-fusion detector: Towards faster multi-scale object detection
title_short Single-fusion detector: Towards faster multi-scale object detection
title_full Single-fusion detector: Towards faster multi-scale object detection
title_fullStr Single-fusion detector: Towards faster multi-scale object detection
title_full_unstemmed Single-fusion detector: Towards faster multi-scale object detection
title_sort single-fusion detector: towards faster multi-scale object detection
publisher Animo Repository
publishDate 2019
url https://animorepository.dlsu.edu.ph/etd_masteral/6609
https://animorepository.dlsu.edu.ph/context/etd_masteral/article/13597/viewcontent/Antioquia__Arren_Matthew_Capuchino2___Single_Fusion_Detector_Towards_Faster_Multi_Scale_Object_Detection___ANTIOQUIA_Redacted.pdf
_version_ 1767196930090205184