Deep active learning for training object detection

While there have been extensive applications deploying object detection, one of its limitations is the continuous need for a large amount of annotated images for reliable performance. This can be attributed to the limitation of the conventional workflow of training supervised object detection algori...

Full description

Saved in:
Bibliographic Details
Main Author: Jose, John Anthony
Format: text
Language:English
Published: Animo Repository 2022
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etdd_ece/2
https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1001&context=etdd_ece
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etdd_ece-1001
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etdd_ece-10012022-07-21T23:02:28Z Deep active learning for training object detection Jose, John Anthony While there have been extensive applications deploying object detection, one of its limitations is the continuous need for a large amount of annotated images for reliable performance. This can be attributed to the limitation of the conventional workflow of training supervised object detection algorithms. The aim of this study is to propose a new workflow that reduces the amount of annotated images needed for training by "intelligently" sampling the most informative unlabeled image, known as \textit{active learning}. Existing active learning literature has focused on incorporating prediction uncertainty to identify the most informative image. While it is significant and has merit, focusing on improving uncertainty estimation is not holistic. This study proposes that there are two more factors that are equally important to be considered: (1) improving the representation in a limited label setting, (2) suppressing noisy prediction when intelligently sampling for new images. Using these simple modifications, it is able to acquire 76% mean average precision (mAP) using 20% of the data, which beats state-of-the-art by a large margin. By comparing its performance with conventional training workflow, it is able to garner 95% of the performance using only 20% of the images. 2022-07-01T07:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etdd_ece/2 https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1001&context=etdd_ece Electronics And Communications Engineering Dissertations English Animo Repository Computer vision Deep learning (Machine learning) Electrical and Computer Engineering Electrical and Electronics Systems and Communications
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
topic Computer vision
Deep learning (Machine learning)
Electrical and Computer Engineering
Electrical and Electronics
Systems and Communications
spellingShingle Computer vision
Deep learning (Machine learning)
Electrical and Computer Engineering
Electrical and Electronics
Systems and Communications
Jose, John Anthony
Deep active learning for training object detection
description While there have been extensive applications deploying object detection, one of its limitations is the continuous need for a large amount of annotated images for reliable performance. This can be attributed to the limitation of the conventional workflow of training supervised object detection algorithms. The aim of this study is to propose a new workflow that reduces the amount of annotated images needed for training by "intelligently" sampling the most informative unlabeled image, known as \textit{active learning}. Existing active learning literature has focused on incorporating prediction uncertainty to identify the most informative image. While it is significant and has merit, focusing on improving uncertainty estimation is not holistic. This study proposes that there are two more factors that are equally important to be considered: (1) improving the representation in a limited label setting, (2) suppressing noisy prediction when intelligently sampling for new images. Using these simple modifications, it is able to acquire 76% mean average precision (mAP) using 20% of the data, which beats state-of-the-art by a large margin. By comparing its performance with conventional training workflow, it is able to garner 95% of the performance using only 20% of the images.
format text
author Jose, John Anthony
author_facet Jose, John Anthony
author_sort Jose, John Anthony
title Deep active learning for training object detection
title_short Deep active learning for training object detection
title_full Deep active learning for training object detection
title_fullStr Deep active learning for training object detection
title_full_unstemmed Deep active learning for training object detection
title_sort deep active learning for training object detection
publisher Animo Repository
publishDate 2022
url https://animorepository.dlsu.edu.ph/etdd_ece/2
https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1001&context=etdd_ece
_version_ 1740844630672408576