Deep active learning for training object detection
While there have been extensive applications deploying object detection, one of its limitations is the continuous need for a large amount of annotated images for reliable performance. This can be attributed to the limitation of the conventional workflow of training supervised object detection algori...
Saved in:
Main Author: | |
---|---|
Format: | text |
Language: | English |
Published: |
Animo Repository
2022
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/etdd_ece/2 https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1001&context=etdd_ece |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
Language: | English |
id |
oai:animorepository.dlsu.edu.ph:etdd_ece-1001 |
---|---|
record_format |
eprints |
spelling |
oai:animorepository.dlsu.edu.ph:etdd_ece-10012022-07-21T23:02:28Z Deep active learning for training object detection Jose, John Anthony While there have been extensive applications deploying object detection, one of its limitations is the continuous need for a large amount of annotated images for reliable performance. This can be attributed to the limitation of the conventional workflow of training supervised object detection algorithms. The aim of this study is to propose a new workflow that reduces the amount of annotated images needed for training by "intelligently" sampling the most informative unlabeled image, known as \textit{active learning}. Existing active learning literature has focused on incorporating prediction uncertainty to identify the most informative image. While it is significant and has merit, focusing on improving uncertainty estimation is not holistic. This study proposes that there are two more factors that are equally important to be considered: (1) improving the representation in a limited label setting, (2) suppressing noisy prediction when intelligently sampling for new images. Using these simple modifications, it is able to acquire 76% mean average precision (mAP) using 20% of the data, which beats state-of-the-art by a large margin. By comparing its performance with conventional training workflow, it is able to garner 95% of the performance using only 20% of the images. 2022-07-01T07:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etdd_ece/2 https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1001&context=etdd_ece Electronics And Communications Engineering Dissertations English Animo Repository Computer vision Deep learning (Machine learning) Electrical and Computer Engineering Electrical and Electronics Systems and Communications |
institution |
De La Salle University |
building |
De La Salle University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
De La Salle University Library |
collection |
DLSU Institutional Repository |
language |
English |
topic |
Computer vision Deep learning (Machine learning) Electrical and Computer Engineering Electrical and Electronics Systems and Communications |
spellingShingle |
Computer vision Deep learning (Machine learning) Electrical and Computer Engineering Electrical and Electronics Systems and Communications Jose, John Anthony Deep active learning for training object detection |
description |
While there have been extensive applications deploying object detection, one of its limitations is the continuous need for a large amount of annotated images for reliable performance. This can be attributed to the limitation of the conventional workflow of training supervised object detection algorithms.
The aim of this study is to propose a new workflow that reduces the amount of annotated images needed for training by "intelligently" sampling the most informative unlabeled image, known as \textit{active learning}. Existing active learning literature has focused on incorporating prediction uncertainty to identify the most informative image. While it is significant and has merit, focusing on improving uncertainty estimation is not holistic.
This study proposes that there are two more factors that are equally important to be considered: (1) improving the representation in a limited label setting, (2) suppressing noisy prediction when intelligently sampling for new images. Using these simple modifications, it is able to acquire 76% mean average precision (mAP) using 20% of the data, which beats state-of-the-art by a large margin. By comparing its performance with conventional training workflow, it is able to garner 95% of the performance using only 20% of the images. |
format |
text |
author |
Jose, John Anthony |
author_facet |
Jose, John Anthony |
author_sort |
Jose, John Anthony |
title |
Deep active learning for training object detection |
title_short |
Deep active learning for training object detection |
title_full |
Deep active learning for training object detection |
title_fullStr |
Deep active learning for training object detection |
title_full_unstemmed |
Deep active learning for training object detection |
title_sort |
deep active learning for training object detection |
publisher |
Animo Repository |
publishDate |
2022 |
url |
https://animorepository.dlsu.edu.ph/etdd_ece/2 https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1001&context=etdd_ece |
_version_ |
1740844630672408576 |