Synthesis of annotated images as dataset for vehicle counting neural networks using semi-supervised learning

Bibliographic Details
Main Author: Chan, Patrick Matthew J.
Format: text
Language: English
Published: Animo Repository 2022
Online Access: https://animorepository.dlsu.edu.ph/etdm_ece/19
https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1023&context=etdm_ece
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:etdm_ece-1023
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etdm_ece-1023 2023-01-06T00:44:33Z 2022-12-01T08:00:00Z text application/pdf Electronics And Communications Engineering Master's Theses English Animo Repository
building De La Salle University Library
continent Asia
country Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Computer vision
Vehicle detectors
Supervised learning (Machine learning)
Neural networks (Computer science)
description An intelligent vehicle counting camera network has the potential to automate or assist many processes involved in the development of smart cities. Example applications include parking slot management and fee collection, criminal car tracking, parking anti-theft, and contactless road violator apprehension. Vehicle counting algorithms are commonly built on fully supervised Convolutional Neural Networks (CNNs). However, deploying these systems still requires vast amounts of manual data annotation for practically every camera added to the network. As a result, expanding the network to cover a wide area is slow and expensive. This study proposes a method that integrates innovations from the recently emerging field of semi-supervised learning (SSL) into an existing workflow to alleviate this issue. By exploiting the core advantage of the SSL paradigm, the proposed approach can significantly reduce time and labor costs by greatly reducing the amount of manually annotated data needed, paving the way for more commercially viable use of vehicle counting technologies. Using a separate neural network based on CycleGAN, the training dataset for the existing workflow can be augmented in a new way, with a synthesized “training dataset” generated from the already available datasets of previously deployed cameras. The approach is tested by measuring the change in the mean average precision (mAP) of the Detectron2 core of the vehicle counting network after the synthetic dataset is added to its training pool. Evaluated on the CATCH-ALL vehicle detection dataset, the proposed method improved object detection performance from an mAP of 71.644 to 77.523. This improvement was achieved even though both runs started from COCO pre-trained weights and included classical augmentations such as random flipping and shortest-edge resizing.
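The abstract describes adding a synthetic dataset to the detector's training pool. The record does not say how the annotations are stored, so the following is only a minimal sketch that assumes COCO-style annotation dictionaries (a format Detectron2 can load). The function name `merge_coco_pools`, the file names, and the single "vehicle" category are hypothetical and are not taken from the thesis.

```python
from copy import deepcopy


def merge_coco_pools(real, synthetic):
    """Merge two COCO-style annotation dicts into one training pool.

    Ids from `synthetic` are shifted past the largest id used in `real`
    so that image and annotation ids stay unique. Both inputs are assumed
    to use non-negative integer ids and the same category list.
    """
    if real["categories"] != synthetic["categories"]:
        raise ValueError("both pools must share the same category list")

    merged = deepcopy(real)  # leave the caller's data untouched
    image_offset = max((img["id"] for img in real["images"]), default=-1) + 1
    ann_offset = max((ann["id"] for ann in real["annotations"]), default=-1) + 1

    for img in synthetic["images"]:
        merged["images"].append(dict(img, id=img["id"] + image_offset))

    for ann in synthetic["annotations"]:
        merged["annotations"].append(
            dict(
                ann,
                id=ann["id"] + ann_offset,
                image_id=ann["image_id"] + image_offset,
            )
        )
    return merged


# Toy data (hypothetical): one manually annotated image and one synthetic image.
categories = [{"id": 1, "name": "vehicle"}]
real = {
    "images": [{"id": 1, "file_name": "camera_a_0001.jpg"}],
    "annotations": [{"id": 1, "image_id": 1, "category_id": 1, "bbox": [10, 20, 50, 40]}],
    "categories": categories,
}
synthetic = {
    "images": [{"id": 1, "file_name": "synthetic_0001.jpg"}],
    "annotations": [{"id": 1, "image_id": 1, "category_id": 1, "bbox": [12, 22, 48, 38]}],
    "categories": categories,
}

merged = merge_coco_pools(real, synthetic)
print(len(merged["images"]), len(merged["annotations"]))
```

Shifting the ids rather than renumbering everything keeps the original annotations unchanged, which makes it easy to compare a run trained on the original pool with one trained on the merged pool.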