Synthesis of annotated images as dataset for vehicle counting neural networks using semi-supervised learning

An intelligent vehicle counting camera network has the potential to provide automation, aid, or both, for many processes involved in the development of smart cities. Some example applications include parking slot management and fee collection, criminal car tracking, parking anti-theft, contactless r...

Full description

Saved in:

Bibliographic Details
Main Author:	Chan, Patrick Matthew J.
Format:	text
Language:	English
Published:	Animo Repository 2022
Subjects:	Computer vision Vehicle detectors Supervised learning (Machine learning) Neural networks (Computer science)
Online Access:	https://animorepository.dlsu.edu.ph/etdm_ece/19 https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1023&context=etdm_ece
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	De La Salle University
Language:	English

id	oai:animorepository.dlsu.edu.ph:etdm_ece-1023
record_format	eprints
spelling	oai:animorepository.dlsu.edu.ph:etdm_ece-10232023-01-06T00:44:33Z Synthesis of annotated images as dataset for vehicle counting neural networks using semi-supervised learning Chan, Patrick Matthew J. An intelligent vehicle counting camera network has the potential to provide automation, aid, or both, for many processes involved in the development of smart cities. Some example applications include parking slot management and fee collection, criminal car tracking, parking anti-theft, contactless road violator apprehension, etc. The commonly used approach for vehicle counting algorithms is through fully supervised Convolutional Neural Networks (CNN). However, deploying these systems still requires vast amounts of manual data annotation for practically every single camera added to the network. As a result, expanding this intelligent network to be able to cover a wide area ends up being a slow and expensive process. This study proposes a method of integrating innovations in the recently emerging field of semi-supervised learning (SSL), into alleviating this issue for an existing workflow. Due to the core advantage of the SSL paradigm, the proposed approach can significantly reduce time and labor costs by greatly reducing the amount of manually annotated data needed; thus, paving the way for a more commercially viable usage of vehicle counting-based technologies. Using a separate neural network based on CycleGAN, the size of the training dataset for the existing workflow can be augmented in a new way, via a synthesized “training dataset” generated from the already available datasets of previously deployed cameras. Here, the approach is tested by checking the changes in the mean average precision (mAP) values of the detectron2 core of the vehicle counting network, after the addition of the synthetic dataset to its training pool. Upon evaluation on the CATCH-ALL vehicle detection dataset, the proposed method provided an improved object detection performance from an mAP of 71.644 to 77.523. This improvement was achieved, despite both runs starting from COCO pre-trained weights, and including classical augmentation approaches like random flipping and shortest edge resizing. 2022-12-01T08:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etdm_ece/19 https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1023&context=etdm_ece Electronics And Communications Engineering Master's Theses English Animo Repository Computer vision Vehicle detectors Supervised learning (Machine learning) Neural networks (Computer science)
institution	De La Salle University
building	De La Salle University Library
continent	Asia
country	Philippines Philippines
content_provider	De La Salle University Library
collection	DLSU Institutional Repository
language	English
topic	Computer vision Vehicle detectors Supervised learning (Machine learning) Neural networks (Computer science)
spellingShingle	Computer vision Vehicle detectors Supervised learning (Machine learning) Neural networks (Computer science) Chan, Patrick Matthew J. Synthesis of annotated images as dataset for vehicle counting neural networks using semi-supervised learning
description	An intelligent vehicle counting camera network has the potential to provide automation, aid, or both, for many processes involved in the development of smart cities. Some example applications include parking slot management and fee collection, criminal car tracking, parking anti-theft, contactless road violator apprehension, etc. The commonly used approach for vehicle counting algorithms is through fully supervised Convolutional Neural Networks (CNN). However, deploying these systems still requires vast amounts of manual data annotation for practically every single camera added to the network. As a result, expanding this intelligent network to be able to cover a wide area ends up being a slow and expensive process. This study proposes a method of integrating innovations in the recently emerging field of semi-supervised learning (SSL), into alleviating this issue for an existing workflow. Due to the core advantage of the SSL paradigm, the proposed approach can significantly reduce time and labor costs by greatly reducing the amount of manually annotated data needed; thus, paving the way for a more commercially viable usage of vehicle counting-based technologies. Using a separate neural network based on CycleGAN, the size of the training dataset for the existing workflow can be augmented in a new way, via a synthesized “training dataset” generated from the already available datasets of previously deployed cameras. Here, the approach is tested by checking the changes in the mean average precision (mAP) values of the detectron2 core of the vehicle counting network, after the addition of the synthetic dataset to its training pool. Upon evaluation on the CATCH-ALL vehicle detection dataset, the proposed method provided an improved object detection performance from an mAP of 71.644 to 77.523. This improvement was achieved, despite both runs starting from COCO pre-trained weights, and including classical augmentation approaches like random flipping and shortest edge resizing.
format	text
author	Chan, Patrick Matthew J.
author_facet	Chan, Patrick Matthew J.
author_sort	Chan, Patrick Matthew J.
title	Synthesis of annotated images as dataset for vehicle counting neural networks using semi-supervised learning
title_short	Synthesis of annotated images as dataset for vehicle counting neural networks using semi-supervised learning
title_full	Synthesis of annotated images as dataset for vehicle counting neural networks using semi-supervised learning
title_fullStr	Synthesis of annotated images as dataset for vehicle counting neural networks using semi-supervised learning
title_full_unstemmed	Synthesis of annotated images as dataset for vehicle counting neural networks using semi-supervised learning
title_sort	synthesis of annotated images as dataset for vehicle counting neural networks using semi-supervised learning
publisher	Animo Repository
publishDate	2022
url	https://animorepository.dlsu.edu.ph/etdm_ece/19 https://animorepository.dlsu.edu.ph/cgi/viewcontent.cgi?article=1023&context=etdm_ece
_version_	1754713718425387008

Synthesis of annotated images as dataset for vehicle counting neural networks using semi-supervised learning

Similar Items