Smart object counter

With the world’s urban population drastically increasing during the past decades, the over-crowded city suggests need for effective measures in areas such as crowd control, surveillance, and dynamic traffic planning. This project on object counting focuses on crowd counting and vehicle counting. The...

Full description

Saved in:
Bibliographic Details
Main Author: Kang, Xinhui
Other Authors: Loke Yuan Ren
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/156770
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-156770
record_format dspace
spelling sg-ntu-dr.10356-1567702022-04-23T12:09:49Z Smart object counter Kang, Xinhui Loke Yuan Ren School of Computer Science and Engineering yrloke@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision With the world’s urban population drastically increasing during the past decades, the over-crowded city suggests need for effective measures in areas such as crowd control, surveillance, and dynamic traffic planning. This project on object counting focuses on crowd counting and vehicle counting. The use case is also extended to microscopic cell counting task, assisting medical and biological research, and increasing throughput. This project aims to deploy density-estimation-based approach on convoluted neural network to develop efficient, accurate and robust deep learning model for counting task in highly congested scenes. The datasets ShanghaiTech, TRaffic ANd COngestionS (TRANCOS) dataset and P. vivax Malaria dataset are used for training and testing. This project assessed the effectiveness of Inception_v4 and Inception-Resnet-v1 building blocks. The Inception modules consists of different kernel sizes and can extract multi-scale information, and the skip connection in the ResNet design can alleviate the gradient varnishing issue. The combination of the two allows the model to be both wider and deeper, and hence able to recognise more complex features. This report reviews the existing works on object counting, especially the ones that utilised Inception, ResNet and their variants. It is found that the previous works either implemented structures similar to Inception, used older versions of Inception, combined Inception with other networks, and/ or produce inferior results. Main contributions: This project utilised the Inception_v4 and Inception-Resnet-v1 building blocks proposed 3 new models, which are trained, tested and proven to be robust across different use cases in crowd counting, vehicle counting and microscopic cell counting. The models exhibit low error rate and fast convergence and can be trained with limited computational resources. The lowest mean absolute error (MAE) achieved is 7.8 for crowd, 1.5 for vehicle and 2.7 for cell. The model robustness is also tested using scenes outside the training dataset (for instance, Orchard Road pedestrians, NTU North Spine canteen and LTA live traffic). In addition, this report also covers other techniques used: density map generation using Gaussian kernel and the application of novel curriculum loss function. To test the usability of the model trained, a demonstrative web application using Flask is developed to retrieve live LTA traffic photos in every 60 seconds from the API by Data.gov.sg. The number of vehicles is estimated and displayed real-time. It takes 3-4 seconds to generate model output for 1 image, which is more than sufficient for the 60 second interval. Bachelor of Engineering (Computer Science) 2022-04-23T12:09:04Z 2022-04-23T12:09:04Z 2022 Final Year Project (FYP) Kang, X. (2022). Smart object counter. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156770 https://hdl.handle.net/10356/156770 en SCSE21-0244 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Kang, Xinhui
Smart object counter
description With the world’s urban population drastically increasing during the past decades, the over-crowded city suggests need for effective measures in areas such as crowd control, surveillance, and dynamic traffic planning. This project on object counting focuses on crowd counting and vehicle counting. The use case is also extended to microscopic cell counting task, assisting medical and biological research, and increasing throughput. This project aims to deploy density-estimation-based approach on convoluted neural network to develop efficient, accurate and robust deep learning model for counting task in highly congested scenes. The datasets ShanghaiTech, TRaffic ANd COngestionS (TRANCOS) dataset and P. vivax Malaria dataset are used for training and testing. This project assessed the effectiveness of Inception_v4 and Inception-Resnet-v1 building blocks. The Inception modules consists of different kernel sizes and can extract multi-scale information, and the skip connection in the ResNet design can alleviate the gradient varnishing issue. The combination of the two allows the model to be both wider and deeper, and hence able to recognise more complex features. This report reviews the existing works on object counting, especially the ones that utilised Inception, ResNet and their variants. It is found that the previous works either implemented structures similar to Inception, used older versions of Inception, combined Inception with other networks, and/ or produce inferior results. Main contributions: This project utilised the Inception_v4 and Inception-Resnet-v1 building blocks proposed 3 new models, which are trained, tested and proven to be robust across different use cases in crowd counting, vehicle counting and microscopic cell counting. The models exhibit low error rate and fast convergence and can be trained with limited computational resources. The lowest mean absolute error (MAE) achieved is 7.8 for crowd, 1.5 for vehicle and 2.7 for cell. The model robustness is also tested using scenes outside the training dataset (for instance, Orchard Road pedestrians, NTU North Spine canteen and LTA live traffic). In addition, this report also covers other techniques used: density map generation using Gaussian kernel and the application of novel curriculum loss function. To test the usability of the model trained, a demonstrative web application using Flask is developed to retrieve live LTA traffic photos in every 60 seconds from the API by Data.gov.sg. The number of vehicles is estimated and displayed real-time. It takes 3-4 seconds to generate model output for 1 image, which is more than sufficient for the 60 second interval.
author2 Loke Yuan Ren
author_facet Loke Yuan Ren
Kang, Xinhui
format Final Year Project
author Kang, Xinhui
author_sort Kang, Xinhui
title Smart object counter
title_short Smart object counter
title_full Smart object counter
title_fullStr Smart object counter
title_full_unstemmed Smart object counter
title_sort smart object counter
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/156770
_version_ 1731235753883598848