Pedestrian detection using faster-RCNN

Pedestrian detection has been an active research topic for some time now. Before the advent of neural networks and deep learning, we had a relatively simpler task in hand of extracting the features and classifying the object based on it’s features. The Histogram of oriented Gradients (HoG)[1] mod...

Full description

Saved in:

Bibliographic Details
Main Author:	Munshi Harsh Hemangkumar
Other Authors:	Justin Dauwels
Format:	Theses and Dissertations
Language:	English
Published:	2017
Subjects:	DRNTU::Engineering::Electrical and electronic engineering
Online Access:	http://hdl.handle.net/10356/69526
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-69526
record_format	dspace
spelling	sg-ntu-dr.10356-695262023-07-04T15:49:04Z Pedestrian detection using faster-RCNN Munshi Harsh Hemangkumar Justin Dauwels School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Pedestrian detection has been an active research topic for some time now. Before the advent of neural networks and deep learning, we had a relatively simpler task in hand of extracting the features and classifying the object based on it’s features. The Histogram of oriented Gradients (HoG)[1] model was the first strong proposed theory which surpassed the performance of it competitiors. Following to the HoG model there were a lot of variants of HoG like HoGlbp[2], MultiFtr[3] which tried to boost the accuracy of the existing HoG+SVM[1] model(s). On the other end of spectrum the object detection algorithm, region-based convolutional neural network (RCNN), is very popular in recent years. It boosts the performance significantly by making a combination of two key insights. The first one is to localize and segment objects by applying high-capacity convolutional neural network to bottom-up region proposals. We try to train a model using our own variant of a deep architecture, using the open source implementation of faster-RCNN[4] using the existing datasets. In this thesis, we present a custom caffe model which is inspired from ZF Neural Network and set it up for faster RCNN object detection scheme. We train the system in two different ways, one with only pedestrian images and other with multiple classes. We then test the system with custom built test images with annotations and observe the performance and compare it. Finally, in the multiclass approach, with the help of deep visualizations we observe the learnt detector and discuss how can we use it in the future work section of this thesis. The average precision of pedestrian only model was found to be ~81% and that of multiclass detector was found to be ~70%. The result are discussed in the results and experiments section with details on train time, test time and accuracies. Master of Science (Computer Control and Automation) 2017-02-02T06:11:54Z 2017-02-02T06:11:54Z 2017 Thesis http://hdl.handle.net/10356/69526 en 61 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Electrical and electronic engineering
spellingShingle	DRNTU::Engineering::Electrical and electronic engineering Munshi Harsh Hemangkumar Pedestrian detection using faster-RCNN
description	Pedestrian detection has been an active research topic for some time now. Before the advent of neural networks and deep learning, we had a relatively simpler task in hand of extracting the features and classifying the object based on it’s features. The Histogram of oriented Gradients (HoG)[1] model was the first strong proposed theory which surpassed the performance of it competitiors. Following to the HoG model there were a lot of variants of HoG like HoGlbp[2], MultiFtr[3] which tried to boost the accuracy of the existing HoG+SVM[1] model(s). On the other end of spectrum the object detection algorithm, region-based convolutional neural network (RCNN), is very popular in recent years. It boosts the performance significantly by making a combination of two key insights. The first one is to localize and segment objects by applying high-capacity convolutional neural network to bottom-up region proposals. We try to train a model using our own variant of a deep architecture, using the open source implementation of faster-RCNN[4] using the existing datasets. In this thesis, we present a custom caffe model which is inspired from ZF Neural Network and set it up for faster RCNN object detection scheme. We train the system in two different ways, one with only pedestrian images and other with multiple classes. We then test the system with custom built test images with annotations and observe the performance and compare it. Finally, in the multiclass approach, with the help of deep visualizations we observe the learnt detector and discuss how can we use it in the future work section of this thesis. The average precision of pedestrian only model was found to be ~81% and that of multiclass detector was found to be ~70%. The result are discussed in the results and experiments section with details on train time, test time and accuracies.
author2	Justin Dauwels
author_facet	Justin Dauwels Munshi Harsh Hemangkumar
format	Theses and Dissertations
author	Munshi Harsh Hemangkumar
author_sort	Munshi Harsh Hemangkumar
title	Pedestrian detection using faster-RCNN
title_short	Pedestrian detection using faster-RCNN
title_full	Pedestrian detection using faster-RCNN
title_fullStr	Pedestrian detection using faster-RCNN
title_full_unstemmed	Pedestrian detection using faster-RCNN
title_sort	pedestrian detection using faster-rcnn
publishDate	2017
url	http://hdl.handle.net/10356/69526
_version_	1772828442501316608

Pedestrian detection using faster-RCNN

Similar Items