A machine learning method for pedestrian detection

Pedestrian detection has always been a challenging task of computer vision research for many decades. This dissertation presents a system that realizes the pedestrian detection in the surveillance video based on Convolutional Neural Network and video processing. The detection performance is tested o...

Full description

Saved in:
Bibliographic Details
Main Author: Wang, Yi
Other Authors: Chau Lap Pui
Format: Theses and Dissertations
Language:English
Published: 2018
Subjects:
Online Access:http://hdl.handle.net/10356/75956
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-75956
record_format dspace
spelling sg-ntu-dr.10356-759562023-07-04T15:56:03Z A machine learning method for pedestrian detection Wang, Yi Chau Lap Pui School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Pedestrian detection has always been a challenging task of computer vision research for many decades. This dissertation presents a system that realizes the pedestrian detection in the surveillance video based on Convolutional Neural Network and video processing. The detection performance is tested on the VOC2007 testing dataset. Comparing with other scenes, pedestrian detection at a bus stop often involves small target (pedestrian) size and a high degree of occlusion. To address these issues, Single Shot MultiBox Detector (SSD) algorithm is proposed recently. I use SSD algorithm to design the pedestrian detection system in a bus stop environment. In this dissertation, we employ ResNet50 to extract features, rather than VGG16 that was used in the SSD paper. This method trained by the VOC2007+2012 training dataset improves the mean average precision (mAP) from 79.7 to 79.9 on VOC2007 testing dataset. Moreover, by training the network with COCO datasets, we can achieve the best results with mAP of 85.1 compared with Fast R-CCN, Faster R-CNN, and original SSD. Apart from using machine learning algorithms for this dissertation, some of the other works involves video processing. The system uses the surveillance camera at a campus bus stop to collect the videos. Then, the system detects the pedestrian in the region of interest (ROI) of each frame and divides them into two groups according to their positions. One is “wait” which is the pedestrian who is waiting at the bus stop; the other is “cross” which is the pedestrian who is crossing the road. Finally, the system marks the total number of pedestrians in the ROI. After the system has finished detecting a single frame of the surveillance video, it will automatically read and detect the next frame. To achieve higher detection accuracy and faster speed, some techniques such as image cropping of ROI and queue or multi-threading structure are implemented. Master of Science (Signal Processing) 2018-09-10T08:31:31Z 2018-09-10T08:31:31Z 2018 Thesis http://hdl.handle.net/10356/75956 en 62 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Wang, Yi
A machine learning method for pedestrian detection
description Pedestrian detection has always been a challenging task of computer vision research for many decades. This dissertation presents a system that realizes the pedestrian detection in the surveillance video based on Convolutional Neural Network and video processing. The detection performance is tested on the VOC2007 testing dataset. Comparing with other scenes, pedestrian detection at a bus stop often involves small target (pedestrian) size and a high degree of occlusion. To address these issues, Single Shot MultiBox Detector (SSD) algorithm is proposed recently. I use SSD algorithm to design the pedestrian detection system in a bus stop environment. In this dissertation, we employ ResNet50 to extract features, rather than VGG16 that was used in the SSD paper. This method trained by the VOC2007+2012 training dataset improves the mean average precision (mAP) from 79.7 to 79.9 on VOC2007 testing dataset. Moreover, by training the network with COCO datasets, we can achieve the best results with mAP of 85.1 compared with Fast R-CCN, Faster R-CNN, and original SSD. Apart from using machine learning algorithms for this dissertation, some of the other works involves video processing. The system uses the surveillance camera at a campus bus stop to collect the videos. Then, the system detects the pedestrian in the region of interest (ROI) of each frame and divides them into two groups according to their positions. One is “wait” which is the pedestrian who is waiting at the bus stop; the other is “cross” which is the pedestrian who is crossing the road. Finally, the system marks the total number of pedestrians in the ROI. After the system has finished detecting a single frame of the surveillance video, it will automatically read and detect the next frame. To achieve higher detection accuracy and faster speed, some techniques such as image cropping of ROI and queue or multi-threading structure are implemented.
author2 Chau Lap Pui
author_facet Chau Lap Pui
Wang, Yi
format Theses and Dissertations
author Wang, Yi
author_sort Wang, Yi
title A machine learning method for pedestrian detection
title_short A machine learning method for pedestrian detection
title_full A machine learning method for pedestrian detection
title_fullStr A machine learning method for pedestrian detection
title_full_unstemmed A machine learning method for pedestrian detection
title_sort machine learning method for pedestrian detection
publishDate 2018
url http://hdl.handle.net/10356/75956
_version_ 1772828053025587200