Video surveillance system for human detection
The problem I need to solve is the pedestrian detection in campus monitor. Computer vision studies based on deep learning algorithm provide relatively precise result for frame and video detection. Among all kinks of deep learning frameworks and algorithms, Caffe and Faster R-CNN perform outstandingl...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2017
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/72557 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-72557 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-725572023-07-04T15:48:35Z Video surveillance system for human detection Xu, Wanxin Chau Lap Pui School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering The problem I need to solve is the pedestrian detection in campus monitor. Computer vision studies based on deep learning algorithm provide relatively precise result for frame and video detection. Among all kinks of deep learning frameworks and algorithms, Caffe and Faster R-CNN perform outstandingly in both detection speed and accuracy rate. In this dissertation, Caffe and Faster R-CNN are applied on both CPU and GPU to detect the people in campus station video based on VGG network and ZF network. To visualized display the detection process and result, I created an interface using python. In the interface, functions of video selection, algorithm detection, result displaying are integrated. After the test of algorithm, the mean average precision for people detection is around 0.76. In the detection of campus monitor, most of complete pedestrians with proper size can be detected. The original request of the dissertation is satisfied. To further improve the performance of the algorithm, the network is trained using VOC 2007, VOC2012, VOC 2007+VOC 2012 and the superposition of other labeled pedestrian datasets. After the enhance training, the number of detected objects in campus monitor video increased. These results proved that increasing training samples of a specific class contributes to the performance of Faster R-CNN algorithm. Master of Science (Communications Engineering) 2017-08-28T11:41:36Z 2017-08-28T11:41:36Z 2017 Thesis http://hdl.handle.net/10356/72557 en 71 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering Xu, Wanxin Video surveillance system for human detection |
description |
The problem I need to solve is the pedestrian detection in campus monitor. Computer vision studies based on deep learning algorithm provide relatively precise result for frame and video detection. Among all kinks of deep learning frameworks and algorithms, Caffe and Faster R-CNN perform outstandingly in both detection speed and accuracy rate. In this dissertation, Caffe and Faster R-CNN are applied on both CPU and GPU to detect the people in campus station video based on VGG network and ZF network. To visualized display the detection process and result, I created an interface using python. In the interface, functions of video selection, algorithm detection, result displaying are integrated. After the test of algorithm, the mean average precision for people detection is around 0.76. In the detection of campus monitor, most of complete pedestrians with proper size can be detected. The original request of the dissertation is satisfied. To further improve the performance of the algorithm, the network is trained using VOC 2007, VOC2012, VOC 2007+VOC 2012 and the superposition of other labeled pedestrian datasets. After the enhance training, the number of detected objects in campus monitor video increased. These results proved that increasing training samples of a specific class contributes to the performance of Faster R-CNN algorithm. |
author2 |
Chau Lap Pui |
author_facet |
Chau Lap Pui Xu, Wanxin |
format |
Theses and Dissertations |
author |
Xu, Wanxin |
author_sort |
Xu, Wanxin |
title |
Video surveillance system for human detection |
title_short |
Video surveillance system for human detection |
title_full |
Video surveillance system for human detection |
title_fullStr |
Video surveillance system for human detection |
title_full_unstemmed |
Video surveillance system for human detection |
title_sort |
video surveillance system for human detection |
publishDate |
2017 |
url |
http://hdl.handle.net/10356/72557 |
_version_ |
1772827182225162240 |