Crowd estimation in images

Estimating the size of large, distant crowds in images taken from high-mounted cameras is widely regarded as a difficult task in computer vision. This project investigates how various texture analysis methods and image processing techniques could be applied to...

Bibliographic Details
Main Author: See, Chi Chao
Other Authors: Cham Tat Jen
Format: Final Year Project
Language: English
Published: 2014
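Subjects: DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision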
Online Access: http://hdl.handle.net/10356/58955
Institution: Nanyang Technological University
id sg-ntu-dr.10356-58955
record_format dspace
spelling sg-ntu-dr.10356-58955 2023-03-03T20:43:27Z Crowd estimation in images See, Chi Chao Cham Tat Jen School of Computer Engineering Centre for Multimedia and Network Technology DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Bachelor of Engineering (Computer Science) 2014-04-17T03:14:19Z 2014-04-17T03:14:19Z 2014 2014 Final Year Project (FYP) http://hdl.handle.net/10356/58955 en Nanyang Technological University 50 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
See, Chi Chao
Crowd estimation in images
description Estimating the size of large, distant crowds in images taken from high-mounted cameras is widely regarded as a difficult task in computer vision. This project investigates how various texture analysis methods and image processing techniques can be applied to the problem. A multi-output regression model for crowd counting was implemented to reduce the complexity found in global and local regression models. A global model uses a single regression function for counting, whereas a local model uses multiple regression functions, each counting within a localised spatial region, and becomes hard to scale as the model expands. The proposed regression model combines both approaches: a single model that can be trained on multiple images and produces multiple structured outputs. The implementation involves selecting several low-level image features, such as segmentation, edge and texture features, which are interdependent and important for density estimation. These features can be used to produce a heat map that shows their weighted contribution and information sharing, both over whole images and over cells in partitioned images. Ground truth was produced by rigorous manual dot annotation and forms the basis for comparing the actual and expected counts in the training and testing sets. Several statistical evaluation metrics were measured to assess the bias and variance of the estimator model; these metrics indicate how effectively it estimates crowd sizes. In future work, dynamic crowd structure segmentation could be implemented to improve the accuracy of crowd counting models, and other methods such as support vector machines and the Maximum Excess over SubArrays (MESA) function could be explored to extend the idea of crowd counting.
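The description mentions dot-annotated ground truth, cells in partitioned images, and metrics for the bias and variance of the estimator. The following is a minimal Python sketch of those two steps only, not the project's implementation. The function names `cell_counts` and `evaluate`, the uniform grid layout, and the choice of mean error, mean absolute error and mean squared error are all assumptions made for illustration; the record does not state which metrics or partitioning scheme were used.

```python
from statistics import mean


def cell_counts(dots, width, height, rows, cols):
    """Count dot annotations per cell of a uniform rows x cols grid.

    `dots` is a list of (x, y) pixel coordinates, one per annotated person.
    The uniform grid is an assumption; the record does not describe the
    partitioning actually used.
    """
    grid = [[0] * cols for _ in range(rows)]
    for x, y in dots:
        # Clamp so that points on the right or bottom border fall in the last cell.
        c = min(int(x * cols / width), cols - 1)
        r = min(int(y * rows / height), rows - 1)
        grid[r][c] += 1
    return grid


def evaluate(actual, predicted):
    """Compare actual and predicted counts (hypothetical metric choice)."""
    errors = [p - a for a, p in zip(actual, predicted)]
    return {
        "bias": mean(errors),                    # mean signed error
        "mae": mean([abs(e) for e in errors]),   # mean absolute error
        "mse": mean([e * e for e in errors]),    # mean squared error
    }


# Four annotated dots in a 100 x 100 image split into a 2 x 2 grid.
grid = cell_counts([(10, 10), (15, 20), (90, 10), (50, 80)], 100, 100, 2, 2)

# Made-up counts, used only to exercise the metric function.
scores = evaluate([10, 20, 30], [12, 18, 33])
```

Summing the per-cell counts recovers the whole-image count, which is one way a single multi-output model can serve both the global and the localised counting task.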
author2 Cham Tat Jen
author_facet Cham Tat Jen
See, Chi Chao
format Final Year Project
author See, Chi Chao
author_sort See, Chi Chao
title Crowd estimation in images
title_short Crowd estimation in images
title_full Crowd estimation in images
title_fullStr Crowd estimation in images
title_full_unstemmed Crowd estimation in images
title_sort crowd estimation in images
publishDate 2014
url http://hdl.handle.net/10356/58955
_version_ 1759854048074268672