Crowd estimation in images
Estimating the size of large, distant crowds in images taken from high-mounted cameras is widely regarded as a difficult task in computer vision. This project aims to investigate how various texture analysis methods and computer vision image processing techniques could be done to...
Main Author: | See, Chi Chao |
---|---|
Other Authors: | Cham Tat Jen |
Format: | Final Year Project |
Language: | English |
Published: | 2014 |
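Subjects: | DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision |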
Online Access: | http://hdl.handle.net/10356/58955 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-58955 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-58955 2023-03-03T20:43:27Z Crowd estimation in images See, Chi Chao Cham Tat Jen School of Computer Engineering Centre for Multimedia and Network Technology DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Bachelor of Engineering (Computer Science) 2014-04-17T03:14:19Z 2014-04-17T03:14:19Z 2014 2014 Final Year Project (FYP) http://hdl.handle.net/10356/58955 en Nanyang Technological University 50 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision See, Chi Chao Crowd estimation in images |
description |
Estimating the size of large, distant crowds in images taken from high-mounted cameras is widely regarded as a difficult task in computer vision.
This project investigates how texture analysis methods and image processing techniques can be applied to the problem.
A multi-output regression model for crowd counting was implemented to reduce the complexity found in global and local regression models. A global model uses a single regression function to count the whole image, whereas a local model uses multiple regression functions to count localised spatial regions and becomes hard to scale as the model expands. The proposed regression model combines both approaches: it is a single model that can be trained on multiple images and produces multiple structured outputs.
The implementation involves selecting several low-level image features, such as segmentation, edge and texture features, which are inter-independent and important for density estimation. These features can be used to produce a heat map that shows their weighted contribution and information sharing, both for whole images and for cells in partitioned images.
The ground truth was produced by rigorous manual dot annotation and forms the basis of comparison between the actual and expected counts in the training and testing sets.
Several statistical evaluation metrics were computed to test the bias and variance of the estimator model. These metrics indicate how effectively the model estimates crowd sizes.
In future work, dynamic crowd structure segmentation could be implemented to improve the accuracy of crowd counting models, and other methods such as support vector machines and the Maximum Excess over SubArrays (MESA) function could be used to extend the idea of crowd counting. |
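The abstract mentions dot-annotated ground truth, cells in partitioned images, and metrics for the bias and variance of the estimator, but gives no formulas. The sketch below is illustrative only and is not the project's implementation: it assumes a uniform grid of cells, and it assumes that bias means the mean signed error and variance means the population variance of the errors. The function names `cell_counts` and `evaluate` are hypothetical, and the regression model itself is not shown.

```python
from statistics import mean, pvariance


def cell_counts(dots, width, height, rows, cols):
    """Turn dot annotations into per-cell ground-truth counts.

    dots: iterable of (x, y) pixel coordinates, one per annotated person.
    The uniform rows x cols grid is an assumption; the record does not
    say how the images were partitioned.
    """
    counts = [[0] * cols for _ in range(rows)]
    for x, y in dots:
        if not (0 <= x < width and 0 <= y < height):
            continue  # skip annotations that fall outside the image
        r = min(int(y * rows / height), rows - 1)
        c = min(int(x * cols / width), cols - 1)
        counts[r][c] += 1
    return counts


def evaluate(actual, predicted):
    """Compare actual and predicted counts (assumed metric definitions)."""
    errors = [p - a for a, p in zip(actual, predicted)]
    return {
        "mae": mean([abs(e) for e in errors]),   # mean absolute error
        "mse": mean([e * e for e in errors]),    # mean squared error
        "bias": mean(errors),                    # mean signed error
        "variance": pvariance(errors),           # spread of the errors
    }


if __name__ == "__main__":
    # Hypothetical annotations on a 100 x 100 image split into 2 x 2 cells.
    dots = [(10, 10), (90, 10), (95, 15), (50, 80), (200, 5)]
    truth = cell_counts(dots, width=100, height=100, rows=2, cols=2)
    flat_truth = [n for row in truth for n in row]
    # Made-up predictions standing in for the output of a regression model.
    print(truth, evaluate(flat_truth, [2, 2, 1, 0]))
```

With these definitions the mean squared error equals the squared bias plus the variance, which is one common way to read "bias and variance of the estimator"; the metrics used in the actual report may differ.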
author2 |
Cham Tat Jen |
author_facet |
Cham Tat Jen See, Chi Chao |
format |
Final Year Project |
author |
See, Chi Chao |
author_sort |
See, Chi Chao |
title |
Crowd estimation in images |
title_short |
Crowd estimation in images |
title_full |
Crowd estimation in images |
title_fullStr |
Crowd estimation in images |
title_full_unstemmed |
Crowd estimation in images |
title_sort |
crowd estimation in images |
publishDate |
2014 |
url |
http://hdl.handle.net/10356/58955 |
_version_ |
1759854048074268672 |