Person detection and tracking from aerial videos

In this study, a system that can detect and locate humans in video recordings collected by an unmanned aerial vehicle (UAV) was developed. Several machine vision algorithms were implemented. Lucas and Kanade optic flow computation and YUV color space conversion were used for feature point selection,...

Full description

Saved in:
Bibliographic Details
Main Authors: Garcia, Alghie Marie B., Rufino, Ma. Andrea V., Sangalang, Louie Carlo A., Teodoro, John Arcy R.
Format: text
Language:English
Published: Animo Repository 2014
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etd_bachelors/11042
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etd_bachelors-11687
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etd_bachelors-116872022-02-28T06:21:42Z Person detection and tracking from aerial videos Garcia, Alghie Marie B. Rufino, Ma. Andrea V. Sangalang, Louie Carlo A. Teodoro, John Arcy R. In this study, a system that can detect and locate humans in video recordings collected by an unmanned aerial vehicle (UAV) was developed. Several machine vision algorithms were implemented. Lucas and Kanade optic flow computation and YUV color space conversion were used for feature point selection, and Mean-Shift clustering in feature space for image segmentation. Histogram of Oriented Gradients (HOG), Haar, and Speeded Up Robust Features (SURF) were used for feature extraction, while Support Vector Machines (SVM) and Adaptive Boosting (AdaBoost) were utilized for training and classification. Kalman filter was employed for tracking humans. The Person Detection and Tracking from Aerial Videos (PDTrAV system was tested on videos taken under different weather conditions and was able to successfully detect and tract people in them. System tests, however, indicated more false positives and false negatives than true positives. A minimum threshold for the ratio of the blob area and its bounding box area was enforced in order to reduce the number of false positives, which were attributed to diagonal lines or edges that were detected as people. The use of Haar-like features attained the best recall at 32.3925% while the use of HOG features attained the best precision at 24.5945% While improvements can be done to make the system suitable for any real world application, this study has proven that it is possible to detect humans from an aerial perspective using fewer and more cost effective resources. The system was able to detect humans in aerial videos taken using an ordinary digital camera. Further improvements on the Mean-Shift clustering, Kalman filter, and the feature extraction methods are recommended for better performance of the system. For the Mean-Shift Algorithm, bandwidth estimators can be used since the system only used a constant bandwidth input. Since the Kalman filter used in the system can only estimate constant velocity and location of the blobs, adaptive Kalman filter can be used to have more accurate estimates when the velocity of the blobs or of the camera becomes time-varying. For HOG and Haar feature extraction, a larger and more unique dataset should be used to have more features to compare with. For SURF extraction, the size of the dataset set, the number of clusters in the visual codebook, the number of octaves and scale levels, and the threshold value should be varied in order to determine the most suitable values for a certain application. 2014-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/11042 Bachelor's Theses English Animo Repository Drone aircraft in remote sensing Aerial videography Computer Sciences
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
topic Drone aircraft in remote sensing
Aerial videography
Computer Sciences
spellingShingle Drone aircraft in remote sensing
Aerial videography
Computer Sciences
Garcia, Alghie Marie B.
Rufino, Ma. Andrea V.
Sangalang, Louie Carlo A.
Teodoro, John Arcy R.
Person detection and tracking from aerial videos
description In this study, a system that can detect and locate humans in video recordings collected by an unmanned aerial vehicle (UAV) was developed. Several machine vision algorithms were implemented. Lucas and Kanade optic flow computation and YUV color space conversion were used for feature point selection, and Mean-Shift clustering in feature space for image segmentation. Histogram of Oriented Gradients (HOG), Haar, and Speeded Up Robust Features (SURF) were used for feature extraction, while Support Vector Machines (SVM) and Adaptive Boosting (AdaBoost) were utilized for training and classification. Kalman filter was employed for tracking humans. The Person Detection and Tracking from Aerial Videos (PDTrAV system was tested on videos taken under different weather conditions and was able to successfully detect and tract people in them. System tests, however, indicated more false positives and false negatives than true positives. A minimum threshold for the ratio of the blob area and its bounding box area was enforced in order to reduce the number of false positives, which were attributed to diagonal lines or edges that were detected as people. The use of Haar-like features attained the best recall at 32.3925% while the use of HOG features attained the best precision at 24.5945% While improvements can be done to make the system suitable for any real world application, this study has proven that it is possible to detect humans from an aerial perspective using fewer and more cost effective resources. The system was able to detect humans in aerial videos taken using an ordinary digital camera. Further improvements on the Mean-Shift clustering, Kalman filter, and the feature extraction methods are recommended for better performance of the system. For the Mean-Shift Algorithm, bandwidth estimators can be used since the system only used a constant bandwidth input. Since the Kalman filter used in the system can only estimate constant velocity and location of the blobs, adaptive Kalman filter can be used to have more accurate estimates when the velocity of the blobs or of the camera becomes time-varying. For HOG and Haar feature extraction, a larger and more unique dataset should be used to have more features to compare with. For SURF extraction, the size of the dataset set, the number of clusters in the visual codebook, the number of octaves and scale levels, and the threshold value should be varied in order to determine the most suitable values for a certain application.
format text
author Garcia, Alghie Marie B.
Rufino, Ma. Andrea V.
Sangalang, Louie Carlo A.
Teodoro, John Arcy R.
author_facet Garcia, Alghie Marie B.
Rufino, Ma. Andrea V.
Sangalang, Louie Carlo A.
Teodoro, John Arcy R.
author_sort Garcia, Alghie Marie B.
title Person detection and tracking from aerial videos
title_short Person detection and tracking from aerial videos
title_full Person detection and tracking from aerial videos
title_fullStr Person detection and tracking from aerial videos
title_full_unstemmed Person detection and tracking from aerial videos
title_sort person detection and tracking from aerial videos
publisher Animo Repository
publishDate 2014
url https://animorepository.dlsu.edu.ph/etd_bachelors/11042
_version_ 1726158563668131840