Human action capturing and classification

In this report, a vision-based framework is proposed for learning and inferring occupant activities at different levels. These levels range from short temporal interval movements, to intermediate level events and long temporal term activities. Our research is focused on using a combined tracking-cla...

Full description

Saved in:
Bibliographic Details
Main Author: Feng, Zhou
Other Authors: Cham Tat Jen
Format: Theses and Dissertations
Language:English
Published: 2008
Subjects:
Online Access:https://hdl.handle.net/10356/13599
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:In this report, a vision-based framework is proposed for learning and inferring occupant activities at different levels. These levels range from short temporal interval movements, to intermediate level events and long temporal term activities. Our research is focused on using a combined tracking-classification framework for the unsupervised classification of human action. An initial comparative study was done to evaluate several existing foreground segmentation methods that employ background modeling. Our own probabilistic foreground-background segmentation method is proposed to extract human-centric reference frames. Based on the human-centric reference frames, a principled analysis of the correspondence problem leads to a novel probabilistic action representation called the correspondence-ambiguous feature histogram array (CAFHA) that is robust to variations across similar actions. CAFHA is shown to be effective in unsupervised action classification and quasi real-time action inference. A novel feature selection method is proposed to select the optimal features to improve the CAFHA representation, such that the best discrimination between different action clusters may be found via unsupervised spectral clustering. Finally, a number of potential future directions are proposed that are targeted at further improvements to our framework and creating new research methods required to recognize human activity at longer temporal scales.