DEEP LEARNING BASED HUMAN ACTION RECOGNITION AND SPATIOTEMPORAL LOCALIZATION SYSTEM
Spatiotemporal human action localization system is a field in computer vision and is of interest for real-world applications implemented in smart surveillance cameras, such as to improve public security, monitor patients' activities, or even detect any early symptoms of certain diseases....
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/56136 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:56136 |
---|---|
spelling |
id-itb.:561362021-06-21T13:20:51ZDEEP LEARNING BASED HUMAN ACTION RECOGNITION AND SPATIOTEMPORAL LOCALIZATION SYSTEM Nathania, Jesslyn Indonesia Final Project computer vision, smart surveillance camera, human action localization INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/56136 Spatiotemporal human action localization system is a field in computer vision and is of interest for real-world applications implemented in smart surveillance cameras, such as to improve public security, monitor patients' activities, or even detect any early symptoms of certain diseases. The system presented in this thesis followed the YOWO machine learning architecture reference, which was proposed by Köpüklü etc. (2019). YOWO extracts both spatial and temporal information. Bounding box regression and action classification can be done end-to-end. This aims to generate output faster compare to other state-of-the-art approaches. The implementation of this system is trained and tested with J-HMDB and NTU RGB+D datasets. Using certain specifications of machine defined, the system is just able to process video at 0.75seconds per frame with an accepted accuracy value. However, the system succeeds in increasing the human action localization accuracy from the YOWO reference with an accuracy of 41.6% to 43.84%. The result of the experiments shows that the modified architecture is able to improve the accuracy of YOWO. However, it slows down the frame rate of video processing text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Spatiotemporal human action localization system is a field in computer
vision and is of interest for real-world applications implemented in smart
surveillance cameras, such as to improve public security, monitor patients'
activities, or even detect any early symptoms of certain diseases.
The system presented in this thesis followed the YOWO machine learning
architecture reference, which was proposed by Köpüklü etc. (2019). YOWO
extracts both spatial and temporal information. Bounding box regression and action
classification can be done end-to-end. This aims to generate output faster compare
to other state-of-the-art approaches.
The implementation of this system is trained and tested with J-HMDB and
NTU RGB+D datasets. Using certain specifications of machine defined, the system
is just able to process video at 0.75seconds per frame with an accepted accuracy
value. However, the system succeeds in increasing the human action localization
accuracy from the YOWO reference with an accuracy of 41.6% to 43.84%.
The result of the experiments shows that the modified architecture is able to
improve the accuracy of YOWO. However, it slows down the frame rate of video
processing |
format |
Final Project |
author |
Nathania, Jesslyn |
spellingShingle |
Nathania, Jesslyn DEEP LEARNING BASED HUMAN ACTION RECOGNITION AND SPATIOTEMPORAL LOCALIZATION SYSTEM |
author_facet |
Nathania, Jesslyn |
author_sort |
Nathania, Jesslyn |
title |
DEEP LEARNING BASED HUMAN ACTION RECOGNITION AND SPATIOTEMPORAL LOCALIZATION SYSTEM |
title_short |
DEEP LEARNING BASED HUMAN ACTION RECOGNITION AND SPATIOTEMPORAL LOCALIZATION SYSTEM |
title_full |
DEEP LEARNING BASED HUMAN ACTION RECOGNITION AND SPATIOTEMPORAL LOCALIZATION SYSTEM |
title_fullStr |
DEEP LEARNING BASED HUMAN ACTION RECOGNITION AND SPATIOTEMPORAL LOCALIZATION SYSTEM |
title_full_unstemmed |
DEEP LEARNING BASED HUMAN ACTION RECOGNITION AND SPATIOTEMPORAL LOCALIZATION SYSTEM |
title_sort |
deep learning based human action recognition and spatiotemporal localization system |
url |
https://digilib.itb.ac.id/gdl/view/56136 |
_version_ |
1822930106614546432 |