Deep learning for image and video understanding

Gaze detection is a sub-area under object detection and becomes more and more popular for its wide applications that are useful in our daily life. For example, the gaze following analysis can be quite useful in smart-study system to monitor the students’ studying situations. In this report, we focus...

Full description

Saved in:
Bibliographic Details
Main Author: Yue, Kunlun
Other Authors: Tan Yap Peng
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/139497
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-139497
record_format dspace
spelling sg-ntu-dr.10356-1394972023-07-07T18:03:07Z Deep learning for image and video understanding Yue, Kunlun Tan Yap Peng School of Electrical and Electronic Engineering eyptan@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies Engineering::Electrical and electronic engineering Gaze detection is a sub-area under object detection and becomes more and more popular for its wide applications that are useful in our daily life. For example, the gaze following analysis can be quite useful in smart-study system to monitor the students’ studying situations. In this report, we focus on another topic under gaze analysis---Looking at each other(LAEO). Knowing whether peoples are LAEO can help us understand their relationships because mutual gaze between people is a very important non-verbal communication. Most of the methods presented and used focus on analyzing mutual gaze in an individual frame. But this report will talk about a new method, which will conduct this analysis in a spatio-temporal approach. Continual frames and videos will be used as input data. Then we will extract the heads to create tracks(a list of heads that heaped in accordance with time) and get the respective heads_map as inputs for the model. Finally the system will decide whether the peoples are LAEO by giving the probability(Given by LAEO score) of LAEO. The results on common meeting room videos demonstrate the effectiveness of the new method and model. Hopefully, this system can be used in the real-time applications to monitor or analyze people in a meeting room after some future works. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-05-20T02:13:27Z 2020-05-20T02:13:27Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/139497 en A3292-191 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies
Engineering::Electrical and electronic engineering
spellingShingle Engineering::Computer science and engineering::Computing methodologies
Engineering::Electrical and electronic engineering
Yue, Kunlun
Deep learning for image and video understanding
description Gaze detection is a sub-area under object detection and becomes more and more popular for its wide applications that are useful in our daily life. For example, the gaze following analysis can be quite useful in smart-study system to monitor the students’ studying situations. In this report, we focus on another topic under gaze analysis---Looking at each other(LAEO). Knowing whether peoples are LAEO can help us understand their relationships because mutual gaze between people is a very important non-verbal communication. Most of the methods presented and used focus on analyzing mutual gaze in an individual frame. But this report will talk about a new method, which will conduct this analysis in a spatio-temporal approach. Continual frames and videos will be used as input data. Then we will extract the heads to create tracks(a list of heads that heaped in accordance with time) and get the respective heads_map as inputs for the model. Finally the system will decide whether the peoples are LAEO by giving the probability(Given by LAEO score) of LAEO. The results on common meeting room videos demonstrate the effectiveness of the new method and model. Hopefully, this system can be used in the real-time applications to monitor or analyze people in a meeting room after some future works.
author2 Tan Yap Peng
author_facet Tan Yap Peng
Yue, Kunlun
format Final Year Project
author Yue, Kunlun
author_sort Yue, Kunlun
title Deep learning for image and video understanding
title_short Deep learning for image and video understanding
title_full Deep learning for image and video understanding
title_fullStr Deep learning for image and video understanding
title_full_unstemmed Deep learning for image and video understanding
title_sort deep learning for image and video understanding
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/139497
_version_ 1772826293912469504