Deep learning for image and video understanding
Gaze detection is a sub-area under object detection and becomes more and more popular for its wide applications that are useful in our daily life. For example, the gaze following analysis can be quite useful in smart-study system to monitor the students’ studying situations. In this report, we focus...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/139497 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-139497 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1394972023-07-07T18:03:07Z Deep learning for image and video understanding Yue, Kunlun Tan Yap Peng School of Electrical and Electronic Engineering eyptan@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies Engineering::Electrical and electronic engineering Gaze detection is a sub-area under object detection and becomes more and more popular for its wide applications that are useful in our daily life. For example, the gaze following analysis can be quite useful in smart-study system to monitor the students’ studying situations. In this report, we focus on another topic under gaze analysis---Looking at each other(LAEO). Knowing whether peoples are LAEO can help us understand their relationships because mutual gaze between people is a very important non-verbal communication. Most of the methods presented and used focus on analyzing mutual gaze in an individual frame. But this report will talk about a new method, which will conduct this analysis in a spatio-temporal approach. Continual frames and videos will be used as input data. Then we will extract the heads to create tracks(a list of heads that heaped in accordance with time) and get the respective heads_map as inputs for the model. Finally the system will decide whether the peoples are LAEO by giving the probability(Given by LAEO score) of LAEO. The results on common meeting room videos demonstrate the effectiveness of the new method and model. Hopefully, this system can be used in the real-time applications to monitor or analyze people in a meeting room after some future works. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-05-20T02:13:27Z 2020-05-20T02:13:27Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/139497 en A3292-191 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies Engineering::Electrical and electronic engineering Yue, Kunlun Deep learning for image and video understanding |
description |
Gaze detection is a sub-area under object detection and becomes more and more popular for its wide applications that are useful in our daily life. For example, the gaze following analysis can be quite useful in smart-study system to monitor the students’ studying situations. In this report, we focus on another topic under gaze analysis---Looking at each other(LAEO). Knowing whether peoples are LAEO can help us understand their relationships because mutual gaze between people is a very important non-verbal communication. Most of the methods presented and used focus on analyzing mutual gaze in an individual frame. But this report will talk about a new method, which will conduct this analysis in a spatio-temporal approach. Continual frames and videos will be used as input data. Then we will extract the heads to create tracks(a list of heads that heaped in accordance with time) and get the respective heads_map as inputs for the model. Finally the system will decide whether the peoples are LAEO by giving the probability(Given by LAEO score) of LAEO. The results on common meeting room videos demonstrate the effectiveness of the new method and model. Hopefully, this system can be used in the real-time applications to monitor or analyze people in a meeting room after some future works. |
author2 |
Tan Yap Peng |
author_facet |
Tan Yap Peng Yue, Kunlun |
format |
Final Year Project |
author |
Yue, Kunlun |
author_sort |
Yue, Kunlun |
title |
Deep learning for image and video understanding |
title_short |
Deep learning for image and video understanding |
title_full |
Deep learning for image and video understanding |
title_fullStr |
Deep learning for image and video understanding |
title_full_unstemmed |
Deep learning for image and video understanding |
title_sort |
deep learning for image and video understanding |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/139497 |
_version_ |
1772826293912469504 |