Deep learning for image and video understanding

Gaze detection is a sub-area under object detection and becomes more and more popular for its wide applications that are useful in our daily life. For example, the gaze following analysis can be quite useful in smart-study system to monitor the students’ studying situations. In this report, we focus...

全面介紹

Saved in:

書目詳細資料
主要作者:	Yue, Kunlun
其他作者:	Tan Yap Peng
格式:	Final Year Project
語言:	English
出版:	Nanyang Technological University 2020
主題:	Engineering::Computer science and engineering::Computing methodologies Engineering::Electrical and electronic engineering
在線閱讀:	https://hdl.handle.net/10356/139497
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!
機構:	Nanyang Technological University
語言:	English

id	sg-ntu-dr.10356-139497
record_format	dspace
spelling	sg-ntu-dr.10356-1394972023-07-07T18:03:07Z Deep learning for image and video understanding Yue, Kunlun Tan Yap Peng School of Electrical and Electronic Engineering eyptan@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies Engineering::Electrical and electronic engineering Gaze detection is a sub-area under object detection and becomes more and more popular for its wide applications that are useful in our daily life. For example, the gaze following analysis can be quite useful in smart-study system to monitor the students’ studying situations. In this report, we focus on another topic under gaze analysis---Looking at each other(LAEO). Knowing whether peoples are LAEO can help us understand their relationships because mutual gaze between people is a very important non-verbal communication. Most of the methods presented and used focus on analyzing mutual gaze in an individual frame. But this report will talk about a new method, which will conduct this analysis in a spatio-temporal approach. Continual frames and videos will be used as input data. Then we will extract the heads to create tracks(a list of heads that heaped in accordance with time) and get the respective heads_map as inputs for the model. Finally the system will decide whether the peoples are LAEO by giving the probability(Given by LAEO score) of LAEO. The results on common meeting room videos demonstrate the effectiveness of the new method and model. Hopefully, this system can be used in the real-time applications to monitor or analyze people in a meeting room after some future works. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-05-20T02:13:27Z 2020-05-20T02:13:27Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/139497 en A3292-191 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies Engineering::Electrical and electronic engineering
spellingShingle	Engineering::Computer science and engineering::Computing methodologies Engineering::Electrical and electronic engineering Yue, Kunlun Deep learning for image and video understanding
description	Gaze detection is a sub-area under object detection and becomes more and more popular for its wide applications that are useful in our daily life. For example, the gaze following analysis can be quite useful in smart-study system to monitor the students’ studying situations. In this report, we focus on another topic under gaze analysis---Looking at each other(LAEO). Knowing whether peoples are LAEO can help us understand their relationships because mutual gaze between people is a very important non-verbal communication. Most of the methods presented and used focus on analyzing mutual gaze in an individual frame. But this report will talk about a new method, which will conduct this analysis in a spatio-temporal approach. Continual frames and videos will be used as input data. Then we will extract the heads to create tracks(a list of heads that heaped in accordance with time) and get the respective heads_map as inputs for the model. Finally the system will decide whether the peoples are LAEO by giving the probability(Given by LAEO score) of LAEO. The results on common meeting room videos demonstrate the effectiveness of the new method and model. Hopefully, this system can be used in the real-time applications to monitor or analyze people in a meeting room after some future works.
author2	Tan Yap Peng
author_facet	Tan Yap Peng Yue, Kunlun
format	Final Year Project
author	Yue, Kunlun
author_sort	Yue, Kunlun
title	Deep learning for image and video understanding
title_short	Deep learning for image and video understanding
title_full	Deep learning for image and video understanding
title_fullStr	Deep learning for image and video understanding
title_full_unstemmed	Deep learning for image and video understanding
title_sort	deep learning for image and video understanding
publisher	Nanyang Technological University
publishDate	2020
url	https://hdl.handle.net/10356/139497
_version_	1772826293912469504

Deep learning for image and video understanding

相似書籍