Machine learning for image and video summarization

With the digital evolution of the information, the interaction with the digital display has been studied and applied in fields ranging from text entry, mouse controlling, and to online learning, human-computer interaction. The study of gaze tracking is the central part of the research regarding the...

Full description

Saved in:
Bibliographic Details
Main Author: Liu, Liuziyi
Other Authors: Tan Yap Peng
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/136788
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-136788
record_format dspace
spelling sg-ntu-dr.10356-1367882023-07-07T16:58:07Z Machine learning for image and video summarization Liu, Liuziyi Tan Yap Peng School of Electrical and Electronic Engineering EYPTan@ntu.edu.sg Engineering::Electrical and electronic engineering With the digital evolution of the information, the interaction with the digital display has been studied and applied in fields ranging from text entry, mouse controlling, and to online learning, human-computer interaction. The study of gaze tracking is the central part of the research regarding the interaction with the digital display as the gaze is the fastest way of showing interest on a subject. Current gazing tracking systems implement various machine learning methods such as Neural Networks, Gaussian process regression, Ensemble of Regression Trees for landmark detection and head pose estimation. However, there is no robust solution as most of the systems are still subject to limitations, including unsatisfied accuracy, significant head movement, expensive geometric setups, inconsistent lighting conditions and cumbersome calibrations. In this way, there is not enough robustness for real-world applications. Besides, while most existing gaze tracking system focuses only on estimating the gaze direction, more efforts are needed for studying the gaze tracking on a digital display. This project studies gaze tracking on a digital display with a webcam camera through a machine learning approach. Different functions, including facial landmark detection, head pose estimation, gaze projection and image processing, are studied and integrated to realize the purpose of tracking gaze on the digital display. The project was to design a gaze tracking system that provides accurate performance on a digital display that is applicable for analysis of students’ behaviors during the E-learning process. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-01-28T04:36:04Z 2020-01-28T04:36:04Z 2019 Final Year Project (FYP) https://hdl.handle.net/10356/136788 en 3242-182 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Liu, Liuziyi
Machine learning for image and video summarization
description With the digital evolution of the information, the interaction with the digital display has been studied and applied in fields ranging from text entry, mouse controlling, and to online learning, human-computer interaction. The study of gaze tracking is the central part of the research regarding the interaction with the digital display as the gaze is the fastest way of showing interest on a subject. Current gazing tracking systems implement various machine learning methods such as Neural Networks, Gaussian process regression, Ensemble of Regression Trees for landmark detection and head pose estimation. However, there is no robust solution as most of the systems are still subject to limitations, including unsatisfied accuracy, significant head movement, expensive geometric setups, inconsistent lighting conditions and cumbersome calibrations. In this way, there is not enough robustness for real-world applications. Besides, while most existing gaze tracking system focuses only on estimating the gaze direction, more efforts are needed for studying the gaze tracking on a digital display. This project studies gaze tracking on a digital display with a webcam camera through a machine learning approach. Different functions, including facial landmark detection, head pose estimation, gaze projection and image processing, are studied and integrated to realize the purpose of tracking gaze on the digital display. The project was to design a gaze tracking system that provides accurate performance on a digital display that is applicable for analysis of students’ behaviors during the E-learning process.
author2 Tan Yap Peng
author_facet Tan Yap Peng
Liu, Liuziyi
format Final Year Project
author Liu, Liuziyi
author_sort Liu, Liuziyi
title Machine learning for image and video summarization
title_short Machine learning for image and video summarization
title_full Machine learning for image and video summarization
title_fullStr Machine learning for image and video summarization
title_full_unstemmed Machine learning for image and video summarization
title_sort machine learning for image and video summarization
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/136788
_version_ 1772826682954088448