Machine learning for image and video summarization
With the digital evolution of the information, the interaction with the digital display has been studied and applied in fields ranging from text entry, mouse controlling, and to online learning, human-computer interaction. The study of gaze tracking is the central part of the research regarding the...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/136788 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-136788 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1367882023-07-07T16:58:07Z Machine learning for image and video summarization Liu, Liuziyi Tan Yap Peng School of Electrical and Electronic Engineering EYPTan@ntu.edu.sg Engineering::Electrical and electronic engineering With the digital evolution of the information, the interaction with the digital display has been studied and applied in fields ranging from text entry, mouse controlling, and to online learning, human-computer interaction. The study of gaze tracking is the central part of the research regarding the interaction with the digital display as the gaze is the fastest way of showing interest on a subject. Current gazing tracking systems implement various machine learning methods such as Neural Networks, Gaussian process regression, Ensemble of Regression Trees for landmark detection and head pose estimation. However, there is no robust solution as most of the systems are still subject to limitations, including unsatisfied accuracy, significant head movement, expensive geometric setups, inconsistent lighting conditions and cumbersome calibrations. In this way, there is not enough robustness for real-world applications. Besides, while most existing gaze tracking system focuses only on estimating the gaze direction, more efforts are needed for studying the gaze tracking on a digital display. This project studies gaze tracking on a digital display with a webcam camera through a machine learning approach. Different functions, including facial landmark detection, head pose estimation, gaze projection and image processing, are studied and integrated to realize the purpose of tracking gaze on the digital display. The project was to design a gaze tracking system that provides accurate performance on a digital display that is applicable for analysis of students’ behaviors during the E-learning process. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-01-28T04:36:04Z 2020-01-28T04:36:04Z 2019 Final Year Project (FYP) https://hdl.handle.net/10356/136788 en 3242-182 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Liu, Liuziyi Machine learning for image and video summarization |
description |
With the digital evolution of the information, the interaction with the digital display has been studied and applied in fields ranging from text entry, mouse controlling, and to online learning, human-computer interaction. The study of gaze tracking is the central part of the research regarding the interaction with the digital display as the gaze is the fastest way of showing interest on a subject. Current gazing tracking systems implement various machine learning methods such as Neural Networks, Gaussian process regression, Ensemble of Regression Trees for landmark detection and head pose estimation. However, there is no robust solution as most of the systems are still subject to limitations, including unsatisfied accuracy, significant head movement, expensive geometric setups, inconsistent lighting conditions and cumbersome calibrations. In this way, there is not enough robustness for real-world applications. Besides, while most existing gaze tracking system focuses only on estimating the gaze direction, more efforts are needed for studying the gaze tracking on a digital display.
This project studies gaze tracking on a digital display with a webcam camera through a machine learning approach. Different functions, including facial landmark detection, head pose estimation, gaze projection and image processing, are studied and integrated to realize the purpose of tracking gaze on the digital display. The project was to design a gaze tracking system that provides accurate performance on a digital display that is applicable for analysis of students’ behaviors during the E-learning process. |
author2 |
Tan Yap Peng |
author_facet |
Tan Yap Peng Liu, Liuziyi |
format |
Final Year Project |
author |
Liu, Liuziyi |
author_sort |
Liu, Liuziyi |
title |
Machine learning for image and video summarization |
title_short |
Machine learning for image and video summarization |
title_full |
Machine learning for image and video summarization |
title_fullStr |
Machine learning for image and video summarization |
title_full_unstemmed |
Machine learning for image and video summarization |
title_sort |
machine learning for image and video summarization |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/136788 |
_version_ |
1772826682954088448 |