Collecting and annotating videos that teach MS PowerPoint

The central aim of this project is to generate a comprehensive dataset for training an artificial intelligence (AI) that is able to operate Microsoft PowerPoint autonomously. This project encompasses several different phases: Starting with the identification of videos that teach Microsoft PowerPo...

Bibliographic Details
Main Author: Tan, Isaac Jun Hong
Other Authors: Li Boyang
Format: Final Year Project
Language: English
Published: Nanyang Technological University 2023
Subjects: Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Online Access: https://hdl.handle.net/10356/171932
Institution: Nanyang Technological University
id sg-ntu-dr.10356-171932
record_format dspace
spelling sg-ntu-dr.10356-171932 2023-11-17T15:37:20Z
Collecting and annotating videos that teach MS PowerPoint
Tan, Isaac Jun Hong
Li Boyang
School of Computer Science and Engineering
boyang.li@ntu.edu.sg
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
The central aim of this project is to generate a comprehensive dataset for training an artificial intelligence (AI) system that can operate Microsoft PowerPoint autonomously. The project comprises several phases. It begins with the identification of videos that teach Microsoft PowerPoint, which are then downloaded in a Jupyter Notebook using the Pytube library. Videos that lack closed captions are transcribed with the Whisper model. In the annotation phase, keystrokes and mouse clicks are labeled using sequence labeling in Doccano. The project then moves into the model training phase, in which the T5 and FLAN-T5 neural network models are compared on their ability to translate narrated instructions into the corresponding mouse and keyboard actions. Because the data available from YouTube is limited, data augmentation with ChatGPT was used to improve model training.
Bachelor of Engineering (Computer Science)
2023-11-16T08:53:09Z
2023-11-16T08:53:09Z
2023
Final Year Project (FYP)
Tan, I. J. H. (2023). Collecting and annotating videos that teach MS PowerPoint. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/171932
https://hdl.handle.net/10356/171932
en
application/pdf
Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Tan, Isaac Jun Hong
Collecting and annotating videos that teach MS PowerPoint
description The central aim of this project is to generate a comprehensive dataset for training an artificial intelligence (AI) system that can operate Microsoft PowerPoint autonomously. The project comprises several phases. It begins with the identification of videos that teach Microsoft PowerPoint, which are then downloaded in a Jupyter Notebook using the Pytube library. Videos that lack closed captions are transcribed with the Whisper model. In the annotation phase, keystrokes and mouse clicks are labeled using sequence labeling in Doccano. The project then moves into the model training phase, in which the T5 and FLAN-T5 neural network models are compared on their ability to translate narrated instructions into the corresponding mouse and keyboard actions. Because the data available from YouTube is limited, data augmentation with ChatGPT was used to improve model training.
author2 Li Boyang
author_facet Li Boyang
Tan, Isaac Jun Hong
format Final Year Project
author Tan, Isaac Jun Hong
author_sort Tan, Isaac Jun Hong
title Collecting and annotating videos that teach MS PowerPoint
title_short Collecting and annotating videos that teach MS PowerPoint
title_full Collecting and annotating videos that teach MS PowerPoint
title_fullStr Collecting and annotating videos that teach MS PowerPoint
title_full_unstemmed Collecting and annotating videos that teach MS PowerPoint
title_sort collecting and annotating videos that teach ms powerpoint
publisher Nanyang Technological University
publishDate 2023
url https://hdl.handle.net/10356/171932
_version_ 1783955541796585472
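
Illustrative note: the description above says that keystrokes and mouse clicks are labeled in Doccano and that T5 and FLAN-T5 are trained to turn narrated instructions into actions. The sketch below shows one way such annotations could be converted into text-to-text training pairs. It is not the author's pipeline. It assumes Doccano's sequence-labeling JSONL export shape (a `text` field plus a `label` list of `[start, end, name]` spans); the label names `CLICK` and `KEY`, the task prefix, and the target format are all hypothetical choices made for illustration.

```python
import json


def spans_to_pair(record):
    """Turn one Doccano-style record into a (source, target) text pair.

    Assumed record shape: {"text": str, "label": [[start, end, name], ...]}.
    """
    text = record["text"]
    # Sort spans by start offset so actions follow the narration order.
    spans = sorted(record.get("label", []), key=lambda span: span[0])
    # Hypothetical target format: "<LABEL>: <annotated text>" joined by " ; ".
    actions = [f"{name}: {text[start:end]}" for start, end, name in spans]
    # Hypothetical task prefix in the style used for text-to-text models.
    source = "translate narration to actions: " + text
    target = " ; ".join(actions)
    return source, target


def load_pairs(jsonl_lines):
    """Parse JSONL lines (one record per line), skipping blank lines."""
    pairs = []
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue
        pairs.append(spans_to_pair(json.loads(line)))
    return pairs


# Made-up sample record; spans are deliberately out of order.
sample = [
    '{"text": "Click Insert and then press Ctrl+M", '
    '"label": [[28, 34, "KEY"], [6, 12, "CLICK"]]}'
]
example_pairs = load_pairs(sample)
```

Each resulting pair could then be tokenized and passed to a sequence-to-sequence model; keeping the target as plain text is what lets the same data be used with both T5 and FLAN-T5 without changing the model head.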