Predicting MS Powerpoint mouse/keyboard actions

This project explores the application of Generative Pre-trained Transformer (GPT) models, specifically GPT-2 and GPT-3, for predicting the textual instructions corresponding to user actions in Microsoft PowerPoint, such as mouse movements and keyboard inputs. Through extensive experimentation and im...

全面介紹

Saved in:
書目詳細資料
主要作者: Chong, Kass Min
其他作者: Li Boyang
格式: Final Year Project
語言:English
出版: Nanyang Technological University 2024
主題:
在線閱讀:https://hdl.handle.net/10356/175104
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Nanyang Technological University
語言: English
id sg-ntu-dr.10356-175104
record_format dspace
spelling sg-ntu-dr.10356-1751042024-04-26T15:40:21Z Predicting MS Powerpoint mouse/keyboard actions Chong, Kass Min Li Boyang School of Computer Science and Engineering boyang.li@ntu.edu.sg Computer and Information Science Artificial intelligence This project explores the application of Generative Pre-trained Transformer (GPT) models, specifically GPT-2 and GPT-3, for predicting the textual instructions corresponding to user actions in Microsoft PowerPoint, such as mouse movements and keyboard inputs. Through extensive experimentation and implementation, we were able to observe how soft prompting with GPT-2 and in-context learning with GPT-3 exceed baseline performance established through hyperparameter tuning of the GPT-2 model. This achievement is particularly notable in two domains: the prediction of user intentions and the prediction of procedural instructions. Hence, this study underscores the efficacy of these techniques in augmenting the capabilities of the employed models. By illustrating the potential of AI-driven solutions to streamline interactions with software applications, this work sets a foundation for a shift in user experience within productivity tools, driven by seamless, natural language commands. Bachelor's degree 2024-04-22T00:11:38Z 2024-04-22T00:11:38Z 2024 Final Year Project (FYP) Chong, K. M. (2024). Predicting MS Powerpoint mouse/keyboard actions. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175104 https://hdl.handle.net/10356/175104 en SCSE23-0710 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Artificial intelligence
spellingShingle Computer and Information Science
Artificial intelligence
Chong, Kass Min
Predicting MS Powerpoint mouse/keyboard actions
description This project explores the application of Generative Pre-trained Transformer (GPT) models, specifically GPT-2 and GPT-3, for predicting the textual instructions corresponding to user actions in Microsoft PowerPoint, such as mouse movements and keyboard inputs. Through extensive experimentation and implementation, we were able to observe how soft prompting with GPT-2 and in-context learning with GPT-3 exceed baseline performance established through hyperparameter tuning of the GPT-2 model. This achievement is particularly notable in two domains: the prediction of user intentions and the prediction of procedural instructions. Hence, this study underscores the efficacy of these techniques in augmenting the capabilities of the employed models. By illustrating the potential of AI-driven solutions to streamline interactions with software applications, this work sets a foundation for a shift in user experience within productivity tools, driven by seamless, natural language commands.
author2 Li Boyang
author_facet Li Boyang
Chong, Kass Min
format Final Year Project
author Chong, Kass Min
author_sort Chong, Kass Min
title Predicting MS Powerpoint mouse/keyboard actions
title_short Predicting MS Powerpoint mouse/keyboard actions
title_full Predicting MS Powerpoint mouse/keyboard actions
title_fullStr Predicting MS Powerpoint mouse/keyboard actions
title_full_unstemmed Predicting MS Powerpoint mouse/keyboard actions
title_sort predicting ms powerpoint mouse/keyboard actions
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/175104
_version_ 1800916136395538432