Supporting software engineers with large language model-based automation

In recent years, software engineering (SE) has witnessed significant growth, leading to the creation and sharing of an abundance of software artifacts such as source code, bug reports, and pull requests. Analyzing these artifacts is crucial for comprehending the sentiments of software developers and...

Full description

Saved in:

Bibliographic Details
Main Author:	ZHANG, Ting
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2023
Subjects:	large language models sentiment analysis software engineering duplicate bug reports pull request Programming Languages and Compilers Software Engineering
Online Access:	https://ink.library.smu.edu.sg/etd_coll/545 https://ink.library.smu.edu.sg/context/etd_coll/article/1543/viewcontent/GPIS_AY2019_PhD_ZHANGTing.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.etd_coll-1543
record_format	dspace
spelling	sg-smu-ink.etd_coll-15432024-06-20T01:46:56Z Supporting software engineers with large language model-based automation ZHANG, Ting In recent years, software engineering (SE) has witnessed significant growth, leading to the creation and sharing of an abundance of software artifacts such as source code, bug reports, and pull requests. Analyzing these artifacts is crucial for comprehending the sentiments of software developers and automating various SE tasks, ultimately leading to more human-centered automated SE and enhancing software development efficiency. However, the diverse and unstructured nature of software text poses a significant challenge to this analysis. In response, researchers have investigated a variety of approaches, including the utilization of natural language processing techniques. The advent of large language models (LLMs), ranging from smaller-size LLMs (sLLMs) like BERT to bigger ones (bLLMs) such as LLaMA, has ignited a growing interest in their potential for analyzing software-related text. This dissertation explores how LLMs can automate different SE tasks involving classification, ranking, and generation tasks. In the first study, we assess the efficacy of sLLMs, such as BERT, in SE sentiment analysis, comparing them to existing SE-specific tools. Furthermore, we compare the performance of bLLMs with sLLMs in this context. In the second study, we address the issue of retrieving duplicate bug reports. First, we create a benchmark and then use bLLMs to enhance the accuracy of this process, with a specific focus on employing GPT-3.5 for suggesting duplicate bug reports. In the third study, we propose to leverage sLLMs to create precise and concise pull request titles. In conclusion, this dissertation contributes to the SE field by exploring the potential of LLMs to support software developers in understanding sentiments and improving the efficiency of software development. 2023-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/etd_coll/545 https://ink.library.smu.edu.sg/context/etd_coll/article/1543/viewcontent/GPIS_AY2019_PhD_ZHANGTing.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Dissertations and Theses Collection (Open Access) eng Institutional Knowledge at Singapore Management University large language models sentiment analysis software engineering duplicate bug reports pull request Programming Languages and Compilers Software Engineering
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	large language models sentiment analysis software engineering duplicate bug reports pull request Programming Languages and Compilers Software Engineering
spellingShingle	large language models sentiment analysis software engineering duplicate bug reports pull request Programming Languages and Compilers Software Engineering ZHANG, Ting Supporting software engineers with large language model-based automation
description	In recent years, software engineering (SE) has witnessed significant growth, leading to the creation and sharing of an abundance of software artifacts such as source code, bug reports, and pull requests. Analyzing these artifacts is crucial for comprehending the sentiments of software developers and automating various SE tasks, ultimately leading to more human-centered automated SE and enhancing software development efficiency. However, the diverse and unstructured nature of software text poses a significant challenge to this analysis. In response, researchers have investigated a variety of approaches, including the utilization of natural language processing techniques. The advent of large language models (LLMs), ranging from smaller-size LLMs (sLLMs) like BERT to bigger ones (bLLMs) such as LLaMA, has ignited a growing interest in their potential for analyzing software-related text. This dissertation explores how LLMs can automate different SE tasks involving classification, ranking, and generation tasks. In the first study, we assess the efficacy of sLLMs, such as BERT, in SE sentiment analysis, comparing them to existing SE-specific tools. Furthermore, we compare the performance of bLLMs with sLLMs in this context. In the second study, we address the issue of retrieving duplicate bug reports. First, we create a benchmark and then use bLLMs to enhance the accuracy of this process, with a specific focus on employing GPT-3.5 for suggesting duplicate bug reports. In the third study, we propose to leverage sLLMs to create precise and concise pull request titles. In conclusion, this dissertation contributes to the SE field by exploring the potential of LLMs to support software developers in understanding sentiments and improving the efficiency of software development.
format	text
author	ZHANG, Ting
author_facet	ZHANG, Ting
author_sort	ZHANG, Ting
title	Supporting software engineers with large language model-based automation
title_short	Supporting software engineers with large language model-based automation
title_full	Supporting software engineers with large language model-based automation
title_fullStr	Supporting software engineers with large language model-based automation
title_full_unstemmed	Supporting software engineers with large language model-based automation
title_sort	supporting software engineers with large language model-based automation
publisher	Institutional Knowledge at Singapore Management University
publishDate	2023
url	https://ink.library.smu.edu.sg/etd_coll/545 https://ink.library.smu.edu.sg/context/etd_coll/article/1543/viewcontent/GPIS_AY2019_PhD_ZHANGTing.pdf
_version_	1814047579466366976

Supporting software engineers with large language model-based automation

Similar Items