Do CONTRIBUTING files provide information about OSS newcomers' onboarding barriers?

Effectively onboarding newcomers is essential for the success of open source projects. These projects often provide onboarding guidelines in their ‘CONTRIBUTING’ files (e.g., CONTRIBUTING.md on GitHub). These files explain, for example, how to find open tasks, implement solutions, and submit code fo...

Full description

Saved in:
Bibliographic Details
Main Authors: FRONCHETTI, Felipe, SHEPHERD, David, WIESE, Igor, TREUDE, Christoph, GEROSA, Marco, STEINMACHER, Igor
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8897
https://ink.library.smu.edu.sg/context/sis_research/article/9900/viewcontent/FSE2023_Fronchetti.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9900
record_format dspace
spelling sg-smu-ink.sis_research-99002024-06-27T08:59:25Z Do CONTRIBUTING files provide information about OSS newcomers' onboarding barriers? FRONCHETTI, Felipe SHEPHERD, David WIESE, Igor TREUDE, Christoph GEROSA, Marco STEINMACHER, Igor Effectively onboarding newcomers is essential for the success of open source projects. These projects often provide onboarding guidelines in their ‘CONTRIBUTING’ files (e.g., CONTRIBUTING.md on GitHub). These files explain, for example, how to find open tasks, implement solutions, and submit code for review. However, these files often do not follow a standard structure, can be too large, and miss barriers commonly found by newcomers. In this paper, we propose an automated approach to parse these CONTRIBUTING files and assess how they address onboarding barriers. We manually classified a sample of files according to a model of onboarding barriers from the literature, trained a machine learning classifier that automatically predicts the categories of each paragraph (precision: 0.655, recall: 0.662), and surveyed developers to investigate their perspective of the predictions’ adequacy (75% of the predictions were considered adequate). We found that CONTRIBUTING files typically do not cover the barriers newcomers face (52% of the analyzed projects missed at least 3 out of the 6 barriers faced by newcomers; 84% missed at least 2). Our analysis also revealed that information about choosing a task and talking with the community, two of the most recurrent barriers newcomers face, are neglected in more than 75% of the projects. We made available our classifier as an online service that analyzes the content of a given CONTRIBUTING file. Our approach may help community builders identify missing information in the project ecosystem they maintain and newcomers can understand what to expect in CONTRIBUTING files. 2023-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8897 info:doi/10.1145/3611643.3616288 https://ink.library.smu.edu.sg/context/sis_research/article/9900/viewcontent/FSE2023_Fronchetti.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University novices onboarding FLOSS open source software engineering Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic novices
onboarding
FLOSS
open source
software engineering
Software Engineering
spellingShingle novices
onboarding
FLOSS
open source
software engineering
Software Engineering
FRONCHETTI, Felipe
SHEPHERD, David
WIESE, Igor
TREUDE, Christoph
GEROSA, Marco
STEINMACHER, Igor
Do CONTRIBUTING files provide information about OSS newcomers' onboarding barriers?
description Effectively onboarding newcomers is essential for the success of open source projects. These projects often provide onboarding guidelines in their ‘CONTRIBUTING’ files (e.g., CONTRIBUTING.md on GitHub). These files explain, for example, how to find open tasks, implement solutions, and submit code for review. However, these files often do not follow a standard structure, can be too large, and miss barriers commonly found by newcomers. In this paper, we propose an automated approach to parse these CONTRIBUTING files and assess how they address onboarding barriers. We manually classified a sample of files according to a model of onboarding barriers from the literature, trained a machine learning classifier that automatically predicts the categories of each paragraph (precision: 0.655, recall: 0.662), and surveyed developers to investigate their perspective of the predictions’ adequacy (75% of the predictions were considered adequate). We found that CONTRIBUTING files typically do not cover the barriers newcomers face (52% of the analyzed projects missed at least 3 out of the 6 barriers faced by newcomers; 84% missed at least 2). Our analysis also revealed that information about choosing a task and talking with the community, two of the most recurrent barriers newcomers face, are neglected in more than 75% of the projects. We made available our classifier as an online service that analyzes the content of a given CONTRIBUTING file. Our approach may help community builders identify missing information in the project ecosystem they maintain and newcomers can understand what to expect in CONTRIBUTING files.
format text
author FRONCHETTI, Felipe
SHEPHERD, David
WIESE, Igor
TREUDE, Christoph
GEROSA, Marco
STEINMACHER, Igor
author_facet FRONCHETTI, Felipe
SHEPHERD, David
WIESE, Igor
TREUDE, Christoph
GEROSA, Marco
STEINMACHER, Igor
author_sort FRONCHETTI, Felipe
title Do CONTRIBUTING files provide information about OSS newcomers' onboarding barriers?
title_short Do CONTRIBUTING files provide information about OSS newcomers' onboarding barriers?
title_full Do CONTRIBUTING files provide information about OSS newcomers' onboarding barriers?
title_fullStr Do CONTRIBUTING files provide information about OSS newcomers' onboarding barriers?
title_full_unstemmed Do CONTRIBUTING files provide information about OSS newcomers' onboarding barriers?
title_sort do contributing files provide information about oss newcomers' onboarding barriers?
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8897
https://ink.library.smu.edu.sg/context/sis_research/article/9900/viewcontent/FSE2023_Fronchetti.pdf
_version_ 1814047625102491648