Can identifier splitting improve open-vocabulary language model of code?

Statistical language models on source code have successfully assisted software engineering tasks. However, developers can create or pick arbitrary identifiers when writing source code. Freely chosen identifiers lead to the notorious out-of-vocabulary (OOV) problem that negatively affects model perfo...

全面介紹

Saved in:
書目詳細資料
Main Authors: SHI, Jieke, YANG, Zhou, HE, Junda, XU, Bowen, LO, David
格式: text
語言:English
出版: Institutional Knowledge at Singapore Management University 2022
主題:
在線閱讀:https://ink.library.smu.edu.sg/sis_research/7698
https://ink.library.smu.edu.sg/context/sis_research/article/8701/viewcontent/can_identifier.pdf
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!