Gotcha ! This model uses my code ! Evaluating membership leakage risks in code models
Leveraging large-scale datasets from open-source projects and advances in large language models, recent progress has led to sophisticated code models for key software engineering tasks, such as program repair and code completion. These models are trained on data from various sources, including publi...
Saved in:
Main Authors: | YANG, Zhou, ZHAO, Zhipeng, WANG, Chenyu, SHI, Jieke, KIM, Dongsum, HAN, Donggyun, LO, David |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2024
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/9889 https://ink.library.smu.edu.sg/context/sis_research/article/10889/viewcontent/2310.01166v2.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Similar Items
-
TOWARD A BETTER UNDERSTANDING OF PRIVACY LEAKAGE IN MACHINE LEARNING, USING DATASET PRUNING ATTACK
by: VICTOR MICHEL THEODORE MASIAK
Published: (2023) -
Membership Encoding for Deep Learning
by: Congzheng, Song, et al.
Published: (2020) -
Privacy Risks of Securing Machine Learning Models against Adversarial Examples
by: Liwei Song, et al.
Published: (2020) -
Unveiling memorization in code models
by: YANG, Zhou, et al.
Published: (2024) -
Learning without coding
by: Jain, S., et al.
Published: (2013)