A simple data mixing prior for improving self-supervised learning

Bibliographic Details
Main Authors:	REN, Sucheng, WANG, Huiyu, GAO, Zhengqi, HE, Shengfeng, YUILLE, Alan, ZHOU, Yuyin, XIE, Cihang
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2022
Subjects:	Categorization; Representation learning; Retrieval; Self- & semi- & meta- recognition; Detection; Databases and Information Systems; Graphics and Human Computer Interfaces
Online Access:	https://ink.library.smu.edu.sg/sis_research/8445 https://ink.library.smu.edu.sg/context/sis_research/article/9448/viewcontent/Ren_A_Simple_Data_Mixing_Prior_for_Improving_Self_Supervised_Learning_CVPR_2022_paper.pdf
Institution:	Singapore Management University

Description
Summary:	Data mixing (e.g., Mixup, CutMix, ResizeMix) is an essential component for advancing recognition models. In this paper, we study its effectiveness in the self-supervised setting. Noticing that mixed images sharing the same source images are intrinsically related to each other, we propose SDMP, short for Simple Data Mixing Prior, to capture this straightforward yet essential prior and to treat such mixed images as additional positive pairs that facilitate self-supervised representation learning. Our experiments verify that SDMP enables data mixing to help a set of self-supervised learning frameworks (e.g., MoCo) achieve better accuracy and out-of-distribution robustness. More notably, SDMP is the first method that successfully leverages data mixing to improve (rather than hurt) the performance of Vision Transformers in the self-supervised setting. Code is publicly available at https://github.com/OliverRensu/SDMP.
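
The prior described in the summary can be illustrated with a minimal sketch. It is not taken from the authors' repository. It assumes Mixup-style blending and assumes each image is mixed with the one at the mirrored batch position; the names `mixup_with_reversed_batch` and `shares_sources` are hypothetical. Under those assumptions, two mixed images built from the same pair of sources can be identified and treated as an additional positive pair.

```python
import numpy as np


def mixup_with_reversed_batch(images, lam):
    """Blend each image with the one at the mirrored batch position.

    Assumption for illustration only: the pairing rule (i with n-1-i)
    and a single shared mixing weight `lam` are not stated in the record.
    """
    n = images.shape[0]
    partner = np.arange(n)[::-1]  # image i is mixed with image n-1-i
    mixed = lam * images + (1.0 - lam) * images[partner]
    return mixed, partner


def shares_sources(i, j, partner):
    """Return True if mixed images i and j come from the same two sources."""
    return {i, int(partner[i])} == {j, int(partner[j])}


rng = np.random.default_rng(0)
batch = rng.random((4, 3, 8, 8))  # 4 toy "images", 3 channels, 8x8 pixels
mixed, partner = mixup_with_reversed_batch(batch, lam=0.7)

# Mixed images 0 and 3 both blend source images 0 and 3 (with swapped
# weights), so they could serve as an extra positive pair; 0 and 1 could not.
pair_03 = shares_sources(0, 3, partner)
pair_01 = shares_sources(0, 1, partner)
```

In a contrastive framework such as MoCo, pairs flagged this way would be added to the usual positives from augmented views; how the loss weights them is left out of this sketch.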
