Exploring Text-Guided Synthetic Distribution Shifts for Robust Image Classification

The empirical risk minimization approach of contemporary machine learning leads to potential failures under distribution shifts. While out-of-distribution data can be used to probe for robustness issues, collecting this at scale in the wild can be difficult given its nature. We propose a novel metho...

Full description

Saved in:
Bibliographic Details
Main Authors: Ramos, Ryan, Alampay, Raphael, Abu, Patricia Angela R
Format: text
Published: Archīum Ateneo 2023
Subjects:
Online Access:https://archium.ateneo.edu/discs-faculty-pubs/380
https://doi.org/10.1007/978-3-031-41630-9_16
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Ateneo De Manila University
id ph-ateneo-arc.discs-faculty-pubs-1380
record_format eprints
spelling ph-ateneo-arc.discs-faculty-pubs-13802024-02-21T03:15:13Z Exploring Text-Guided Synthetic Distribution Shifts for Robust Image Classification Ramos, Ryan Alampay, Raphael Abu, Patricia Angela R The empirical risk minimization approach of contemporary machine learning leads to potential failures under distribution shifts. While out-of-distribution data can be used to probe for robustness issues, collecting this at scale in the wild can be difficult given its nature. We propose a novel method to generate this data using pretrained foundation models. We train a language model to generate class-conditioned image captions that minimize their cosine similarity with that of corresponding class images from the original distribution. We then use these captions to synthesize new images with off-the-shelf text-to-image generative models. We show our method’s ability to generate samples from shifted distributions, and the quality of the data for both robustness testing and as additional training data to improve generalization. 2023-01-01T08:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/380 https://doi.org/10.1007/978-3-031-41630-9_16 Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo foundation models Robustness to distribution shift synthetic data Computer Engineering Electrical and Computer Engineering Engineering Systems and Communications
institution Ateneo De Manila University
building Ateneo De Manila University Library
continent Asia
country Philippines
Philippines
content_provider Ateneo De Manila University Library
collection archium.Ateneo Institutional Repository
topic foundation models
Robustness to distribution shift
synthetic data
Computer Engineering
Electrical and Computer Engineering
Engineering
Systems and Communications
spellingShingle foundation models
Robustness to distribution shift
synthetic data
Computer Engineering
Electrical and Computer Engineering
Engineering
Systems and Communications
Ramos, Ryan
Alampay, Raphael
Abu, Patricia Angela R
Exploring Text-Guided Synthetic Distribution Shifts for Robust Image Classification
description The empirical risk minimization approach of contemporary machine learning leads to potential failures under distribution shifts. While out-of-distribution data can be used to probe for robustness issues, collecting this at scale in the wild can be difficult given its nature. We propose a novel method to generate this data using pretrained foundation models. We train a language model to generate class-conditioned image captions that minimize their cosine similarity with that of corresponding class images from the original distribution. We then use these captions to synthesize new images with off-the-shelf text-to-image generative models. We show our method’s ability to generate samples from shifted distributions, and the quality of the data for both robustness testing and as additional training data to improve generalization.
format text
author Ramos, Ryan
Alampay, Raphael
Abu, Patricia Angela R
author_facet Ramos, Ryan
Alampay, Raphael
Abu, Patricia Angela R
author_sort Ramos, Ryan
title Exploring Text-Guided Synthetic Distribution Shifts for Robust Image Classification
title_short Exploring Text-Guided Synthetic Distribution Shifts for Robust Image Classification
title_full Exploring Text-Guided Synthetic Distribution Shifts for Robust Image Classification
title_fullStr Exploring Text-Guided Synthetic Distribution Shifts for Robust Image Classification
title_full_unstemmed Exploring Text-Guided Synthetic Distribution Shifts for Robust Image Classification
title_sort exploring text-guided synthetic distribution shifts for robust image classification
publisher Archīum Ateneo
publishDate 2023
url https://archium.ateneo.edu/discs-faculty-pubs/380
https://doi.org/10.1007/978-3-031-41630-9_16
_version_ 1792202614611378176