Skip to main navigation Skip to search Skip to main content

Debiasing CLIP with Feature Augmentation and Selective Update to Preserve Zero-Shot Capability

  • Sungkyunkwan University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Vision-language models, including CLIP, have shown impressive performance across various tasks. However, these models often struggle to distinguish meaningful features from spurious correlations in dataset biases, resulting in gen-eralization issues. Mitigating these biases without compromising CLIP's core strengths for zero-shot performance is crucial but has been overlooked. We propose a novel adversarial training method to debias CLIP, combining PGD on the projection layer for feature augmentation and Selective Update for Debiasing (SUD). Our method targets the projection layer, using minimally distorted feature augmentation and selective update strategy guided by model predictions. PGD uses the embedding layer's gradients to generate biased features that induce misclassifications with minimal distortion. SUD applies different objective functions to refine features based on prediction accuracy. These methods effectively mitigate bias by training on misclassified samples and preserve the existing embedding space. Experiments confirm our method improves model robustness against biases while maintaining zero-shot performance. This approach offers a promising solution for debiasing vision-language models without degradation in performance.

Original languageEnglish
Title of host publication2024 Joint 13th International Conference on Soft Computing and Intelligent Systems and 25th International Symposium on Advanced Intelligent Systems, SCIS and ISIS 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350373332
DOIs
StatePublished - 2024
EventJoint 13th International Conference on Soft Computing and Intelligent Systems and 25th International Symposium on Advanced Intelligent Systems, SCIS and ISIS 2024 - Himeji, Japan
Duration: 9 Nov 202412 Nov 2024

Publication series

Name2024 Joint 13th International Conference on Soft Computing and Intelligent Systems and 25th International Symposium on Advanced Intelligent Systems, SCIS and ISIS 2024

Conference

ConferenceJoint 13th International Conference on Soft Computing and Intelligent Systems and 25th International Symposium on Advanced Intelligent Systems, SCIS and ISIS 2024
Country/TerritoryJapan
CityHimeji
Period9/11/2412/11/24

Keywords

  • Adversarial Training
  • Debiasing
  • Feature Augmentation
  • Selective Up-date
  • Vision-Language Model

Fingerprint

Dive into the research topics of 'Debiasing CLIP with Feature Augmentation and Selective Update to Preserve Zero-Shot Capability'. Together they form a unique fingerprint.

Cite this