Combating Dataset Misalignment for Robust AI-Generated Image Detection in the Real World

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

AI-generated images are increasingly prevalent on the web, raising concerns about the real-world applicability of detection methods. While current detectors perform well on benchmark datasets, they suffer significant performance degradation on real-world datasets. Misalignment within benchmark datasets, caused by discrepancies in how data from different classes are encoded or transformed, leads models to learn shortcuts. These shortcuts make detectors overly reliant on factors such as image compression, causing biased predictions of real-world images that inevitably undergo compression. In this work, we reveal the misalignment in widely used benchmark datasets and demonstrate that aligning datasets improves model robustness and generalizability. Additionally, we propose leveraging pre-trained visual encoders to further enhance performance in real-world scenarios. Our approach achieves significant performance gains, highlighting the importance of dataset alignment for real-world AI-generated image detection.

Original languageEnglish
Title of host publicationACM WDC 2025 - Proceedings of the 4th Workshop on the security implications of Deepfakes and Cheapfakes
PublisherAssociation for Computing Machinery, Inc
Pages15-20
Number of pages6
ISBN (Electronic)9798400714191
DOIs
StatePublished - 25 Aug 2025
Event4th Workshop on Security Implications of Deepfakes and Cheapfakes, WDC 2025 - Hanoi, Viet Nam
Duration: 25 Aug 202529 Aug 2025

Publication series

NameACM WDC 2025 - Proceedings of the 4th Workshop on the security implications of Deepfakes and Cheapfakes

Conference

Conference4th Workshop on Security Implications of Deepfakes and Cheapfakes, WDC 2025
Country/TerritoryViet Nam
CityHanoi
Period25/08/2529/08/25

Keywords

  • AI-Generated Image Detection
  • Dataset Alignment
  • Generalization

Cite this