TY - GEN
T1 - Combating Dataset Misalignment for Robust AI-Generated Image Detection in the Real World
AU - Choi, Hyeongjun
AU - Jung, Inho
AU - Woo, Simon S.
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/8/25
Y1 - 2025/8/25
N2 - AI-generated images are increasingly prevalent on the web, raising concerns about the real-world applicability of detection methods. While current detectors perform well on benchmark datasets, they suffer significant performance degradation on real-world datasets. Misalignment within benchmark datasets, caused by discrepancies in how data from different classes are encoded or transformed, leads models to learn shortcuts. These shortcuts make detectors overly reliant on factors such as image compression, causing biased predictions of real-world images that inevitably undergo compression. In this work, we reveal the misalignment in widely used benchmark datasets and demonstrate that aligning datasets improves model robustness and generalizability. Additionally, we propose leveraging pre-trained visual encoders to further enhance performance in real-world scenarios. Our approach achieves significant performance gains, highlighting the importance of dataset alignment for real-world AI-generated image detection.
AB - AI-generated images are increasingly prevalent on the web, raising concerns about the real-world applicability of detection methods. While current detectors perform well on benchmark datasets, they suffer significant performance degradation on real-world datasets. Misalignment within benchmark datasets, caused by discrepancies in how data from different classes are encoded or transformed, leads models to learn shortcuts. These shortcuts make detectors overly reliant on factors such as image compression, causing biased predictions of real-world images that inevitably undergo compression. In this work, we reveal the misalignment in widely used benchmark datasets and demonstrate that aligning datasets improves model robustness and generalizability. Additionally, we propose leveraging pre-trained visual encoders to further enhance performance in real-world scenarios. Our approach achieves significant performance gains, highlighting the importance of dataset alignment for real-world AI-generated image detection.
KW - AI-Generated Image Detection
KW - Dataset Alignment
KW - Generalization
UR - https://www.scopus.com/pages/publications/105019492951
U2 - 10.1145/3709022.3736541
DO - 10.1145/3709022.3736541
M3 - Conference contribution
AN - SCOPUS:105019492951
T3 - ACM WDC 2025 - Proceedings of the 4th Workshop on the security implications of Deepfakes and Cheapfakes
SP - 15
EP - 20
BT - ACM WDC 2025 - Proceedings of the 4th Workshop on the security implications of Deepfakes and Cheapfakes
PB - Association for Computing Machinery, Inc
T2 - 4th Workshop on Security Implications of Deepfakes and Cheapfakes, WDC 2025
Y2 - 25 August 2025 through 29 August 2025
ER -