TY - GEN
T1 - Medical SAM Adapter++
T2 - 2025 International Technical Conference on Circuits/Systems, Computers, and Communications, ITC-CSCC 2025
AU - Jung, Seungho
AU - Lee, Jae Joon
AU - Shin, Jitae
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Vision Transformer (ViT) models have spearheaded the computer vision community for various image-related tasks, specifically showing significant results in interactive segmentation tasks using Segment Anything Model (SAM). Adapters can be incorporated into SAM for adaptation to the different domains, whilst significantly reducing training costs compared to the fine-tuning of the entire model. This was especially true for datasets from the medical domain, where it presented results superior to state-of-the-art models. However, the simple incorporation of adapters fails to leverage the statistical differences between the probability distribution of data from various domains. This paper addresses this issue by introducing Adaptive Layer Normalization into SAM, which has more theoretical background as a solution to domain differences than sequential adapters. Furthermore, this paper also proposes a novel method, domain interpreter, for determining the weights to Adaptive Layer Normalization. Focusing on the Osteoarthritis (OAI) Dataset, the proposed method shows improved results in IoU and DICE metric scores, showing that the distinct domain distributions are the cause of inaccuracies of baseline models, and that the models can quickly be adapted to different domains by adjusting to such data distributions.
AB - Vision Transformer (ViT) models have spearheaded the computer vision community for various image-related tasks, specifically showing significant results in interactive segmentation tasks using Segment Anything Model (SAM). Adapters can be incorporated into SAM for adaptation to the different domains, whilst significantly reducing training costs compared to the fine-tuning of the entire model. This was especially true for datasets from the medical domain, where it presented results superior to state-of-the-art models. However, the simple incorporation of adapters fails to leverage the statistical differences between the probability distribution of data from various domains. This paper addresses this issue by introducing Adaptive Layer Normalization into SAM, which has more theoretical background as a solution to domain differences than sequential adapters. Furthermore, this paper also proposes a novel method, domain interpreter, for determining the weights to Adaptive Layer Normalization. Focusing on the Osteoarthritis (OAI) Dataset, the proposed method shows improved results in IoU and DICE metric scores, showing that the distinct domain distributions are the cause of inaccuracies of baseline models, and that the models can quickly be adapted to different domains by adjusting to such data distributions.
KW - adapters
KW - medical AI
KW - segmentation
UR - https://www.scopus.com/pages/publications/105016377856
U2 - 10.1109/ITC-CSCC66376.2025.11137584
DO - 10.1109/ITC-CSCC66376.2025.11137584
M3 - Conference contribution
AN - SCOPUS:105016377856
T3 - 2025 International Technical Conference on Circuits/Systems, Computers, and Communications, ITC-CSCC 2025
BT - 2025 International Technical Conference on Circuits/Systems, Computers, and Communications, ITC-CSCC 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 7 July 2025 through 10 July 2025
ER -