TY - GEN
T1 - Self-supervised label augmentation via input transformations
AU - Lee, Hankook
AU - Hwang, Sung Ju
AU - Shin, Jinwoo
N1 - Publisher Copyright:
© International Conference on Machine Learning, ICML 2020. All rights reserved.
PY - 2020
Y1 - 2020
N2 - Self-supervised learning, which learns by con_structing artificial labels given only the input signals, has recently gained considerable at_tention for learning representations with unla_beled datasets, i.e., learning without any human_annotated supervision. In this paper, we show that such a technique can be used to significantly improve the model accuracy even under fully_labeled datasets. Our scheme trains the model to learn both original and self-supervised tasks, but is different from conventional multi-task learn_ing frameworks that optimize the summation of their corresponding losses. Our main idea is to learn a single unified task with respect to the joint distribution of the original and self-supervised labels, i.e., we augment original labels via self_supervision of input transformation. This simple, yet effective approach allows to train models eas_ier by relaxing a certain invariant constraint dur_ing learning the original and self-supervised tasks simultaneously. It also enables an aggregated inference which combines the predictions from different augmentations to improve the predic_tion accuracy. Furthermore, we propose a novel knowledge transfer technique, which we refer to as self-distillation, that has the effect of the aggre_gated inference in a single (faster) inference. We demonstrate the large accuracy improvement and wide applicability of our framework on various fully-supervised settings, e.g., the few-shot and imbalanced classification scenarios.
AB - Self-supervised learning, which learns by con_structing artificial labels given only the input signals, has recently gained considerable at_tention for learning representations with unla_beled datasets, i.e., learning without any human_annotated supervision. In this paper, we show that such a technique can be used to significantly improve the model accuracy even under fully_labeled datasets. Our scheme trains the model to learn both original and self-supervised tasks, but is different from conventional multi-task learn_ing frameworks that optimize the summation of their corresponding losses. Our main idea is to learn a single unified task with respect to the joint distribution of the original and self-supervised labels, i.e., we augment original labels via self_supervision of input transformation. This simple, yet effective approach allows to train models eas_ier by relaxing a certain invariant constraint dur_ing learning the original and self-supervised tasks simultaneously. It also enables an aggregated inference which combines the predictions from different augmentations to improve the predic_tion accuracy. Furthermore, we propose a novel knowledge transfer technique, which we refer to as self-distillation, that has the effect of the aggre_gated inference in a single (faster) inference. We demonstrate the large accuracy improvement and wide applicability of our framework on various fully-supervised settings, e.g., the few-shot and imbalanced classification scenarios.
UR - https://www.scopus.com/pages/publications/85092595793
M3 - Conference contribution
AN - SCOPUS:85092595793
T3 - 37th International Conference on Machine Learning, ICML 2020
SP - 5670
EP - 5680
BT - 37th International Conference on Machine Learning, ICML 2020
A2 - Daume, Hal
A2 - Singh, Aarti
PB - International Machine Learning Society (IMLS)
T2 - 37th International Conference on Machine Learning, ICML 2020
Y2 - 13 July 2020 through 18 July 2020
ER -