TY - GEN
T1 - Towards Squeezing-Averse Virtual Try-On via Sequential Deformation
AU - Shim, Sang Heon
AU - Chung, Jiwoo
AU - Heo, Jae Pil
N1 - Publisher Copyright:
Copyright © 2024, Association for the Advancement of Artificial Intelligence.
PY - 2024/3/25
Y1 - 2024/3/25
N2 - In this paper, we first investigate a visual quality degradation problem observed in recent high-resolution virtual tryon approach. The tendency is empirically found that the textures of clothes are squeezed at the sleeve, as visualized in the upper row of Fig. 1 (a). A main reason for the issue arises from a gradient conflict between two popular losses, the Total Variation (TV) and adversarial losses. Specifically, the TV loss aims to disconnect boundaries between the sleeve and torso in a warped clothing mask, whereas the adversarial loss aims to combine between them. Such contrary objectives feedback the misaligned gradients to a cascaded appearance flow estimation, resulting in undesirable squeezing artifacts. To reduce this, we propose a Sequential Deformation (SD-VITON) that disentangles the appearance flow prediction layers into TV objective-dominant (TVOB) layers and a task-coexistence (TACO) layer. Specifically, we coarsely fit the clothes onto a human body via the TVOB layers, and then keep on refining via the TACO layer. In addition, the bottom row of Fig. 1 (a) shows a different type of squeezing artifacts around the waist. To address it, we further propose that we first warp the clothes into a tuckedout shirts style, and then partially erase the texture from the warped clothes without hurting the smoothness of the appearance flows. Experimental results show that our SDVITON successfully resolves both types of artifacts and outperforms the baseline methods.
AB - In this paper, we first investigate a visual quality degradation problem observed in recent high-resolution virtual tryon approach. The tendency is empirically found that the textures of clothes are squeezed at the sleeve, as visualized in the upper row of Fig. 1 (a). A main reason for the issue arises from a gradient conflict between two popular losses, the Total Variation (TV) and adversarial losses. Specifically, the TV loss aims to disconnect boundaries between the sleeve and torso in a warped clothing mask, whereas the adversarial loss aims to combine between them. Such contrary objectives feedback the misaligned gradients to a cascaded appearance flow estimation, resulting in undesirable squeezing artifacts. To reduce this, we propose a Sequential Deformation (SD-VITON) that disentangles the appearance flow prediction layers into TV objective-dominant (TVOB) layers and a task-coexistence (TACO) layer. Specifically, we coarsely fit the clothes onto a human body via the TVOB layers, and then keep on refining via the TACO layer. In addition, the bottom row of Fig. 1 (a) shows a different type of squeezing artifacts around the waist. To address it, we further propose that we first warp the clothes into a tuckedout shirts style, and then partially erase the texture from the warped clothes without hurting the smoothness of the appearance flows. Experimental results show that our SDVITON successfully resolves both types of artifacts and outperforms the baseline methods.
UR - https://www.scopus.com/pages/publications/85189495067
U2 - 10.1609/aaai.v38i5.28288
DO - 10.1609/aaai.v38i5.28288
M3 - Conference contribution
AN - SCOPUS:85189495067
T3 - Proceedings of the AAAI Conference on Artificial Intelligence
SP - 4856
EP - 4863
BT - Technical Tracks 14
A2 - Wooldridge, Michael
A2 - Dy, Jennifer
A2 - Natarajan, Sriraam
PB - Association for the Advancement of Artificial Intelligence
T2 - 38th AAAI Conference on Artificial Intelligence, AAAI 2024
Y2 - 20 February 2024 through 27 February 2024
ER -