TY - JOUR
T1 - Axial Constraints for Global Matching-Based Optical Flow Estimation
AU - Kim, Euiyeon
AU - Jun, Woojin
AU - Heo, Jae Pil
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2023
Y1 - 2023
N2 - Optical flow estimation is a fundamental task that aims to find the 2-dimensional motion field by identifying correspondences between two input images. For quite a long time, the correlation volume followed by convolutional neural networks (CNN) to directly estimates the optical flow was a predominant pipeline. However, several pioneering methods proposed global matching recently, pointing out the limitation that CNN-based methods are struggling to handle large displacements due to their locality. Global matching is the step that identifies global correspondences at the pixel-level using entire correlation volumes at once with simple operations like softmax. However, when global matching with softmax is combined with commonly used regression loss in optical flow estimation, there will be a vast number of possible correlation volumes that can minimize the regression loss and correctly estimate correspondences. In other words, the training objective induces a one-to-many solution problem resulting in the presence of noisy gradients. In this paper, the necessity for more constraints on the correlation volume to mitigate the aforementioned ill-posed problem is discussed. To acquire such constraints, axial cross-entropy loss (i.e. axial constraints) to restrict the correlation volume to have low variance with designed pseudo ground truth is proposed. Experimental results show that axial constraints are applicable to off-the-shelves global matching-based optical flow estimation frameworks easily and lead to both quantitative and qualitative performance improvement without any architectural changes.
AB - Optical flow estimation is a fundamental task that aims to find the 2-dimensional motion field by identifying correspondences between two input images. For quite a long time, the correlation volume followed by convolutional neural networks (CNN) to directly estimates the optical flow was a predominant pipeline. However, several pioneering methods proposed global matching recently, pointing out the limitation that CNN-based methods are struggling to handle large displacements due to their locality. Global matching is the step that identifies global correspondences at the pixel-level using entire correlation volumes at once with simple operations like softmax. However, when global matching with softmax is combined with commonly used regression loss in optical flow estimation, there will be a vast number of possible correlation volumes that can minimize the regression loss and correctly estimate correspondences. In other words, the training objective induces a one-to-many solution problem resulting in the presence of noisy gradients. In this paper, the necessity for more constraints on the correlation volume to mitigate the aforementioned ill-posed problem is discussed. To acquire such constraints, axial cross-entropy loss (i.e. axial constraints) to restrict the correlation volume to have low variance with designed pseudo ground truth is proposed. Experimental results show that axial constraints are applicable to off-the-shelves global matching-based optical flow estimation frameworks easily and lead to both quantitative and qualitative performance improvement without any architectural changes.
KW - Global matching
KW - optical flow estimation
KW - under-constrained problem
UR - https://www.scopus.com/pages/publications/85163545925
U2 - 10.1109/ACCESS.2023.3290993
DO - 10.1109/ACCESS.2023.3290993
M3 - Article
AN - SCOPUS:85163545925
SN - 2169-3536
VL - 11
SP - 69989
EP - 70000
JO - IEEE Access
JF - IEEE Access
ER -