Efficient Recurrent Optical Flow Refinement Using Mamba and Multi-Scale Loss

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Optical flow estimation plays a critical role in various computer vision tasks, including video understanding and autonomous driving. Recent models such as RAFT and FlowFormer refine flow predictions iteratively using recurrent modules based on Convolutional Gated Recurrent Unit (Con-vGRU). However, ConvGRU has limitations in modeling long-range dependencies and requires a large number of parameters for decoder refinement. In this paper, we propose replacing the ConvGRU module in FlowFormer's decoder with Mamba, a state space sequence model optimized for efficient and expressive temporal modeling. Additionally, we introduce a multi-scale loss structure that incorporates low-resolution supervision to encourage global motion consistency and improve training stability. Our method maintains the original input structure of FlowFormer while improving both temporal modeling and multi-scale learning. Experiments on the KITTI benchmark show that our Mamba-based decoder achieves significant improvements over the original FlowFormer, reducing average end-point-error (AEPE) by 5.81% and F1-All by 13.41%, while also reducing decoder parameters by 32.65% and FLOPs by 22.88%. These results demonstrate that Mamba, combined with multi-scale loss, is a strong and lightweight alternative to ConvGRU for optical flow refinement.

Original languageEnglish
Title of host publication2025 International Technical Conference on Circuits/Systems, Computers, and Communications, ITC-CSCC 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331553630
DOIs
StatePublished - 2025
Externally publishedYes
Event2025 International Technical Conference on Circuits/Systems, Computers, and Communications, ITC-CSCC 2025 - Seoul, Korea, Republic of
Duration: 7 Jul 202510 Jul 2025

Publication series

Name2025 International Technical Conference on Circuits/Systems, Computers, and Communications, ITC-CSCC 2025

Conference

Conference2025 International Technical Conference on Circuits/Systems, Computers, and Communications, ITC-CSCC 2025
Country/TerritoryKorea, Republic of
CitySeoul
Period7/07/2510/07/25

Keywords

  • mamba
  • multi-scale loss
  • optical flow

Fingerprint

Dive into the research topics of 'Efficient Recurrent Optical Flow Refinement Using Mamba and Multi-Scale Loss'. Together they form a unique fingerprint.

Cite this