TY - GEN
T1 - PDET
T2 - 27th International Conference on Pattern Recognition, ICPR 2024
AU - Xiong, Mingfu
AU - Liang, Jingbang
AU - Guo, Yifei
AU - Lee, Ik Hyun
AU - Bakshi, Sambit
AU - Muhammad, Khan
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025
Y1 - 2025
N2 - Visible-Infrared Person Re-identification (VI-ReID) would effectively improve the recognition performance in weak-lighting and nighttime scenes, which is an important research direction in pattern recognition and computer vision. However, existing methods usually focus on reducing the image differences between modalities (visible and infrared) to extract more reliable features, while neglecting the ability to discriminate the different identities with similar appearances. To address this problem, we propose a framework called “Progressive Diversity Expansion Transformer (PDET)”, which includes a Diversity Distinguishing Vision Transformer Module (DDViTM) and a Cross-Modality Similarity Matching (CMSM) module for VI-ReID in this study. The DDViTM is proposed to implement the multiple embedded output vectors for a single input, learning feature representations of individual pedestrians in different modalities. The second module (CMSM) is used to improve the feature similarity between visible and infrared images, and dynamically adjust the image sequence weights of the two modalities to complete the training and optimization efficiency for the entire network. We conducted extensive experiments on the SYSU-MM01 and RegDB datasets, widely recognized public datasets for VR-ReID. The results demonstrate that the algorithm presented in this work has achieved promising performance compared to state-of-the-art methods. The code is available at https://github.com/jxsiaj/PEDT.git.
AB - Visible-Infrared Person Re-identification (VI-ReID) would effectively improve the recognition performance in weak-lighting and nighttime scenes, which is an important research direction in pattern recognition and computer vision. However, existing methods usually focus on reducing the image differences between modalities (visible and infrared) to extract more reliable features, while neglecting the ability to discriminate the different identities with similar appearances. To address this problem, we propose a framework called “Progressive Diversity Expansion Transformer (PDET)”, which includes a Diversity Distinguishing Vision Transformer Module (DDViTM) and a Cross-Modality Similarity Matching (CMSM) module for VI-ReID in this study. The DDViTM is proposed to implement the multiple embedded output vectors for a single input, learning feature representations of individual pedestrians in different modalities. The second module (CMSM) is used to improve the feature similarity between visible and infrared images, and dynamically adjust the image sequence weights of the two modalities to complete the training and optimization efficiency for the entire network. We conducted extensive experiments on the SYSU-MM01 and RegDB datasets, widely recognized public datasets for VR-ReID. The results demonstrate that the algorithm presented in this work has achieved promising performance compared to state-of-the-art methods. The code is available at https://github.com/jxsiaj/PEDT.git.
KW - Cross-modality Retrieval
KW - Progressive Diversity Expansion
KW - Transformer
KW - Visible-Infrared Person Re-identification
UR - https://www.scopus.com/pages/publications/85211804362
U2 - 10.1007/978-3-031-78341-8_28
DO - 10.1007/978-3-031-78341-8_28
M3 - Conference contribution
AN - SCOPUS:85211804362
SN - 9783031783401
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 439
EP - 454
BT - Pattern Recognition - 27th International Conference, ICPR 2024, Proceedings
A2 - Antonacopoulos, Apostolos
A2 - Chaudhuri, Subhasis
A2 - Chellappa, Rama
A2 - Liu, Cheng-Lin
A2 - Bhattacharya, Saumik
A2 - Pal, Umapada
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 1 December 2024 through 5 December 2024
ER -