Context-Aware Recognition of Elevator Buttons Using a Sequential Training Methodology

  • Arpan Ghosh
  • , Kyeong Jin Joo
  • , Gilberto Galvis Giraldo
  • , Tae Yong Kuc

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we present a sequential training methodology aimed at improving the recognition of elevator buttons using the YOLOv5 object detection model. The methodology is structured into three distinct phases. In the first phase, we generate a synthetic dataset where elevator buttons, cropped from their original context, are placed on random image backgrounds. This phase is designed to help the model learn to identify buttons independently of their surroundings, ensuring a foundational understanding of button features without contextual distractions. In the second phase, we augment the cropped button dataset by applying various transformations such as random flips, rotations, and scaling. These augmentations increase the diversity and robustness of the training data, allowing the model to generalize better to variations in button appearances. The final phase involves training the model on images of full elevator panels. This step is crucial for helping the model understand the contextual placement and spatial relationships of the buttons within the panel, which is essential for accurate detection in real-world scenarios. Additionally, we enhance the real-time video input exposure to improve visibility under varying lighting conditions, addressing common challenges faced in practical applications. For post-processing, we integrate a Channel and Spatial Reliability Tracker (CSRT) to maintain button-tracking consistency in video sequences. This tracker helps ensure that once a button is detected, its position is reliably followed across frames, improving the overall accuracy and reliability of the system. This comprehensive approach, which combines the use of synthetic data, extensive data augmentation techniques, and contextual training on full panel images, aims to better simulate real-world scenarios. As a result, the proposed methodology significantly enhances the robustness and reliability of the YOLOv5 model in recognizing elevator buttons under diverse conditions.

Original languageEnglish
Title of host publication2024 24th International Conference on Control, Automation and Systems, ICCAS 2024
PublisherIEEE Computer Society
Pages596-601
Number of pages6
ISBN (Electronic)9788993215380
DOIs
StatePublished - 2024
Event24th International Conference on Control, Automation and Systems, ICCAS 2024 - Jeju, Korea, Republic of
Duration: 29 Oct 20241 Nov 2024

Publication series

NameInternational Conference on Control, Automation and Systems
ISSN (Print)1598-7833

Conference

Conference24th International Conference on Control, Automation and Systems, ICCAS 2024
Country/TerritoryKorea, Republic of
CityJeju
Period29/10/241/11/24

Keywords

  • Context-Aware
  • CSRT tracker
  • Data augmentation
  • Object detection
  • Sequential training
  • Synthetic data
  • YOLOv5

Fingerprint

Dive into the research topics of 'Context-Aware Recognition of Elevator Buttons Using a Sequential Training Methodology'. Together they form a unique fingerprint.

Cite this