Timing guarantees for inference of AI models in embedded systems

  • Seunghoon Lee
  • , Woosung Kang
  • , Marko Bertogna
  • , Hoon Sung Chwa
  • , Jinkyu Lee

Research output: Contribution to journalArticlepeer-review

Abstract

Machine learning (ML) is increasingly being integrated into real-time embedded systems, enabling intelligent decision-making in applications such as autonomous driving and industrial automation. However, ensuring predictable execution of deep neural network (DNN) inference remains a major challenge, as real-time systems must meet strict timing constraints to guarantee safety and reliability. This paper identifies key challenges in achieving real-time AI inference in embedded systems, including limited memory capacity, high energy consumption, efficient multi-DNN scheduling, and heterogeneous resource management. To address these challenges, we emphasize the need for advanced scheduling algorithms to efficiently allocate heterogeneous computing resources across multiple DNNs, hierarchical memory management to reduce memory bottlenecks, and real-time neural architecture search and optimization techniques to enhance AI model performance under strict timing constraints. Furthermore, we discuss future research directions aimed at improving real-time AI execution, including time-predictable scheduling frameworks to ensure consistent inference latency, cross-device AI workload management to optimize resource utilization across heterogeneous processors, and benchmarking methodologies to systematically evaluate performance, timing guarantees, and energy efficiency in real-time AI systems. Advancing these research areas will enhance the reliability, efficiency, and scalability of AI-driven embedded systems, bridging the gap between ML advancements and real-time system requirements.

Original languageEnglish
Pages (from-to)259-267
Number of pages9
JournalReal-Time Systems
Volume61
Issue number2
DOIs
StatePublished - Jun 2025

Keywords

  • Embedded systems
  • Inference
  • Machine learning
  • Timing guarantees

Fingerprint

Dive into the research topics of 'Timing guarantees for inference of AI models in embedded systems'. Together they form a unique fingerprint.

Cite this