C-AFA: A Conditionally Approximate Full Adder for Efficient DNN Inference in CIM Arrays

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Digital compute-in-memory (DCIM) architectures are becoming crucial for real-time and accurate deep neural network (DNN) inference due to their capacity for precise computations. However, traditional DCIM systems often struggle to balance precise data processing with computational efficiency. In scenarios where exact computations are not always necessary, especially in DNNs featuring sparse data, existing multi-bit operations frequently fail to achieve an ideal balance between accuracy and efficiency. In this paper, we introduce a novel approach to DCIM architecture that selectively skips computations by using probabilistically determined values, effectively reducing the computational load. Our simulation results demonstrate that this method significantly reduces computing cycles, achieving a speedup of up to 1.5× compared to traditional methods, while maintaining accuracy with only a minimal decrease of 1.5%.

Original languageEnglish
Title of host publicationProceedings - International SoC Design Conference 2024, ISOCC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages382-383
Number of pages2
ISBN (Electronic)9798350377088
DOIs
StatePublished - 2024
Event21st International System-on-Chip Design Conference, ISOCC 2024 - Sapporo, Japan
Duration: 19 Aug 202422 Aug 2024

Publication series

NameProceedings - International SoC Design Conference 2024, ISOCC 2024

Conference

Conference21st International System-on-Chip Design Conference, ISOCC 2024
Country/TerritoryJapan
CitySapporo
Period19/08/2422/08/24

Keywords

  • approximate full adder (AFA)
  • deep neural network (DNN)
  • digital CIM (DCIM)

Fingerprint

Dive into the research topics of 'C-AFA: A Conditionally Approximate Full Adder for Efficient DNN Inference in CIM Arrays'. Together they form a unique fingerprint.

Cite this