A 44.1TOPS/W Precision-Scalable Accelerator for Quantized Neural Networks in 28nm CMOS

  • Sungju Ryu
  • , Hyungjun Kim
  • , Wooseok Yi
  • , Jongeun Koo
  • , Eunhwan Kim
  • , Yulhwa Kim
  • , Taesu Kim
  • , Jae Joon Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Supporting variable precision for computing quantized neural network in a hardware accelerator is an efficient way to reduce overall computation time and energy. However, in the previous precision-scalable hardware, bit-reconfiguration logic increases the chip area significantly. In this paper, we demonstrate a compact precision-scalable accelerator chip using bitwise summation and channel-wise aligning schemes. The measurement results show that the peak performance per compute area is improved by 5.1-7.7x and system-level energy-efficiency is improved by up to 64% compared to previous precision-scalable accelerators.

Original languageEnglish
Title of host publication2020 IEEE Custom Integrated Circuits Conference, CICC 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728160313
DOIs
StatePublished - Mar 2020
Externally publishedYes
Event2020 IEEE Custom Integrated Circuits Conference, CICC 2020 - Boston, United States
Duration: 22 Mar 202025 Mar 2020

Publication series

NameProceedings of the Custom Integrated Circuits Conference
Volume2020-March
ISSN (Print)0886-5930

Conference

Conference2020 IEEE Custom Integrated Circuits Conference, CICC 2020
Country/TerritoryUnited States
CityBoston
Period22/03/2025/03/20

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Deep neural network
  • hardware accelerator. bit-precision scaling
  • multiply-accumulate unit

Fingerprint

Dive into the research topics of 'A 44.1TOPS/W Precision-Scalable Accelerator for Quantized Neural Networks in 28nm CMOS'. Together they form a unique fingerprint.

Cite this