Kernel Shape Control for Row-Efficient Convolution on Processing-In-Memory Arrays

  • Johnny Rhe
  • , Kang Eun Jeon
  • , Joo Chan Lee
  • , Seongmoon Jeong
  • , Jong Hwan Ko

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Processing-in-memory (PIM) architectures have been highlighted as one of the viable solutions for faster and more power-efficient convolutional neural networks (CNNs) inference. Recently, shift and duplicate kernel (SDK) convolutional weight mapping scheme was proposed, achieving up to 50% through-put improvement over the prior arts. However, the traditional pattern-based pruning methods, which were adopted for row-skipping and computing cycle reduction, are not optimal for the latest SDK mapping due to structural irregularity caused by the shifted and duplicated kernels. To address this issue, we propose a method called kernel shape control (KERNTROL) that aims to promote structural regularity for achieving a high row-skipping ratio and model accuracy. Instead of pruning certain weight elements permanently, KERNTROL controls the kernel shapes through the omission of certain weights based on their mapped columns. In comparison to the latest pattern-based pruning approaches, KERNTROL achieves up to 36.4% improvement in the compression rate, and 38.6% in array utilization with maintaining the original model accuracy.

Original languageEnglish
Title of host publication2023 42nd IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2023 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350315592
DOIs
StatePublished - 2023
Event42nd IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2023 - San Francisco, United States
Duration: 28 Oct 20232 Nov 2023

Publication series

NameIEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD
ISSN (Print)1092-3152

Conference

Conference42nd IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2023
Country/TerritoryUnited States
CitySan Francisco
Period28/10/232/11/23

Keywords

  • neural compression
  • processing-in-memory
  • shift and duplicate (SDK) weight mapping
  • weight pruning

Fingerprint

Dive into the research topics of 'Kernel Shape Control for Row-Efficient Convolution on Processing-In-Memory Arrays'. Together they form a unique fingerprint.

Cite this