Skip to main navigation Skip to search Skip to main content

Towards Accurate Low Bit DNNs with Filter-wise Quantization

  • Sungkyunkwan University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Execution of deep neural networks (DNNs) on a resource constraint device has become a rising issue of recent neural network research. Quantization using low bit-widths for networks is one of the most effective compression techniques. Although there have been many studies that use different bit-widths per layer to compress further the model instead of using a single bit-width, they achieved a limited reduction on the parameter size due to the layer-wise bit-width assignment. In this paper, we propose a more fine-grained and multi-precision quantization technique, called filter-wise quantization. Regularization is used while training networks to partition filters into various precision. In experiments, we show that our technique can provide better accuracy at a smaller parameter size at various DNN models for CIFAR-10 and CIFAR-100 data sets.

Original languageEnglish
Title of host publication2020 IEEE International Conference on Consumer Electronics - Asia, ICCE-Asia 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728161648
DOIs
StatePublished - 1 Nov 2020
Externally publishedYes
Event2020 IEEE International Conference on Consumer Electronics - Asia, ICCE-Asia 2020 - Seoul, Korea, Republic of
Duration: 1 Nov 20203 Nov 2020

Publication series

Name2020 IEEE International Conference on Consumer Electronics - Asia, ICCE-Asia 2020

Conference

Conference2020 IEEE International Conference on Consumer Electronics - Asia, ICCE-Asia 2020
Country/TerritoryKorea, Republic of
CitySeoul
Period1/11/203/11/20

Keywords

  • Deep Neural Networks
  • Quantization
  • Regularization

Fingerprint

Dive into the research topics of 'Towards Accurate Low Bit DNNs with Filter-wise Quantization'. Together they form a unique fingerprint.

Cite this