TY - GEN
T1 - Towards Accurate Low Bit DNNs with Filter-wise Quantization
AU - Kim, Hoseung
AU - Lee, Kwangbae
AU - Shin, Dongkun
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/11/1
Y1 - 2020/11/1
N2 - Execution of deep neural networks (DNNs) on a resource constraint device has become a rising issue of recent neural network research. Quantization using low bit-widths for networks is one of the most effective compression techniques. Although there have been many studies that use different bit-widths per layer to compress further the model instead of using a single bit-width, they achieved a limited reduction on the parameter size due to the layer-wise bit-width assignment. In this paper, we propose a more fine-grained and multi-precision quantization technique, called filter-wise quantization. Regularization is used while training networks to partition filters into various precision. In experiments, we show that our technique can provide better accuracy at a smaller parameter size at various DNN models for CIFAR-10 and CIFAR-100 data sets.
AB - Execution of deep neural networks (DNNs) on a resource constraint device has become a rising issue of recent neural network research. Quantization using low bit-widths for networks is one of the most effective compression techniques. Although there have been many studies that use different bit-widths per layer to compress further the model instead of using a single bit-width, they achieved a limited reduction on the parameter size due to the layer-wise bit-width assignment. In this paper, we propose a more fine-grained and multi-precision quantization technique, called filter-wise quantization. Regularization is used while training networks to partition filters into various precision. In experiments, we show that our technique can provide better accuracy at a smaller parameter size at various DNN models for CIFAR-10 and CIFAR-100 data sets.
KW - Deep Neural Networks
KW - Quantization
KW - Regularization
UR - https://www.scopus.com/pages/publications/85098886539
U2 - 10.1109/ICCE-Asia49877.2020.9277419
DO - 10.1109/ICCE-Asia49877.2020.9277419
M3 - Conference contribution
AN - SCOPUS:85098886539
T3 - 2020 IEEE International Conference on Consumer Electronics - Asia, ICCE-Asia 2020
BT - 2020 IEEE International Conference on Consumer Electronics - Asia, ICCE-Asia 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 IEEE International Conference on Consumer Electronics - Asia, ICCE-Asia 2020
Y2 - 1 November 2020 through 3 November 2020
ER -