A Weight-Sharing Autoencoder with Dynamic Quantization for Efficient Feature Compression

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Collaborative inference (CI) enhances the inference efficiency of deep neural networks (DNNs) by partitioning a computational workload between an edge device and a cloud platform. Efficient inference using CI requires searching for the optimal partition layer that minimizes the end-to-end inference latency. In addition, the intermediate feature at the partitioned layer should be effectively compressed. However, recent DNN-based feature compression methods require independent models dedicated for each partition point, resulting in significant storage overhead. In this paper, we propose a novel method that efficiently compresses the features from variable partition layers using a single autoencoder. The proposed method incorporates a weight-sharing technique that shares the weights of autoencoders that compress each partition layer. In addition, dynamic bitwidths quantization is supported for flexibility in compression ratio. The experimental results show that the proposed method reduced the required parameter size by 4× compared to the existing independent model based method, while maintaining the accuracy loss within 0.5%.

Original languageEnglish
Title of host publicationICTC 2021 - 12th International Conference on ICT Convergence
Subtitle of host publicationBeyond the Pandemic Era with ICT Convergence Innovation
PublisherIEEE Computer Society
Pages1111-1113
Number of pages3
ISBN (Electronic)9781665423830
DOIs
StatePublished - 2021
Event12th International Conference on Information and Communication Technology Convergence, ICTC 2021 - Jeju Island, Korea, Republic of
Duration: 20 Oct 202122 Oct 2021

Publication series

NameInternational Conference on ICT Convergence
Volume2021-October
ISSN (Print)2162-1233
ISSN (Electronic)2162-1241

Conference

Conference12th International Conference on Information and Communication Technology Convergence, ICTC 2021
Country/TerritoryKorea, Republic of
CityJeju Island
Period20/10/2122/10/21

Keywords

  • Autoencoder
  • Collaborative Inference
  • Dynamic Quantization
  • Feature Compression

Fingerprint

Dive into the research topics of 'A Weight-Sharing Autoencoder with Dynamic Quantization for Efficient Feature Compression'. Together they form a unique fingerprint.

Cite this