Bandwidth-Aware Adaptive Codec for DNN Inference Offloading in IoT

doi:10.1007/978-3-031-19839-7_6

Book Chapter10.1007/978-3-031-19839-7_6

Bandwidth-Aware Adaptive Codec for DNN Inference Offloading in IoT

Xiufeng Xie, +3 more

- 01 Jan 2022

pp 88-104

3

TL;DR: In this article , a bandwidth-aware adaptive compression solution that learns the JPEG encoding parameters to optimize the DNN inference accuracy under bandwidth constraints is proposed, where the compressed image size is modeled as a closed-form function of encoding parameters by analyzing the JPEG codec workflow.

Abstract: The lightweight nature of IoT devices makes it challenging to run deep neural networks (DNNs) locally for applications like augmented reality. Recent advances in IoT communication like LTE-M have significantly boosted the link bandwidth, enabling IoT devices to stream visual data to edge servers running DNNs for inference. However, uncompressed visual data can still easily overload the IoT link, and the wireless spectrum is shared by numerous IoT devices, causing unstable link bandwidth. Mainstream codecs can reduce the traffic but at the cost of severe inference accuracy drops. Recent works on differentiable JPEG train the codec to tackle the damage to inference accuracy. But they rely on heuristic configurations in the loss function to balance the rate-accuracy tradeoff, providing no guarantee to meet the IoT bandwidth constraint. This paper presents AutoJPEG, a bandwidth-aware adaptive compression solution that learns the JPEG encoding parameters to optimize the DNN inference accuracy under bandwidth constraints. We model the compressed image size as a closed-form function of encoding parameters by analyzing the JPEG codec workflow. Furthermore, we formulate a constrained optimization framework to minimize the original DNN loss while ensuring the image size strictly meets the bandwidth constraint. Our evaluation validates AutoJPEG on various DNN models and datasets. In our experiments, AutoJPEG outperforms the mainstream codecs (like JPEG and WebP) and the state-of-the-art solutions that optimize the image codec for DNN inference.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arxiv.2309.06978

Differentiable JPEG: The Devil is in the Details

Christoph Reich, +3 more

- 13 Sep 2023

- arXiv.org

TL;DR: This paper conducts a comprehensive review of existing diff.JPEG approaches and identifies critical details that have been missed by previous methods, and proposes a novel diff.

...read moreread less

Preprint•10.48550/arxiv.2308.16215

Deep Video Codec Control

Christoph Reich, +4 more

- 01 Jan 2023

TL;DR: Standard video codecs designed for minimizing video distortion w.r.t. human quality assessment significantly degrade deep vision model performance. This paper presents the first end-to-end learnable deep video codec control that considers both bandwidth constraints and downstream deep vision performance.

...read moreread less

Journal Article•10.1109/wacv57701.2024.00408

Differentiable JPEG: The Devil is in the Details

Christoph Reich, +3 more

- 03 Jan 2024

TL;DR: Differentiable JPEG approximates the standard JPEG coding well for both high and low JPEG quality values, while Shin et al. [24] does not approximate well at low quality values.

...read moreread less

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Posted Content

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 10 Dec 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.

...read moreread less

117.9K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 04 Sep 2014

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

102.6K

•Proceedings Article•10.1109/CVPR.2016.350

The Cityscapes Dataset for Semantic Urban Scene Understanding

Marius Cordts, +8 more

- 01 Jun 2016

TL;DR: This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity.

...read moreread less

11.5K

•Posted Content

The Cityscapes Dataset for Semantic Urban Scene Understanding

Marius Cordts, +8 more

- 06 Apr 2016

- arXiv: Computer Vision and Pattern Recog...

TL;DR: Cityscapes as discussed by the authors is a large-scale dataset for semantic urban scene understanding, consisting of 5000 images with high quality pixel-level annotations and 200,000 additional images with coarse annotations.

...read moreread less

7.8K

...

Expand

Bandwidth-Aware Adaptive Codec for DNN Inference Offloading in IoT

Chat with Paper

AI Agents for this Paper

Citations

Differentiable JPEG: The Devil is in the Details

Deep Video Codec Control

Differentiable JPEG: The Devil is in the Details

References

Deep Residual Learning for Image Recognition

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

The Cityscapes Dataset for Semantic Urban Scene Understanding

The Cityscapes Dataset for Semantic Urban Scene Understanding

Related Papers (5)

Robust image compression with packetization: the JPEG-2000 case

Implementation of a video streaming system using scalable extension of h.264

Lossy Image Compression with Compressive Autoencoders

Lossy Image Compression with Compressive Autoencoders

Quality and Complexity Assessment of Learning-Based Image Compression Solutions