DeepCABAC: Context-adaptive binary arithmetic coding for deep neural network compression.

Open Access • Posted Content

- 15 May 2019

18

TL;DR: DeepCABAC is a novel context-adaptive binary arithmetic coder for compressing deep neural networks. It quantizes each weight parameter by minimizing a weighted rate-distortion function, which implicitly takes into account the impact of quantization on the accuracy of the network.
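The weighted rate-distortion idea in the summary can be illustrated with a minimal sketch. It is not the authors' implementation: the uniform grid, the per-weight `importances`, the trade-off parameter `lam`, and the function names `rd_quantize` and `toy_rate_bits` are all assumptions made for illustration. In particular, `toy_rate_bits` is a fixed stand-in for the bit cost, whereas a real CABAC-based coder would estimate the rate from its adaptive context models.

```python
import math


def toy_rate_bits(k):
    # Hypothetical rate model (an assumption, not CABAC): level 0 is the
    # cheapest symbol and the cost grows with the magnitude of the level.
    return 1.0 + 2.0 * math.log2(1 + abs(k))


def rd_quantize(weights, importances, step, lam, rate_bits):
    """For each weight w with importance eta, pick the integer level k that
    minimizes eta * (w - k * step)**2 + lam * rate_bits(k)."""
    levels = []
    for w, eta in zip(weights, importances):
        nearest = round(w / step)
        # Candidate levels: everything between 0 and the nearest level,
        # plus one extra level on each side.
        candidates = range(min(0, nearest) - 1, max(0, nearest) + 2)
        best = min(
            candidates,
            key=lambda k: eta * (w - k * step) ** 2 + lam * rate_bits(k),
        )
        levels.append(best)
    return levels


# The same weight value is quantized differently depending on its importance:
# the low-importance weight moves to a cheaper level, the high-importance
# weight stays at the nearest grid point.
print(rd_quantize([0.26, 0.26], [1.0, 100.0], 0.1, 0.01, toy_rate_bits))  # [2, 3]
```

With `lam = 0` the sketch reduces to nearest-level rounding, and a large `lam` pushes every weight toward the cheapest level (here 0), which is the usual rate-distortion trade-off.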

Citations

•Journal Article•10.1109/MCOM.001.1900664

When Machine Learning Meets Wireless Cellular Networks: Deployment, Challenges, and Applications

Ursula Challita, +2 more

- 15 Jul 2020

- IEEE Communications Magazine

TL;DR: In this article, the authors provide an overview of the integration of AI functionalities in 5G and beyond networks and highlight applications to the physical layer, mobility management, wireless security, and localization.

66

•Posted Content

When Machine Learning Meets Wireless Cellular Networks: Deployment, Challenges, and Applications

Ursula Challita, +2 more

- 08 Nov 2019

- arXiv: Information Theory

TL;DR: An overview of the integration of AI functionalities in 5G and beyond networks is provided, and key factors for successful AI integration such as data, security, and explainable AI are highlighted.

44

•Proceedings Article

Scalable Model Compression by Entropy Penalized Reparameterization

Deniz Oktay, +3 more

- 30 Apr 2020

TL;DR: In this article, the network parameters (weights and biases) are represented in a "latent" space, which is used to impose an entropy penalty on the parameter representation during training, and to compress the representation using a simple arithmetic coder after training.

15

Proceedings Article•10.1109/DCC50243.2021.00033

Rate-Distortion Optimized Coding for Efficient CNN Compression

Wang Zhe, +5 more

- 23 Mar 2021

TL;DR: This paper presents a coding framework for deep convolutional neural network compression that incorporates three coding ingredients (bit allocation, dead zone quantization, and Tunstall coding) to improve the rate-distortion frontier without introducing noticeable system-level overhead.

13

Journal Article•10.1007/s11277-023-10558-2

Deep Learning Based Video Compression Techniques with Future Research Issues

Helen K Joy, +3 more

- 28 Jun 2023

- Wireless Personal Communications

TL;DR: The development of intelligent and self-trained steps in video compression with deep learning is reviewed in detail, and the relevant and noteworthy work that arose in each step of compression is covered in this paper.

8

References

Journal Article•10.1038/NATURE14539

Deep learning

Yann LeCun, +4 more

- 28 May 2015

- Nature

TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.

67K

•Proceedings Article

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

Song Han, +3 more

- 15 Feb 2016

TL;DR: Deep Compression is a three-stage pipeline (pruning, trained quantization, and Huffman coding) that reduces the storage requirement of neural networks by 35x to 49x without affecting their accuracy.

8.5K

Journal Article•10.1109/JRPROC.1952.273898

A Method for the Construction of Minimum-Redundancy Codes

David A. Huffman

- 01 Sep 1952

TL;DR: A minimum-redundancy code is one constructed in such a way that the average number of coding digits per message is minimized.

6.1K

Journal Article•10.1007/BF02837279

A method for the construction of minimum-redundancy codes

David A. Huffman

- 01 Feb 2006

- Resonance

TL;DR: A minimum-redundancy code is one constructed in such a way that the average number of coding digits per message is minimized.

5.2K

•Posted Content

Learning both Weights and Connections for Efficient Neural Networks

Song Han, +3 more

- 08 Jun 2015

- arXiv: Neural and Evolutionary Computing

TL;DR: A method that reduces the storage and computation required by neural networks by an order of magnitude without affecting their accuracy, by learning only the important connections and pruning redundant ones using a three-step method.

4.2K