Topic

Software-defined memory

About: Software-defined memory is a research topic with 2 publications to date.

Papers

Proceedings Article • DOI: 10.1109/NEWCAS49341.2020.9159769

Quantized Guided Pruning for Efficient Hardware Implementations of Deep Neural Networks

Ghouthi Boukli Hacene¹, Vincent Gripon¹, Matthieu Arzel¹, Nicolas Farrugia¹, Yoshua Bengio

Institution: Université de Montréal¹

16 Jun 2020

TL;DR: This work proposes a combination of a pruning technique and a quantization scheme that reduces the complexity and memory usage of the convolutional layers of CNNs by replacing the complex convolutional operation with a low-cost multiplexer.

Abstract: Deep Neural Networks (DNNs) in general and Convolutional Neural Networks (CNNs) in particular are state-of-the-art in numerous computer vision tasks such as object classification and detection. However, the large amount of parameters they contain leads to a high computational complexity and strongly limits their usability in budget-constrained devices such as embedded devices. In this paper, we propose a combination of a pruning technique and a quantization scheme that effectively reduce the complexity and memory usage of convolutional layers of CNNs, by replacing the complex convolutional operation by a low-cost multiplexer. We perform experiments on CIFAR10, CIFAR100 and SVHN datasets and show that the proposed method achieves almost state-of-the-art accuracy, while drastically reducing the computational and memory footprints compared to the baselines. We also propose an efficient hardware architecture, implemented on Field Programmable Gate Arrays (FPGAs), to accelerate inference, which works as a pipeline and accommodates multiple layers working at the same time to speed up the inference process. In contrast with most proposed approaches which have used external memory or software defined memory controllers, our work is based on algorithmic optimization and full-hardware design, enabling a direct, on-chip memory implementation of a DNN while keeping close to state of the art accuracy.

6 citations

Posted Content

Supporting Massive DLRM Inference Through Software Defined Memory

Ehsan K. Ardestani¹, Changkyu Kim, Seung Jae Lee, Luoshang Pan, Valmiki Rampersad, Jens Axboe, Banit Agrawal, Fuxun Yu, Ansha Yu, Trung Le, Hector Yuen, Shishir Juluri, Akshat Nanda, Manoj Wodekar, Dheevatsa Mudigere, Krishnakumar Nair, Maxim Naumov, Chris Peterson, Mikhail Smelyanskiy, Vijay Rao

Institution: Facebook¹

21 Oct 2021 - arXiv: Hardware Architecture

TL;DR: In this paper, the authors evaluate the major challenges in extending the memory hierarchy to SCM for DLRM, and present different techniques to improve performance through a Software Defined Memory.

Abstract: Deep Learning Recommendation Models (DLRM) are widespread, account for a considerable data center footprint, and grow by more than 1.5x per year. With model size soon to be in terabytes range, leveraging Storage Class Memory (SCM) for inference enables lower power consumption and cost. This paper evaluates the major challenges in extending the memory hierarchy to SCM for DLRM, and presents different techniques to improve performance through a Software Defined Memory. We show how underlying technologies such as Nand Flash and 3DXP differentiate, and relate to real world scenarios, enabling from 5% to 29% power savings.

No. of papers in the topic in previous years
Year	Papers
2021	1
2020	1
