CORN: In-Buffer Computing for Binary Neural Network

doi:10.23919/DATE.2019.8715265

Proceedings Article10.23919/DATE.2019.8715265

CORN: In-Buffer Computing for Binary Neural Network

Liang Chang, +5 more

- 25 Mar 2019

- pp 384-389

17

TL;DR: A BNN computing accelerator, namely CORN, which consists of a Spin-Orbit-Torque Magnetic RAM based data buffer to perform the majority operation (to replace the pop-count process) with the SOT-MRAM-based IMC to accelerate the computing of BNNs.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

Gemmini: An Agile Systolic Array Generator Enabling Systematic Evaluations of Deep-Learning Architectures

Hasan Genc, +13 more

- 22 Nov 2019

TL;DR: Gemmini is presented -- an open source and agile systolic array generator enabling systematic evaluations of deep-learning architectures and achieves two to three orders of magnitude speedup in deep neural network inference compared to the baseline execution on a host processor.

...read moreread less

82

Journal Article•10.1109/TVLSI.2019.2926984

PXNOR-BNN: In/With Spin-Orbit Torque MRAM Preset-XNOR Operation-Based Binary Neural Networks

Liang Chang, +5 more

- 22 Jul 2019

- IEEE Transactions on Very Large Scale In...

TL;DR: An NVM-based CIM architecture employing a Preset-XNOR operation in/with the spin–orbit torque magnetic random access memory (SOT-MRAM) to accelerate the computation of BNNs (PXNOR-BNN) is proposed.

...read moreread less

59

Journal Article•10.1007/S11432-021-3220-0

A survey of in-spin transfer torque MRAM computing

Hao Cai, +6 more

- 10 May 2021

- Science in China Series F: Information S...

TL;DR: This study reviews state-of-the-art techniques for managing IMC with an emphasis on spin-transfer torque-MRAM computing via design schemes at the bit-cell, circuit, and system levels and demonstrates the existing limitations of in- MRAM computing and potential methods for overcoming these issues.

...read moreread less

27

Journal Article•10.1109/TVLSI.2019.2912941

DASM: Data-Streaming-Based Computing in Nonvolatile Memory Architecture for Embedded System

Liang Chang, +6 more

- 09 May 2019

- IEEE Transactions on Very Large Scale In...

TL;DR: A data-streaming design for the NVM-based CIM (e.g., DASM), which achieves speedup compared to the NVIDIA Jetson TK1 embedded GPU board, Intel Xeon E5-2640 CPU, the state-of-the-art field-programmable gate array (FPGA) design, with much lower power consumption.

...read moreread less

17

Journal Article•10.1007/S11432-021-3234-0

Energy-efficient computing-in-memory architecture for AI processor: device, circuit, architecture perspective

Liang Chang, +9 more

- 11 May 2021

- Science in China Series F: Information S...

TL;DR: In this article, the authors analyze the requirement of AI algorithms on the data movement and low power requirement of the AI processors and present several novel solutions beyond traditional analog-digital mixed static random access memory (SRAM)-based CIM architecture.

...read moreread less

16

...

Expand

References

•Journal Article•10.1145/3065386

ImageNet classification with deep convolutional neural networks

Alex Krizhevsky, +2 more

- 24 May 2017

- Communications of The ACM

TL;DR: A large, deep convolutional neural network was trained to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes and employed a recently developed regularization method called "dropout" that proved to be very effective.

...read moreread less

98.2K

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

•Book Chapter•10.1007/978-3-319-46493-0_32

XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks

Mohammad Rastegari, +4 more

- 08 Oct 2016

TL;DR: The Binary-Weight-Network version of AlexNet is compared with recent network binarization methods, BinaryConnect and BinaryNets, and outperform these methods by large margins on ImageNet, more than \(16\,\%\) in top-1 accuracy.

...read moreread less

5.2K

•Posted Content

Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1

Matthieu Courbariaux, +4 more

- 09 Feb 2016

- arXiv: Learning

TL;DR: A binary matrix multiplication GPU kernel is written with which it is possible to run the MNIST BNN 7 times faster than with an unoptimized GPU kernel, without suffering any loss in classification accuracy.

...read moreread less

2.8K

Journal Article•10.1109/TCAD.2012.2185930

NVSim: A Circuit-Level Performance, Energy, and Area Model for Emerging Nonvolatile Memory

Xiangyu Dong, +3 more

- 01 Jul 2012

- IEEE Transactions on Computer-Aided Desi...

TL;DR: NVSim is developed, a circuit-level model for NVM performance, energy, and area estimation, which supports various NVM technologies, including STT-RAM, PCRAM, ReRAM, and legacy NAND Flash and is expected to help boost architecture-level NVM-related studies.

...read moreread less

1.3K