DASM: Data-Streaming-Based Computing in Nonvolatile Memory Architecture for Embedded System

doi:10.1109/TVLSI.2019.2912941

Journal Article10.1109/TVLSI.2019.2912941

DASM: Data-Streaming-Based Computing in Nonvolatile Memory Architecture for Embedded System

Liang Chang, +6 more

- 09 May 2019

- IEEE Transactions on Very Large Scale In...

- Vol. 27, Iss: 9, pp 2046-2059

16

TL;DR: A data-streaming design for the NVM-based CIM (e.g., DASM), which achieves speedup compared to the NVIDIA Jetson TK1 embedded GPU board, Intel Xeon E5-2640 CPU, the state-of-the-art field-programmable gate array (FPGA) design, with much lower power consumption.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1109/TVLSI.2019.2926984

PXNOR-BNN: In/With Spin-Orbit Torque MRAM Preset-XNOR Operation-Based Binary Neural Networks

Liang Chang, +5 more

- 22 Jul 2019

- IEEE Transactions on Very Large Scale In...

TL;DR: An NVM-based CIM architecture employing a Preset-XNOR operation in/with the spin–orbit torque magnetic random access memory (SOT-MRAM) to accelerate the computation of BNNs (PXNOR-BNN) is proposed.

...read moreread less

59

Journal Article•10.1007/S11432-021-3220-0

A survey of in-spin transfer torque MRAM computing

Hao Cai, +6 more

- 10 May 2021

- Science in China Series F: Information S...

TL;DR: This study reviews state-of-the-art techniques for managing IMC with an emphasis on spin-transfer torque-MRAM computing via design schemes at the bit-cell, circuit, and system levels and demonstrates the existing limitations of in- MRAM computing and potential methods for overcoming these issues.

...read moreread less

27

Journal Article•10.1007/S11432-021-3234-0

Energy-efficient computing-in-memory architecture for AI processor: device, circuit, architecture perspective

Liang Chang, +9 more

- 11 May 2021

- Science in China Series F: Information S...

TL;DR: In this article, the authors analyze the requirement of AI algorithms on the data movement and low power requirement of the AI processors and present several novel solutions beyond traditional analog-digital mixed static random access memory (SRAM)-based CIM architecture.

...read moreread less

16

Proceedings Article•10.1109/ITC44170.2019.9000146

Fault-Tolerant Neuromorphic Computing Systems

Arjun Chaudhuri, +2 more

- 01 Nov 2019

TL;DR: A survey of research on fault modeling, test generation methodologies, and fault-tolerant design of neuromorphic computing systems based on RRAM and MRAM is presented.

...read moreread less

12

Proceedings Article•10.23919/DATE51398.2021.9474022

SpinLiM: Spin Orbit Torque Memory for Ternary Neural Networks Based on the Logic-in-Memory Architecture

Lichuan Luo, +5 more

- 01 Feb 2021

TL;DR: In this article, two magnetic tunnel junctions (MTJ) are driven by the interplay of field-free spin orbit torque (SOT) and spin transfer torque (STT) effects to achieve a novel state-of-the-art paradigm for ternary multiplication operations.

...read moreread less

10

...

Expand

References

•Journal Article•10.1145/3065386

ImageNet classification with deep convolutional neural networks

Alex Krizhevsky, +2 more

- 24 May 2017

- Communications of The ACM

TL;DR: A large, deep convolutional neural network was trained to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes and employed a recently developed regularization method called "dropout" that proved to be very effective.

...read moreread less

98.2K

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

Journal Article•10.1038/NATURE06932

The missing memristor found

Dmitri B. Strukov, +3 more

- 01 May 2008

- Nature

TL;DR: It is shown, using a simple analytical example, that memristance arises naturally in nanoscale systems in which solid-state electronic and ionic transport are coupled under an external bias voltage.

...read moreread less

11.1K

•Book Chapter•10.1007/978-3-319-46493-0_32

XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks

Mohammad Rastegari, +4 more

- 08 Oct 2016

TL;DR: The Binary-Weight-Network version of AlexNet is compared with recent network binarization methods, BinaryConnect and BinaryNets, and outperform these methods by large margins on ImageNet, more than \(16\,\%\) in top-1 accuracy.

...read moreread less

5.2K

•Posted Content

Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1

Matthieu Courbariaux, +4 more

- 09 Feb 2016

- arXiv: Learning

TL;DR: A binary matrix multiplication GPU kernel is written with which it is possible to run the MNIST BNN 7 times faster than with an unoptimized GPU kernel, without suffering any loss in classification accuracy.

...read moreread less

2.8K