Mixed-Precision In-Memory Computing
Manuel Le Gallo,Manuel Le Gallo,Abu Sebastian,Roland Mathis,Matteo Manica,Matteo Manica,Heiner Giefers,Tomas Tuma,Costas Bekas,Alessandro Curioni,Evangelos Eleftheriou +10 more
TL;DR: In this article, a mixed precision in-memory computing (MIMO) system is proposed, which combines a von Neumann machine with a computational memory unit. But it does not address the limitations arising from device variability and nonideal device characteristics.
read more
Abstract: As CMOS scaling reaches its technological limits, a radical departure from traditional von Neumann systems, which involve separate processing and memory units, is needed in order to significantly extend the performance of today's computers. In-memory computing is a promising approach in which nanoscale resistive memory devices, organized in a computational memory unit, are used for both processing and memory. However, to reach the numerical accuracy typically required for data analytics and scientific computing, limitations arising from device variability and non-ideal device characteristics need to be addressed. Here we introduce the concept of mixed-precision in-memory computing, which combines a von Neumann machine with a computational memory unit. In this hybrid system, the computational memory unit performs the bulk of a computational task, while the von Neumann machine implements a backward method to iteratively improve the accuracy of the solution. The system therefore benefits from both the high precision of digital computing and the energy/areal efficiency of in-memory computing. We experimentally demonstrate the efficacy of the approach by accurately solving systems of linear equations, in particular, a system of 5,000 equations using 998,752 phase-change memory devices.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Towards spike-based machine intelligence with neuromorphic computing.
TL;DR: An overview of the developments in neuromorphic computing for both algorithms and hardware is provided and the fundamentals of learning and hardware frameworks are highlighted, with emphasis on algorithm–hardware codesign.
1.6K
Memory devices and applications for in-memory computing
TL;DR: This Review provides an overview of memory devices and the key computational primitives enabled by these memory devices as well as their applications spanning scientific computing, signal processing, optimization, machine learning, deep learning and stochastic computing.
1.5K
Parallel convolutional processing using an integrated photonic tensor core.
Johannes Feldmann,Nathan Youngblood,Nathan Youngblood,Maxim Karpov,Helge Gehring,Xuan Li,Maik Stappers,M. Le Gallo,Xin Fu,Anton Lukashchuk,Arslan S. Raja,Junqiu Liu,C.D. Wright,Abu Sebastian,Tobias J. Kippenberg,Wolfram H. P. Pernice,Harish Bhaskaran +16 more
TL;DR: In this paper, the authors demonstrate a computationally specific integrated photonic hardware accelerator (tensor core) that is capable of operating at speeds of trillions of multiply-accumulate operations per second.
1.1K
Resistive switching materials for information processing
Zhongrui Wang,Huaqiang Wu,Geoffrey W. Burr,Cheol Seong Hwang,Kang L. Wang,Qiangfei Xia,Jianhua Yang +6 more
TL;DR: This Review surveys the four physical mechanisms that lead to resistive switching materials enable novel, in-memory information processing, which may resolve the von Neumann bottleneck and examines the device requirements for systems based on RSMs.
948
Parallel convolution processing using an integrated photonic tensor core
Johannes Feldmann,Nathan Youngblood,Maxim Karpov,Helge Gehring,Xuan Li,Maik Stappers,Manuel Le Gallo,Xin Fu,Anton Lukashchuk,Arslan S. Raja,Junqiu Liu,David Wright,Abu Sebastian,Tobias J. Kippenberg,Wolfram H. P. Pernice,Harish Bhaskaran +15 more
TL;DR: The results indicate the potential of integrated photonics for parallel, fast, and efficient computational hardware in data-heavy AI applications such as autonomous driving, live video processing, and next-generation cloud computing services.
819
References
RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome
Bo Li,Colin N. Dewey +1 more
TL;DR: It is shown that accurate gene-level abundance estimates are best obtained with large numbers of short single-end reads, and estimates of the relative frequencies of isoforms within single genes may be improved through the use of paired- end reads, depending on the number of possible splice forms for each gene.
•Book
Iterative Methods for Sparse Linear Systems
Yousef Saad
- 01 Apr 2003
TL;DR: This chapter discusses methods related to the normal equations of linear algebra, and some of the techniques used in this chapter were derived from previous chapters of this book.
The missing memristor found
TL;DR: It is shown, using a simple analytical example, that memristance arises naturally in nanoscale systems in which solid-state electronic and ionic transport are coupled under an external bias voltage.
•Book
Accuracy and stability of numerical algorithms
Nicholas J. Higham
- 01 Jan 1991
TL;DR: This book gives a thorough, up-to-date treatment of the behavior of numerical algorithms in finite precision arithmetic by combining algorithmic derivations, perturbation theory, and rounding error analysis.
Role of autophagy in cancer
TL;DR: Evidence suggests that autophagy provides a protective function to limit tumour necrosis and inflammation, and to mitigate genome damage in tumour cells in response to metabolic stress.
Related Papers (5)
Evangelos S. Eleftheriou,M. Le Gallo,S. R. Nandakumar,Christophe Piveteau,Irem Boybat,Vinay Joshi,Riduan Khaddam-Aljameh,Martino Dazzi,Iason Giannopoulos,Geethan Karunaratne,Benedikt Kersting,Milos Stanisavljevic,V. P. Jonnalagadda,Nikolas Ioannou,Kornilios Kourtis,P. A. Francese,Abu Sebastian +16 more