Open AccessPosted Content
The Kanerva Machine: A Generative Distributed Memory
TL;DR: In this paper, an end-to-end trained memory system that quickly adapts to new data and generates samples like them inspired by Kanerva's sparse distributed memory, it has a robust distributed reading and writing mechanism and is analytically tractable.
read more
Abstract: We present an end-to-end trained memory system that quickly adapts to new data and generates samples like them Inspired by Kanerva's sparse distributed memory, it has a robust distributed reading and writing mechanism The memory is analytically tractable, which enables optimal on-line compression via a Bayesian update-rule We formulate it as a hierarchical conditional generative model, where memory provides a rich data-dependent prior distribution Consequently, the top-down memory and bottom-up perception are combined to produce the code representing an observation Empirically, we demonstrate that the adaptive memory significantly improves generative models trained on both the Omniglot and CIFAR datasets Compared with the Differentiable Neural Computer (DNC) and its variants, our memory model has greater capacity and is significantly easier to train
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
Recurrent World Models Facilitate Policy Evolution
David Ha,Jürgen Schmidhuber +1 more
TL;DR: In this article, a generative recurrent neural network is trained in an unsupervised manner to model popular reinforcement learning environments through compressed spatio-temporal representations, which achieves state-of-the-art results in various environments.
599
Backpropagation through time and the brain.
TL;DR: Recent machine learning methods serve to strengthen BPTT's position as a useful normative guide for thinking about temporal credit assignment in artificial and biological systems alike by employing novel memory-based and attention-based architectures and algorithms.
151
An Integrated World Modeling Theory (IWMT) of Consciousness: Combining Integrated Information and Global Neuronal Workspace Theories With the Free Energy Principle and Active Inference Framework; Toward Solving the Hard Problem and Characterizing Agentic Causation.
Adam Safron
- 09 Jun 2020
TL;DR: This work describes how streams of consciousness may emerge as an evolving generation of sensorimotor predictions, with the precise composition of experiences depending on the integration abilities of synchronous complexes as self-organizing harmonic modes (SOHMs).
Robust High-dimensional Memory-augmented Neural Networks
Geethan Karunaratne,Geethan Karunaratne,Manuel Schmuck,Manuel Schmuck,Manuel Le Gallo,Giovanni Cherubini,Luca Benini,Abu Sebastian,Abbas Rahimi +8 more
TL;DR: This work proposes a robust architecture that employs a computational memory unit as the explicit memory performing analog in-memory computation on high-dimensional (HD) vectors, while closely matching 32-bit software-equivalent accuracy.
Revisiting HyperDimensional Learning for FPGA and Low-Power Architectures
Mohsen Imani,Zhuowen Zou,Samuel Bosch,Sanjay Anantha Rao,Sahand Salamat,Venkatesh Kumar,Yeseong Kim,Tajana Rosing +7 more
- 01 Feb 2021
TL;DR: In this paper, the authors proposed a novel architecture, LookHD, which enables real-time hyperdimensional computing (HDC) learning on low-power edge devices by exploiting computation reuse to memorize the encoding module and simplify its computation with single memory access.
85
References
Deep Residual Learning for Image Recognition
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
•Posted Content
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.
82.5K
Neural networks and physical systems with emergent collective computational abilities
TL;DR: A model of a system having a large number of simple equivalent components, based on aspects of neurobiology but readily adapted to integrated circuits, produces a content-addressable memory which correctly yields an entire memory from any subpart of sufficient size.
19K
A learning algorithm for boltzmann machines
TL;DR: A general parallel search method is described, based on statistical mechanics, and it is shown how it leads to a general learning rule for modifying the connection strengths so as to incorporate knowledge about a task domain in an efficient way.
4.1K
Human-level concept learning through probabilistic program induction.
TL;DR: A computational model is described that learns in a similar fashion and does so better than current deep learning algorithms and can generate new letters of the alphabet that look “right” as judged by Turing-like tests of the model's output in comparison to what real humans produce.
Related Papers (5)
Alex Graves,Greg Wayne,Malcolm Reynolds,Tim Harley,Ivo Danihelka,Agnieszka Grabska-Barwinska,Sergio Gomez Colmenarejo,Edward Grefenstette,Tiago Ramalho,John P. Agapiou,Adrià Puigdomènech Badia,Karl Moritz Hermann,Yori Zwols,Georg Ostrovski,Adam Cain,Helen King,Christopher Summerfield,Phil Blunsom,Koray Kavukcuoglu,Demis Hassabis +19 more
Qi Cai,Yingwei Pan,Ting Yao,Chenggang Yan,Tao Mei +4 more
- 18 Jun 2018