Journal Article10.1109/TPAMI.2016.2574707
HOTS: A Hierarchy of Event-Based Time-Surfaces for Pattern Recognition
641
TL;DR: The central concept is to use the rich temporal information provided by events to create contexts in the form of time-surfaces which represent the recent temporal activity within a local spatial neighborhood and it is demonstrated that this concept can robustly be used at all stages of an event-based hierarchical model.
read more
Abstract: This paper describes novel event-based spatio-temporal features called time-surfaces and how they can be used to create a hierarchical event-based pattern recognition architecture. Unlike existing hierarchical architectures for pattern recognition, the presented model relies on a time oriented approach to extract spatio-temporal features from the asynchronously acquired dynamics of a visual scene. These dynamics are acquired using biologically inspired frameless asynchronous event-driven vision sensors. Similarly to cortical structures, subsequent layers in our hierarchy extract increasingly abstract features using increasingly large spatio-temporal windows. The central concept is to use the rich temporal information provided by events to create contexts in the form of time-surfaces which represent the recent temporal activity within a local spatial neighborhood. We demonstrate that this concept can robustly be used at all stages of an event-based hierarchical model. First layer feature units operate on groups of pixels, while subsequent layer feature units operate on the output of lower level feature units. We report results on a previously published 36 class character recognition task and a four class canonical dynamic card pip task, achieving near 100 percent accuracy on each. We introduce a new seven class moving face recognition task, achieving 79 percent accuracy.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
Is Neuromorphic MNIST neuromorphic? Analyzing the discriminative power of neuromorphic datasets in the time domain
TL;DR: This study assesses if neuromorphic datasets recorded from static images are able to evaluate the ability of SNNs to use spike timings in their calculations, and compares N-MNIST and DvsGesture on two STDP algorithms that can classify only spatial data, and STDP-tempotron that classifies spatiotemporal data.
End-to-end Learning of Object Motion Estimation from Retinal Events for Event-based Object Tracking
TL;DR: A novel deep neural network is proposed to learn and regress a parametric object-level motion/transform model for event-based object tracking, which effectively encodes the spatio-temporal information of asynchronous retinal events into TSLTD frames with clear motion patterns.
Secrets of Event-Based Optical Flow
Shintaro Shiba,Yoshimitsu Aoki,Guillermo Ayala Gallego +2 more
- 20 Jul 2022
TL;DR: A principled method to extend the Contrast Maximization framework to estimate optical estimation from events alone and ranks first among unsupervised methods on the MVSEC benchmark, and is competitive on the DSEC benchmark.
50
Event-based visual place recognition with ensembles of temporal windows
Tobias Fischer,Michael Milford +1 more
TL;DR: In this article, an ensemble-based scheme for combining temporal windows of varying lengths that are processed in parallel is proposed, which achieves significant computational efficiencies without unduly compromising the original performance gains provided by the ensemble approach.
50
•Posted Content
Matrix-LSTM: a Differentiable Recurrent Surface for Asynchronous Event-Based Data.
Marco Cannici,Marco Ciccone,Andrea Romanoni,Matteo Matteucci +3 more
- 10 Jan 2020
TL;DR: In this paper, a grid of Long Short-Term Memory (LSTM) cells is proposed to learn end-to-end task-dependent event-surfaces, which shows good flexibility and expressiveness on optical flow estimation.
References
Distinctive Image Features from Scale-Invariant Keypoints
TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Gradient-based learning applied to document recognition
Yann LeCun,Léon Bottou,Léon Bottou,Yoshua Bengio,Yoshua Bengio,Yoshua Bengio,Patrick Haffner +6 more
- 01 Jan 1998
TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
53.5K
Gradient-based learning applied to document recognition
Yann LeCun,Léon Bottou,Léon Bottou,Yoshua Bengio,Yoshua Bengio,Yoshua Bengio,Patrick Haffner,Patrick Haffner +7 more
- 01 Jan 2001
TL;DR: This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task, and Convolutional neural networks are shown to outperform all other techniques.
32.7K
Emergence of simple-cell receptive field properties by learning a sparse code for natural images
TL;DR: It is shown that a learning algorithm that attempts to find sparse linear codes for natural scenes will develop a complete family of localized, oriented, bandpass receptive fields, similar to those found in the primary visual cortex.
•Proceedings Article
Large Scale Distributed Deep Networks
Jeffrey Dean,Greg S. Corrado,Rajat Monga,Kai Chen,Matthieu Devin,Mark Z. Mao,Marc'Aurelio Ranzato,Andrew W. Senior,Paul A. Tucker,Ke Yang,Quoc V. Le,Andrew Y. Ng +11 more
- 03 Dec 2012
TL;DR: This paper considers the problem of training a deep network with billions of parameters using tens of thousands of CPU cores and develops two algorithms for large-scale distributed training, Downpour SGD and Sandblaster L-BFGS, which increase the scale and speed of deep network training.