Journal Article10.1109/TPAMI.2016.2574707
HOTS: A Hierarchy of Event-Based Time-Surfaces for Pattern Recognition
641
TL;DR: The central concept is to use the rich temporal information provided by events to create contexts in the form of time-surfaces which represent the recent temporal activity within a local spatial neighborhood and it is demonstrated that this concept can robustly be used at all stages of an event-based hierarchical model.
read more
Abstract: This paper describes novel event-based spatio-temporal features called time-surfaces and how they can be used to create a hierarchical event-based pattern recognition architecture. Unlike existing hierarchical architectures for pattern recognition, the presented model relies on a time oriented approach to extract spatio-temporal features from the asynchronously acquired dynamics of a visual scene. These dynamics are acquired using biologically inspired frameless asynchronous event-driven vision sensors. Similarly to cortical structures, subsequent layers in our hierarchy extract increasingly abstract features using increasingly large spatio-temporal windows. The central concept is to use the rich temporal information provided by events to create contexts in the form of time-surfaces which represent the recent temporal activity within a local spatial neighborhood. We demonstrate that this concept can robustly be used at all stages of an event-based hierarchical model. First layer feature units operate on groups of pixels, while subsequent layer feature units operate on the output of lower level feature units. We report results on a previously published 36 class character recognition task and a four class canonical dynamic card pip task, achieving near 100 percent accuracy on each. We introduce a new seven class moving face recognition task, achieving 79 percent accuracy.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Scene Adaptive Sparse Transformer for Event-based Object Detection
Yansong Peng,Hebei Li,Yueyi Zhang,Xiaoyan Sun,Feng Wu +4 more
TL;DR: Scene Adaptive Sparse Transformer (SAST) achieves high performance and efficiency in event-based object detection by introducing window-token co-sparsification, scene-aware sparsity optimization, and dynamic sparsity level optimization based on scene complexity.
Event camera object recognition using spatiotemporal event time surface and reward-modulated spike-timing-dependent plasticity learning rule
Qian Zhou,Guimeng Zhang +1 more
TL;DR: Event camera object recognition using spatiotemporal event time surface and reward-modulated spike-timing-dependent plasticity learning rule. The system includes a denoising module and an object recognition module. The denoising module removes noise events using spatiotemporal information of the events. The object recognition module extracts primary spatiotemporal features and makes object recognition using spiking neural networks with reward-modulated spike-timing-dependent plasticity learning rule.
Label-Free Event-based Object Recognition via Joint Learning with Image Reconstruction from Events
Hoonhee Cho,Hyeon-Seong Kim,Yujeong Chae,Kuk-Jin Yoon +3 more
TL;DR: This paper studies label-free event-based object recognition where category labels and paired images are not available, and proposes category-guided attraction loss and category-agnostic repulsion loss to bridge the textual features of predicted categories and the visual features of reconstructed images using CLIP.
Brain Inspired Computing: A Systematic Survey and Future Trends
19 Jan 2023
TL;DR: In this paper , the authors present a comprehensive survey of Brain Inspired Computing (BIC) and summarize four components of BIC infrastructure development: 1) modeling/algorithm; 2) hardware platform; 3) software tool; 4) benchmark data.
PL-EVIO: Robust Monocular Event-Based Visual Inertial Odometry With Point and Line Features
Weipeng Guan,Peiyu Chen,Y. H. Xie,Peng Lu +3 more
TL;DR: Robust and real-time event-based visual-inertial odometry (VIO) incorporating point and line features for aggressive motion and large-scale outdoor experiments.
References
Distinctive Image Features from Scale-Invariant Keypoints
TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Gradient-based learning applied to document recognition
Yann LeCun,Léon Bottou,Léon Bottou,Yoshua Bengio,Yoshua Bengio,Yoshua Bengio,Patrick Haffner +6 more
- 01 Jan 1998
TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
53.5K
Gradient-based learning applied to document recognition
Yann LeCun,Léon Bottou,Léon Bottou,Yoshua Bengio,Yoshua Bengio,Yoshua Bengio,Patrick Haffner,Patrick Haffner +7 more
- 01 Jan 2001
TL;DR: This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task, and Convolutional neural networks are shown to outperform all other techniques.
32.7K
Emergence of simple-cell receptive field properties by learning a sparse code for natural images
TL;DR: It is shown that a learning algorithm that attempts to find sparse linear codes for natural scenes will develop a complete family of localized, oriented, bandpass receptive fields, similar to those found in the primary visual cortex.
•Proceedings Article
Large Scale Distributed Deep Networks
Jeffrey Dean,Greg S. Corrado,Rajat Monga,Kai Chen,Matthieu Devin,Mark Z. Mao,Marc'Aurelio Ranzato,Andrew W. Senior,Paul A. Tucker,Ke Yang,Quoc V. Le,Andrew Y. Ng +11 more
- 03 Dec 2012
TL;DR: This paper considers the problem of training a deep network with billions of parameters using tens of thousands of CPU cores and develops two algorithms for large-scale distributed training, Downpour SGD and Sandblaster L-BFGS, which increase the scale and speed of deep network training.