Journal Article10.1109/TPAMI.2016.2574707
HOTS: A Hierarchy of Event-Based Time-Surfaces for Pattern Recognition
641
TL;DR: The central concept is to use the rich temporal information provided by events to create contexts in the form of time-surfaces which represent the recent temporal activity within a local spatial neighborhood and it is demonstrated that this concept can robustly be used at all stages of an event-based hierarchical model.
read more
Abstract: This paper describes novel event-based spatio-temporal features called time-surfaces and how they can be used to create a hierarchical event-based pattern recognition architecture. Unlike existing hierarchical architectures for pattern recognition, the presented model relies on a time oriented approach to extract spatio-temporal features from the asynchronously acquired dynamics of a visual scene. These dynamics are acquired using biologically inspired frameless asynchronous event-driven vision sensors. Similarly to cortical structures, subsequent layers in our hierarchy extract increasingly abstract features using increasingly large spatio-temporal windows. The central concept is to use the rich temporal information provided by events to create contexts in the form of time-surfaces which represent the recent temporal activity within a local spatial neighborhood. We demonstrate that this concept can robustly be used at all stages of an event-based hierarchical model. First layer feature units operate on groups of pixels, while subsequent layer feature units operate on the output of lower level feature units. We report results on a previously published 36 class character recognition task and a four class canonical dynamic card pip task, achieving near 100 percent accuracy on each. We introduce a new seven class moving face recognition task, achieving 79 percent accuracy.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Recent Advances in Bio-Inspired Vision Sensor: A Review
Xiaoyu Zhong,Zi‐Shan Yu,Xiaofeng Gu +2 more
TL;DR: This paper provides a comprehensive overview of the emerging field of event-based vision, focusing on the operation principle, sampling mechanisms, and algorithms that take advantage of their superior features.
1
Fourier‐Based Action Recognition for Wildlife Behavior Quantification with Event Cameras
Friedhelm Hamann,Suman Ghosh,Ignacio Juárez Martínez,Tom Hart,Alex Kacelnik,Guillermo Gallego +5 more
TL;DR: This study proposes Fourier-based action recognition approaches for wildlife behavior quantification using event cameras, achieving effective results with significantly fewer parameters than deep neural networks, particularly in recognizing oscillating motion patterns in penguin behavior.
Enhancing Event-based Structured Light Imaging with a Single Frame
Huijiao Wang,Tangbo Liu,Chu He,Cheng Li,Jianzhuang Liu,Lei Yu +5 more
- 20 Sep 2022
TL;DR: A Multi-Modal Feature Fusion Network (MFFN) consisting of a feature fusion module and an upscale module to simultaneously fuse events and a single intensity frame, suppress event perturbations, and reconstruct a high-quality depth surface is proposed.
1
Semantic Segmentation on Neuromorphic Vision Sensor Event-Streams Using PointNet++ and UNet Based Processing Approaches
Tobias Bolten,Regina Pohle-Fröhlich,Klaus D. Tönnies +2 more
- 01 Jan 2023
TL;DR: PointNet++ based processing has been found advantageous over a UNet approach on lower resolution recordings with a comparatively lower event count and for recordings with ego-motion of the sensor and a resulting higher event count, UNet-based processing is advantageous.
1
Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition
Chenming Hu,Zheng Fang,Kuanxu Hou,Delei Kong,Jian Jiang,Hao Zhuang,Mingyuan Sun,Xinjie Huang +7 more
- 16 Feb 2024
TL;DR: Spike-EVPR is a deep spiking residual network designed for event-based visual place recognition tasks. It utilizes two novel event representations, a bifurcated spike residual encoder, a shared & specific descriptor extractor, and a cross-descriptor aggregation module to achieve superior performance compared to existing EVPR pipelines.
References
Distinctive Image Features from Scale-Invariant Keypoints
TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Gradient-based learning applied to document recognition
Yann LeCun,Léon Bottou,Léon Bottou,Yoshua Bengio,Yoshua Bengio,Yoshua Bengio,Patrick Haffner +6 more
- 01 Jan 1998
TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
53.5K
Gradient-based learning applied to document recognition
Yann LeCun,Léon Bottou,Léon Bottou,Yoshua Bengio,Yoshua Bengio,Yoshua Bengio,Patrick Haffner,Patrick Haffner +7 more
- 01 Jan 2001
TL;DR: This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task, and Convolutional neural networks are shown to outperform all other techniques.
32.7K
Emergence of simple-cell receptive field properties by learning a sparse code for natural images
TL;DR: It is shown that a learning algorithm that attempts to find sparse linear codes for natural scenes will develop a complete family of localized, oriented, bandpass receptive fields, similar to those found in the primary visual cortex.
•Proceedings Article
Large Scale Distributed Deep Networks
Jeffrey Dean,Greg S. Corrado,Rajat Monga,Kai Chen,Matthieu Devin,Mark Z. Mao,Marc'Aurelio Ranzato,Andrew W. Senior,Paul A. Tucker,Ke Yang,Quoc V. Le,Andrew Y. Ng +11 more
- 03 Dec 2012
TL;DR: This paper considers the problem of training a deep network with billions of parameters using tens of thousands of CPU cores and develops two algorithms for large-scale distributed training, Downpour SGD and Sandblaster L-BFGS, which increase the scale and speed of deep network training.