Proceedings Article10.1109/MFI55806.2022.9913845
Enhancing Event-based Structured Light Imaging with a Single Frame
Huijiao Wang,Tangbo Liu,Chu He,Cheng Li,Jianzhuang Liu,Lei Yu +5 more
- 20 Sep 2022
pp 1-7
1
TL;DR: A Multi-Modal Feature Fusion Network (MFFN) consisting of a feature fusion module and an upscale module to simultaneously fuse events and a single intensity frame, suppress event perturbations, and reconstruct a high-quality depth surface is proposed.
read more
Abstract: Benefiting from the extremely low latency, events have been used for Structured Light Imaging (SLI) to predict the depth surface. However, existing methods only focus on improving scanning speeds but neglect perturbations from event noise and timestamp jittering for depth estimation. In this paper, we build a hybrid SLI system equipped with an event camera, a high-resolution frame camera, and a digital light projector, where a single intensity frame is adopted as a guidance to enhance the event-based SLI quality. To achieve this end, we propose a Multi-Modal Feature Fusion Network (MFFN) consisting of a feature fusion module and an upscale module to simultaneously fuse events and a single intensity frame, suppress event perturbations, and reconstruct a high-quality depth surface. Further, for training MFFN, we build a new Structured Light Imaging based on Event and Frame cameras (EF-SLI) dataset collected from the hybrid SLI system, containing paired inputs composed of a set of synchronized events and one single corresponding frame, and ground-truth references obtained by a high-quality SLI approach. Experiments demonstrate that our proposed MFFN outperforms state-of-the-art event-based SLI approaches in terms of accuracy at different scanning speeds.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
SGE: Structured Light System Based on Gray Code with an Event Camera
Xingyu Lu,Lei Sun,Diyang Gu,Zhijie Xu,Kaiwei Wang +4 more
TL;DR: High-speed structured light system based on Gray code and event camera achieves accurate depth estimation with minimal data redundancy and high acquisition speed.
References
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network
Wenzhe Shi,Jose Caballero,Ferenc Huszar,Johannes Totz,Andrew Peter Aitken,Rob Bishop,Daniel Rueckert,Zehan Wang +7 more
- 27 Jun 2016
TL;DR: This paper presents the first convolutional neural network capable of real-time SR of 1080p videos on a single K2 GPU and introduces an efficient sub-pixel convolution layer which learns an array of upscaling filters to upscale the final LR feature maps into the HR output.
Overview of three-dimensional shape measurement using optical methods
TL;DR: An overview of 3-D shape measurement using various optical methods, and a focus on structured light tech- niques where various optical configurations, image acquisition technology, data postprocessing and analysis methods and advantages and limitations are presented.
1.6K
HOTS: A Hierarchy of Event-Based Time-Surfaces for Pattern Recognition
TL;DR: The central concept is to use the rich temporal information provided by events to create contexts in the form of time-surfaces which represent the recent temporal activity within a local spatial neighborhood and it is demonstrated that this concept can robustly be used at all stages of an event-based hierarchical model.
653
What Makes Training Multi-Modal Classification Networks Hard?
Weiyao Wang,Du Tran,Matt Feiszli +2 more
- 14 Jun 2020
TL;DR: This paper identifies two main causes for this performance drop: first, multi-modal networks are often prone to overfitting due to increased capacity and second, different modalities overfit and generalize at different rates, so training them jointly with a single optimization strategy is sub-optimal.
•Posted Content
Fast and Accurate Image Super-Resolution with Deep Laplacian Pyramid Networks
TL;DR: In this article, the authors proposed the deep Laplacian pyramid super-resolution network (LAPS-Net), which progressively reconstructs the sub-band residuals of high-resolution images at multiple pyramid levels.