Sensor-level computer vision with pixel processor arrays for agile robots
Piotr Dudek,Thomas Richardson,Laurie Bose,Stephen J. Carey,Jianing Chen,Colin Greatwood,Yanan Liu,Walterio W. Mayol-Cuevas +7 more
TL;DR: The history of image sensing and processing hardware from the perspective of in-pixel computing is reviewed and the key features of a state-of-the-art smart camera system based on a PPA device are outlined, through the description of the SCAMP-5 system.
read more
Abstract: Vision processing for control of agile autonomous robots requires low-latency computation, within a limited power and space budget. This is challenging for conventional computing hardware. Parallel processor arrays (PPAs) are a new class of vision sensor devices that exploit advances in semiconductor technology, embedding a processor within each pixel of the image sensor array. Sensed pixel data are processed on the focal plane, and only a small amount of relevant information is transmitted out of the vision sensor. This tight integration of sensing, processing, and memory within a massively parallel computing architecture leads to an interesting trade-off between high performance, low latency, low power, low cost, and versatility in a machine vision system. Here, we review the history of image sensing and processing hardware from the perspective of in-pixel computing and outline the key features of a state-of-the-art smart camera system based on a PPA device, through the description of the SCAMP-5 system. We describe several robotic applications for agile ground and aerial vehicles, demonstrating PPA sensing functionalities including high-speed odometry, target tracking, obstacle detection, and avoidance. In the conclusions, we provide some insight and perspective on the future development of PPA devices, including their application and benefits within agile, robust, adaptable, and lightweight robotics.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
A two-dimensional mid-infrared optoelectronic retina enabling simultaneous perception and encoding
Fakun Wang,Fangchen Hu,Mingjin Dai,Song Zhu,Fang-Fang Sun,Ruihuan Duan,Chong Wang,Jiayue Han,Wenjie Deng,Wenduo Chen,Ming Ye,Song Han,Bo Qiang,Yuhao Jin,Yun Da Chua,Nan Chi,Shaohua Yu,Donguk Nam,Sang Hoon Chae,Zheng Liu,Qi Jie Wang +20 more
TL;DR: In this paper , a two-dimensional (2D) heterostructure for simultaneous data perception and encoding is presented, where a single device can perceive the illumination intensity of a MIR stimulus signal, while encoding the intensity into a spike train based on a rate encoding algorithm for subsequent neuromorphic computing.
In-sensor dynamic computing for intelligent machine vision
Yuekun Yang,Chen Pan,Yixiang Li,Xingjian Yangdong,Pengfei Wang,Zhu-An Li,Shuang Wang,Wentao Yu,Guanyu Liu,Bin Cheng,Zengfeng Di,Shi-Jun Liang,Feng Miao +12 more
34
Accuracy and safety of a haptic operated and machine vision controlled collaborative robot for dental implant placement: A translational study.
TL;DR: In this article , the authors reported the accuracy of robot-assisted dental implant placement in a preclinical model and further applied the robotic system in a clinical case series, where the use of a lock-on structure at robot arm-handpiece was tested in resin arch models.
31
AI-Based Computer Vision Techniques and Expert Systems
TL;DR: In this paper , the authors discuss some knowledge-based computer vision techniques that employ deep learning and discuss how these techniques can acquire the tacit knowledge of experts, which was not previously achievable with conventional expert systems.
Intelligent Recognition Using Ultralight Multifunctional Nano-Layered Carbon Aerogel Sensors with Human-Like Tactile Perception
Huiqi Zhao,Yizheng Zhang,Lei Han,Weiqi Qian,Jiabin Wang,Heting Wu,Jingchen Li,Yuan Dai,Zhengyou Zhang,Chris R Bowen,Ya Yang +10 more
TL;DR: A new form of ultralight multifunctional tactile nano-layered carbon aerogel sensor that provides pressure, temperature, material recognition and 3D location capabilities, which is combined with multimodal supervised learning algorithms for object recognition.
25
References
•Proceedings Article
Visual odometry
David Nister,Oleg Naroditsky,James R. Bergen +2 more
- 01 Jan 2004
TL;DR: A system that estimates the motion of a stereo head or a single moving camera based on video input in real-time with low delay and the motion estimates are used for navigational purposes.
1.9K
FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance
Mark Cummins,Paul Newman +1 more
TL;DR: A probabilistic approach to the problem of recognizing places based on their appearance that can determine that a new observation comes from a previously unseen place, and so augment its map, and is particularly suitable for online loop closure detection in mobile robotics.
•Book
Analog VLSI implementation of neural systems
Carver A. Mead,Mohammed Ismail +1 more
- 13 Oct 2011
TL;DR: A Neural Processor for Maze Solving and Issues in Analog VLSI and MOS Techniques for Neural Computing are discussed.
1.6K
Eye smarter than scientists believed: Neural computations in circuits of the retina
Tim Gollisch,Markus Meister +1 more
TL;DR: In this review, recent progress in understanding the computations performed in the vertebrate retina and how they are implemented by the neural circuitry are summarized.
819
The ILLIAC IV Computer
TL;DR: The structure of ILLIAC IV, a parallel-array computer containing 256 processing elements, is described, special features include multiarray processing, multiprecision arithmetic, and fast data-routing interconnections.
614