Robust AUV Visual Loop-Closure Detection Based on Variational Autoencoder Network
TL;DR: This article proposes an underwater loop-closure detection method based on a variational autoencoder network, which learns effective image representations to deal with the challenges caused by dynamic underwater environments.
Abstract: Visual loop-closure detection for autonomous underwater vehicles (AUVs) is a key component for reducing the drift error accumulated in simultaneous localization and mapping tasks. However, because of viewpoint changes, textureless images, and fast-moving objects, loop-closure detection in dramatically changing underwater environments remains challenging for traditional geometric methods. Inspired by the strong feature-learning ability of deep neural networks, we propose an underwater loop-closure detection method based on a variational autoencoder network. The method learns effective image representations to deal with the challenges caused by dynamic underwater environments. Specifically, the network is trained without supervision, which avoids the difficulty and cost of labeling large quantities of underwater data. A semantic object segmentation module segments the underwater scene and assigns weights to objects in order to alleviate the impact of fast-moving objects. Furthermore, an underwater image description scheme enables efficient access to geometric and object-level semantic information, which helps to build a robust, real-time system for dramatically changing underwater scenarios. Finally, we test the proposed system in complex underwater environments and obtain a recall rate of 92.31% in the tested environments.
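The abstract does not spell out how learned image representations are compared or how recall is computed. As a rough illustration only, the sketch below assumes each frame has already been encoded into a fixed-length descriptor (for example, the latent mean of a variational autoencoder) and matches frames by cosine similarity. The function names, the `threshold` value, and the `min_gap` parameter are hypothetical and are not taken from the paper; the segmentation-based object weighting is not modeled here.

```python
import math
from typing import List, Optional, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two descriptors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def detect_loop(
    descriptors: List[Sequence[float]],
    query_index: int,
    threshold: float = 0.9,  # hypothetical value, not from the paper
    min_gap: int = 2,        # hypothetical: ignore frames closer than this
) -> Optional[Tuple[int, float]]:
    """Return (index, score) of the most similar earlier frame whose
    similarity reaches the threshold, or None if there is no such frame."""
    query = descriptors[query_index]
    best: Optional[Tuple[int, float]] = None
    # Only frames at least `min_gap` steps before the query are candidates,
    # so that temporally adjacent frames are not reported as loops.
    for i in range(0, query_index - min_gap + 1):
        score = cosine_similarity(query, descriptors[i])
        if score >= threshold and (best is None or score > best[1]):
            best = (i, score)
    return best


def recall(true_positives: int, false_negatives: int) -> float:
    """Recall = detected true loops / all true loops (0.0 if there are none)."""
    total = true_positives + false_negatives
    return true_positives / total if total else 0.0


if __name__ == "__main__":
    # Toy descriptors: frame 4 revisits roughly the same place as frame 0.
    frames = [
        [1.0, 0.0, 0.0],
        [0.0, 1.0, 0.0],
        [0.0, 0.0, 1.0],
        [0.1, 0.9, 0.0],
        [0.99, 0.05, 0.0],
    ]
    print(detect_loop(frames, 4))
    print(recall(9, 1))
```

In this toy run, frame 4 is matched to frame 0 because their descriptors point in nearly the same direction. A real system would tune the threshold on held-out data and typically verify candidate matches geometrically before accepting a loop closure.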
Citations
DeepThink IoT: The Strength of Deep Learning in Internet of Things
TL;DR: This survey paper provides a comprehensive overview of the impact of DL on IoT, including an analysis of sensor data to detect patterns and make predictions, and the implications for various industries such as healthcare, manufacturing, agriculture, and smart cities.
30
A Cooperative Evolutionary Computation Algorithm for Dynamic Multiobjective Multi-AUV Path Planning
TL;DR: This article proposes a cooperative evolutionary computation algorithm that provides diverse, high-quality solutions for decision makers. Solutions are represented using a bi-layer encoding scheme, in which the first layer indicates the surface location points and the second layer represents the traveling sequences of target missions.
19
UIESC: An Underwater Image Enhancement Framework via Self-attention and Contrastive Learning
TL;DR: The authors propose an underwater image enhancement framework via self-attention and contrastive learning (UIESC) to address low contrast, color distortion, and blurred details.
12
Explicit Points-of-Interest Driven Siamese Transformer for 3D LiDAR Place Recognition in Outdoor Challenging Environments
TL;DR: Li et al. propose an explicit points-of-interest driven PR method, which consists of a road segmentation module based on a grid-wise patch U-Transformer (GP_UT) and a PR module based on a regions-of-interest Siamese Transformer NetVLAD (RI_STV).
11
Robust Real-Time AUV Self-Localization Based on Stereo Vision-Inertia
TL;DR: The authors propose a robust real-time AUV self-localization method based on a stereo camera and an inertial sensor, which merges point and diagonal features with inertial measurements to overcome the challenges of poor visibility.
References
Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography
TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.
ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras
Raul Mur-Artal, Juan D. Tardós
TL;DR: ORB-SLAM2, a complete simultaneous localization and mapping (SLAM) system for monocular, stereo and RGB-D cameras, including map reuse, loop closing, and relocalization capabilities, is presented, being in most cases the most accurate SLAM solution.
5.4K
NetVLAD: CNN Architecture for Weakly Supervised Place Recognition
TL;DR: A convolutional neural network architecture that is trainable in an end-to-end manner directly for the place recognition task, and significantly outperforms non-learnt image representations and off-the-shelf CNN descriptors on two challenging place recognition benchmarks.
1.6K
COCO-Stuff: Thing and Stuff Classes in Context
Holger Caesar, Jasper Uijlings, Vittorio Ferrari
- 17 Dec 2018
TL;DR: COCO-Stuff augments all 164k images of the COCO 2017 dataset with pixel-wise annotations for 91 stuff classes, leveraging the original thing annotations.
Learning 6D Object Pose Estimation Using 3D Object Coordinates
Eric Brachmann, Alexander Krull, Frank Michel, Stefan Gumhold, Jamie Shotton, Carsten Rother
- 06 Sep 2014
TL;DR: This work addresses the problem of estimating the 6D Pose of specific objects from a single RGB-D image by presenting a learned, intermediate representation in form of a dense 3D object coordinate labelling paired with a dense class labelling.