Robust AUV Visual Loop-Closure Detection Based on Variational Autoencoder Network

doi:10.1109/tii.2022.3145860

Open AccessJournal Article10.1109/tii.2022.3145860

Robust AUV Visual Loop-Closure Detection Based on Variational Autoencoder Network

01 Dec 2022

- IEEE Transactions on Industrial Informat...

- Vol. 18, Iss: 12, pp 8829-8838

25

TL;DR: In this article , an underwater loop-closure detection method based on a variational autoencoder network is proposed, which can learn effective image representations to deal with the challenges caused by dynamic underwater environments.

Abstract: The visual loop-closure detection for autonomous underwater vehicles (AUVs) is a key component to reduce the drift error accumulated in simultaneous localization and mapping tasks. However, due to viewpoint changes, textureless images, and fast-moving objects, the loop closure detection in dramatically changing underwater environments remains a challenging problem to traditional geometric methods. Inspired by strong feature learning ability of deep neural networks, we propose an underwater loop-closure detection method based on a variational autoencoder network in this article. Our proposed method can learn effective image representations to deal with the challenges caused by dynamic underwater environments. Specifically, the proposed network is an unsupervised method, which avoids the difficulty and cost of labeling a great quantity of underwater data. Also included is a semantic object segmentation module, which is utilized to segment the underwater environments and assign weights to objects in order to alleviate the impact of fast-moving objects. Furthermore, an underwater image description scheme is used to enable efficient access to geometric and object-level semantic information, which helps to build a robust and real-time system in dramatically changing underwater scenarios. Finally, we test the proposed system under complex underwater environments and get a recall rate of 92.31% in the tested environments.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1007/s10462-023-10513-4

DeepThink IoT: The Strength of Deep Learning in Internet of Things

Divyansh Thakur, +1 more

- 04 Jun 2023

- Artificial Intelligence Review

TL;DR: This survey paper provides a comprehensive overview of the impact of DL on IoT, including an analysis of sensor data to detect patterns and make predictions, and the implications for various industries such as healthcare, manufacturing, agriculture, and smart cities.

...read moreread less

30

Journal Article•10.1109/tii.2023.3268760

A Cooperative Evolutionary Computation Algorithm for Dynamic Multiobjective Multi-AUV Path Planning

Xiaofeng Liu, +4 more

- 01 Jan 2023

- IEEE Transactions on Industrial Informat...

TL;DR: In this article , a cooperative evolutionary computation algorithm was proposed to provide diverse and high-quality solutions for decision makers, where solutions are represented using a bi-layer encode scheme, in which the first layer indicates the surface location points and the second layer represents the traveling sequences of target missions.

...read moreread less

19

Journal Article•10.1109/tii.2023.3249794

UIESC: An Underwater Image Enhancement Framework via Self-attention and Contrastive Learning

Renzhang Chen, +2 more

- 01 Jan 2023

- IEEE Transactions on Industrial Informat...

TL;DR: In this article , the authors proposed an underwater image enhancement framework via self-attention and contrastive learning (UIESC) to solve low contrast, color distortion and blurred details.

...read moreread less

12

Journal Article•10.1109/tii.2023.3240578

Explicit Points-of-Interest Driven Siamese Transformer for 3D LiDAR Place Recognition in Outdoor Challenging Environments

Dong Kong, +6 more

- 01 Jan 2023

- IEEE Transactions on Industrial Informat...

TL;DR: Li et al. as discussed by the authors proposed an explicit points-of-interest driven PR method, which consists of a road segmentation module based on grid-wise patch U-Transformer (GP_UT) and a PR module based upon regions of interest Siamese Transformer NetVLAD (RI_STV).

...read moreread less

11

•Journal Article•10.1109/tvt.2023.3241634

Robust Real-Time AUV Self-Localization Based on Stereo Vision-Inertia

01 Jun 2023

- IEEE Transactions on Vehicular Technolog...

TL;DR: In this paper , the authors proposed a robust real-time AUV self-localization method based on stereo camera and inertial sensor, which merges point and diagonal features, as well as inertial measurements to overcome the challenges of poor visibility.

...read moreread less

6

...

Expand

References

Journal Article•10.1145/358669.358692

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Martin A. Fischler, +1 more

- 01 Jun 1981

- Communications of The ACM

TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.

...read moreread less

27.9K

•Journal Article•10.1109/TRO.2017.2705103

ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras

Raul Mur-Artal, +1 more

- 12 Jun 2017

- IEEE Transactions on Robotics

TL;DR: ORB-SLAM2, a complete simultaneous localization and mapping (SLAM) system for monocular, stereo and RGB-D cameras, including map reuse, loop closing, and relocalization capabilities, is presented, being in most cases the most accurate SLAM solution.

...read moreread less

5.4K

•Journal Article•10.1109/TPAMI.2017.2711011

NetVLAD: CNN Architecture for Weakly Supervised Place Recognition

Relja Arandjelovic, +4 more

- 01 Jun 2018

- IEEE Transactions on Pattern Analysis an...

TL;DR: A convolutional neural network architecture that is trainable in an end-to-end manner directly for the place recognition task, and significantly outperforms non-learnt image representations and off-the-shelf CNN descriptors on two challenging place recognition benchmarks.

...read moreread less

1.6K

•Proceedings Article•10.1109/CVPR.2018.00132

COCO-Stuff: Thing and Stuff Classes in Context

Holger Caesar, +2 more

- 17 Dec 2018

TL;DR: COCO-Stuff as mentioned in this paper augments all 164k images of the COCO 2017 dataset with pixel-wise annotations for 91 stuff classes, which leverages the original thing annotations.

...read moreread less

1.1K

•Book Chapter•10.1007/978-3-319-10605-2_35

Learning 6D Object Pose Estimation Using 3D Object Coordinates

Eric Brachmann, +5 more

- 06 Sep 2014

TL;DR: This work addresses the problem of estimating the 6D Pose of specific objects from a single RGB-D image by presenting a learned, intermediate representation in form of a dense 3D object coordinate labelling paired with a dense class labelling.

...read moreread less

993

...

Expand

Robust AUV Visual Loop-Closure Detection Based on Variational Autoencoder Network

Chat with Paper

AI Agents for this Paper

Citations

DeepThink IoT: The Strength of Deep Learning in Internet of Things

A Cooperative Evolutionary Computation Algorithm for Dynamic Multiobjective Multi-AUV Path Planning

UIESC: An Underwater Image Enhancement Framework via Self-attention and Contrastive Learning

Explicit Points-of-Interest Driven Siamese Transformer for 3D LiDAR Place Recognition in Outdoor Challenging Environments

Robust Real-Time AUV Self-Localization Based on Stereo Vision-Inertia

References

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras

NetVLAD: CNN Architecture for Weakly Supervised Place Recognition

COCO-Stuff: Thing and Stuff Classes in Context

Learning 6D Object Pose Estimation Using 3D Object Coordinates

Related Papers (5)

Facial Expression Recognition Based on Convolutional Denoising Autoencoder and XGBoost

Detecting and solving template ambiguities in motion segmentation

Object segmentation using stereo images

Sparse autoencoder based feature learning for unmanned aerial vehicle landforms image classification

Real-time object segmentation for visual object detection in dynamic scenes