ALIGNet: Partial-Shape Agnostic Alignment via Unsupervised Learning

doi:10.1145/3267347

Journal Article10.1145/3267347

ALIGNet: Partial-Shape Agnostic Alignment via Unsupervised Learning

Rana Hanocka, +5 more

- 14 Dec 2018

- ACM Transactions on Graphics

- Vol. 38, Iss: 1, pp 1-14

159

TL;DR: This work presents an approach based on a deep neural network, leveraging shape datasets to learn a shape-aware prior for source-to-target alignment that is robust to shape incompleteness, and demonstrates that ALIGNet learns to align geometrically distinct shapes, and is able to infer plausible mappings even when the target shape is significantly incomplete.

Abstract: The process of aligning a pair of shapes is a fundamental operation in computer graphics. Traditional approaches rely heavily on matching corresponding points or features to guide the alignment, a paradigm that falters when significant shape portions are missing. These techniques generally do not incorporate prior knowledge about expected shape characteristics, which can help compensate for any misleading cues left by inaccuracies exhibited in the input shapes. We present an approach based on a deep neural network, leveraging shape datasets to learn a shape-aware prior for source-to-target alignment that is robust to shape incompleteness. In the absence of ground truth alignments for supervision, we train a network on the task of shape alignment using incomplete shapes generated from full shapes for self-supervision. Our network, called ALIGNet, is trained to warp complete source shapes to incomplete targets, as if the target shapes were complete, thus essentially rendering the alignment partial-shape agnostic. We aim for the network to develop specialized expertise over the common characteristics of the shapes in each dataset, thereby achieving a higher-level understanding of the expected shape space to which a local approach would be oblivious. We constrain ALIGNet through an anisotropic total variation identity regularization to promote piecewise smooth deformation fields, facilitating both partial-shape agnosticism and post-deformation applications. We demonstrate that ALIGNet learns to align geometrically distinct shapes and is able to infer plausible mappings even when the target shape is significantly incomplete. We show that our network learns the common expected characteristics of shape collections without over-fitting or memorization, enabling it to produce plausible deformations on unseen data during test time.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1145/3306346.3322959

MeshCNN: A Network with an Edge

Rana Hanocka, +5 more

- 16 Sep 2018

- arXiv: Learning

TL;DR: MeshCNN as discussed by the authors combines specialized convolution and pooling layers that operate on the mesh edges, by leveraging their intrinsic geodesic connections, and learns which edges to collapse, thus forming a task-driven process where the network exposes and expands the important features while discarding the redundant ones.

...read moreread less

578

•Journal Article•10.1186/S40537-019-0212-5

Intelligent video surveillance: a review through deep learning techniques for crowd analysis

G. Sreenu, +1 more

- 01 Dec 2019

- Journal of Big Data

TL;DR: The main focus of this survey is application of deep learning techniques in detecting the exact count, involved persons and the happened activity in a large crowd at all climate conditions.

...read moreread less

416

•Journal Article•10.1145/3306346.3322959

MeshCNN: a network with an edge

Rana Hanocka, +5 more

- 12 Jul 2019

- ACM Transactions on Graphics

TL;DR: This paper utilizes the unique properties of the mesh for a direct analysis of 3D shapes using MeshCNN, a convolutional neural network designed specifically for triangular meshes, and demonstrates the effectiveness of MeshCNN on various learning tasks applied to 3D meshes.

...read moreread less

414

•Journal Article•10.1145/3386569.3392415

Point2Mesh: a self-prior for deformable meshes

Rana Hanocka, +3 more

- 08 Jul 2020

- ACM Transactions on Graphics

TL;DR: This paper introduces Point2Mesh, a technique for reconstructing a surface mesh from an input point cloud that is robust to non-ideal conditions, and shows that shrink-wrapping a point cloud with a self-prior converges to a desirable solution.

...read moreread less

240

Journal Article•10.1016/J.DISPLA.2021.102053

Review of multi-view 3D object recognition methods based on deep learning

Shaohua Qi, +7 more

- 01 Sep 2021

- Displays

TL;DR: A comprehensive review and classification of the latest developments in the deep learning methods for multi-view 3D object recognition is presented, which summarizes the results of these methods on a few mainstream datasets, provides an insightful summary, and puts forward enlightening future research directions.

...read moreread less

197

...

Expand

References

Journal Article•10.1145/358669.358692

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Martin A. Fischler, +1 more

- 01 Jun 1981

- Communications of The ACM

TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.

...read moreread less

27.9K

Journal Article•10.1109/34.121791

A method for registration of 3-D shapes

Paul J. Besl, +1 more

- 01 Feb 1992

- IEEE Transactions on Pattern Analysis an...

TL;DR: In this paper, the authors describe a general-purpose representation-independent method for the accurate and computationally efficient registration of 3D shapes including free-form curves and surfaces, based on the iterative closest point (ICP) algorithm, which requires only a procedure to find the closest point on a geometric entity to a given point.

...read moreread less

20.6K

•Book Chapter•10.1007/978-3-319-46475-6_43

Perceptual Losses for Real-Time Style Transfer and Super-Resolution

Justin Johnson, +2 more

- 08 Oct 2016

TL;DR: In this paper, the authors combine the benefits of both approaches, and propose the use of perceptual loss functions for training feed-forward networks for image style transfer, where a feedforward network is trained to solve the optimization problem proposed by Gatys et al. in real-time.

...read moreread less

10.1K

•Proceedings Article

Spatial transformer networks

Max Jaderberg, +3 more

- 07 Dec 2015

TL;DR: This work introduces a new learnable module, the Spatial Transformer, which explicitly allows the spatial manipulation of data within the network, and can be inserted into existing convolutional architectures, giving neural networks the ability to actively spatially transform feature maps.

...read moreread less

8.5K

Journal Article•10.1109/34.993558

Shape matching and object recognition using shape contexts

Serge Belongie, +2 more

- 01 Apr 2002

- IEEE Transactions on Pattern Analysis an...

TL;DR: This paper presents work on computing shape models that are computationally fast and invariant basic transformations like translation, scaling and rotation, and proposes shape detection using a feature called shape context, which is descriptive of the shape of the object.

...read moreread less

7.3K