Creating consistent scene graphs using a probabilistic grammar

doi:10.1145/2661229.2661243

Journal Article10.1145/2661229.2661243

Creating consistent scene graphs using a probabilistic grammar

Tianqiang Liu, +5 more

- 19 Nov 2014

- Vol. 33, Iss: 6, pp 211

93

TL;DR: The proposed algorithms can be used to provide consistent hierarchies for large collections of scenes within the same semantic class, and outperform alternative approaches that consider only shape similarities and/or spatial relationships without hierarchy.

Abstract: Growing numbers of 3D scenes in online repositories provide new opportunities for data-driven scene understanding, editing, and synthesis. Despite the plethora of data now available online, most of it cannot be effectively used for data-driven applications because it lacks consistent segmentations, category labels, and/or functional groupings required for co-analysis. In this paper, we develop algorithms that infer such information via parsing with a probabilistic grammar learned from examples. First, given a collection of scene graphs with consistent hierarchies and labels, we train a probabilistic hierarchical grammar to represent the distributions of shapes, cardinalities, and spatial relationships of semantic objects within the collection. Then, we use the learned grammar to parse new scenes to assign them segmentations, labels, and hierarchies consistent with the collection. During experiments with these algorithms, we find that: they work effectively for scene graphs for indoor scenes commonly found online (bedrooms, classrooms, and libraries); they outperform alternative approaches that consider only shape similarities and/or spatial relationships without hierarchy; they require relatively small sets of training data; they are robust to moderate over-segmentation in the inputs; and, they can robustly transfer labels from one data set to another. As a result, the proposed algorithms can be used to provide consistent hierarchies for large collections of scenes within the same semantic class.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

ShapeNet: An Information-Rich 3D Model Repository

Angel X. Chang, +12 more

- 09 Dec 2015

- arXiv: Graphics

TL;DR: ShapeNet contains 3D models from a multitude of semantic categories and organizes them under the WordNet taxonomy, a collection of datasets providing many semantic annotations for each 3D model such as consistent rigid alignments, parts and bilateral symmetry planes, physical sizes, keywords, as well as other planned annotations.

...read moreread less

4.8K

•Proceedings Article

Learning Pose Grammar to Encode Human Body Configuration for 3D Pose Estimation

Hao-Shu Fang, +4 more

- 27 Apr 2018

TL;DR: This paper proposes a pose grammar to tackle the problem of 3D human pose estimation, which takes 2D pose as input and learns a generalized 2D-3D mapping function and enforces high-level constraints over human poses.

...read moreread less

453

Proceedings Article•10.1109/CVPR.2018.00449

Attentive Fashion Grammar Network for Fashion Landmark Detection and Clothing Category Classification

Wenguan Wang, +3 more

- 18 Jun 2018

TL;DR: A knowledge-guided fashion network to solve the problem of visual fashion analysis, e.g., fashion landmark localization and clothing category classification is proposed and Bidirectional Convolutional Recurrent Neural Networks (BCRNNs) are introduced for efficiently approaching message passing over grammar topologies, and producing regularized landmark layouts.

...read moreread less

303

•Journal Article•10.1145/3355089.3356527

StructureNet: hierarchical graph networks for 3D shape generation

Kaichun Mo, +6 more

- 08 Nov 2019

- ACM Transactions on Graphics

TL;DR: In this paper, a hierarchical graph network is proposed to encode shapes represented as n-ary graphs, which can be robustly trained on large and complex shape families, and can be used to generate a great diversity of realistic structured shape geometries.

...read moreread less

250

•Journal Article•10.1145/3306346.3322941

PlanIT: planning and instantiating indoor scenes with relation graph and spatial prior networks

Kai Wang, +5 more

- 12 Jul 2019

- ACM Transactions on Graphics

TL;DR: A new framework for interior scene synthesis that combines a high-level relation graph representation with spatial prior neural networks, and generates scenes of comparable quality to those generated by prior approaches, while also providing the modeling flexibility of the intermediate relationship graph representation.

...read moreread less

237

...

Expand

References

Journal Article•10.1198/TECH.2007.S518

Pattern Recognition and Machine Learning

Radford M. Neal

- 01 Aug 2007

- Technometrics

TL;DR: This book covers a broad range of topics for regular factorial designs and presents all of the material in very mathematical fashion and will surely become an invaluable resource for researchers and graduate students doing research in the design of factorial experiments.

...read moreread less

30.8K

•Book

Pattern Recognition and Machine Learning

Christopher M. Bishop

- 17 Aug 2006

TL;DR: Probability Distributions, linear models for Regression, Linear Models for Classification, Neural Networks, Graphical Models, Mixture Models and EM, Sampling Methods, Continuous Latent Variables, Sequential Data are studied.

...read moreread less

23.4K

•Journal Article•10.1214/AOMS/1177704472

On Estimation of a Probability Density Function and Mode

Emanuel Parzen

- 01 Sep 1962

- Annals of Mathematical Statistics

TL;DR: In this paper, the problem of the estimation of a probability density function and of determining the mode of the probability function is discussed. Only estimates which are consistent and asymptotically normal are constructed.

...read moreread less

11.4K

•Book

Pattern Recognition and Machine Learning (Information Science and Statistics)

Christopher M. Bishop

- 01 Aug 2006

TL;DR: Looking for competent reading resources?

...read moreread less

10.1K

Journal Article•10.1145/357980.358005

An efficient context-free parsing algorithm

Jay Earley

- 01 Feb 1970

- Communications of The ACM

TL;DR: In this article, a parsing algorithm which seems to be the most efficient general context-free algorithm known is described, which is similar to both Knuth's LR(k) algorithm and the familiar top-down algorithm.

...read moreread less

1.7K