Journal Article10.1145/2661229.2661243
Creating consistent scene graphs using a probabilistic grammar
Tianqiang Liu,Siddhartha Chaudhuri,Vladimir G. Kim,Qixing Huang,Niloy J. Mitra,Thomas Funkhouser +5 more
- 19 Nov 2014
- Vol. 33, Iss: 6, pp 211
TL;DR: The proposed algorithms can be used to provide consistent hierarchies for large collections of scenes within the same semantic class, and outperform alternative approaches that consider only shape similarities and/or spatial relationships without hierarchy.
read more
Abstract: Growing numbers of 3D scenes in online repositories provide new opportunities for data-driven scene understanding, editing, and synthesis. Despite the plethora of data now available online, most of it cannot be effectively used for data-driven applications because it lacks consistent segmentations, category labels, and/or functional groupings required for co-analysis. In this paper, we develop algorithms that infer such information via parsing with a probabilistic grammar learned from examples. First, given a collection of scene graphs with consistent hierarchies and labels, we train a probabilistic hierarchical grammar to represent the distributions of shapes, cardinalities, and spatial relationships of semantic objects within the collection. Then, we use the learned grammar to parse new scenes to assign them segmentations, labels, and hierarchies consistent with the collection. During experiments with these algorithms, we find that: they work effectively for scene graphs for indoor scenes commonly found online (bedrooms, classrooms, and libraries); they outperform alternative approaches that consider only shape similarities and/or spatial relationships without hierarchy; they require relatively small sets of training data; they are robust to moderate over-segmentation in the inputs; and, they can robustly transfer labels from one data set to another. As a result, the proposed algorithms can be used to provide consistent hierarchies for large collections of scenes within the same semantic class.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
ShapeNet: An Information-Rich 3D Model Repository
Angel X. Chang,Thomas Funkhouser,Leonidas J. Guibas,Pat Hanrahan,Qixing Huang,Zimo Li,Silvio Savarese,Manolis Savva,Shuran Song,Hao Su,Jianxiong Xiao,Li Yi,Fisher Yu +12 more
TL;DR: ShapeNet contains 3D models from a multitude of semantic categories and organizes them under the WordNet taxonomy, a collection of datasets providing many semantic annotations for each 3D model such as consistent rigid alignments, parts and bilateral symmetry planes, physical sizes, keywords, as well as other planned annotations.
•Proceedings Article
Learning Pose Grammar to Encode Human Body Configuration for 3D Pose Estimation
Hao-Shu Fang,Yuanlu Xu,Wenguan Wang,Xiaobai Liu,Song-Chun Zhu +4 more
- 27 Apr 2018
TL;DR: This paper proposes a pose grammar to tackle the problem of 3D human pose estimation, which takes 2D pose as input and learns a generalized 2D-3D mapping function and enforces high-level constraints over human poses.
453
Attentive Fashion Grammar Network for Fashion Landmark Detection and Clothing Category Classification
Wenguan Wang,Yuanlu Xu,Jianbing Shen,Song-Chun Zhu +3 more
- 18 Jun 2018
TL;DR: A knowledge-guided fashion network to solve the problem of visual fashion analysis, e.g., fashion landmark localization and clothing category classification is proposed and Bidirectional Convolutional Recurrent Neural Networks (BCRNNs) are introduced for efficiently approaching message passing over grammar topologies, and producing regularized landmark layouts.
StructureNet: hierarchical graph networks for 3D shape generation
TL;DR: In this paper, a hierarchical graph network is proposed to encode shapes represented as n-ary graphs, which can be robustly trained on large and complex shape families, and can be used to generate a great diversity of realistic structured shape geometries.
PlanIT: planning and instantiating indoor scenes with relation graph and spatial prior networks
TL;DR: A new framework for interior scene synthesis that combines a high-level relation graph representation with spatial prior neural networks, and generates scenes of comparable quality to those generated by prior approaches, while also providing the modeling flexibility of the intermediate relationship graph representation.
References
Pattern Recognition and Machine Learning
TL;DR: This book covers a broad range of topics for regular factorial designs and presents all of the material in very mathematical fashion and will surely become an invaluable resource for researchers and graduate students doing research in the design of factorial experiments.
30.8K
•Book
Pattern Recognition and Machine Learning
Christopher M. Bishop
- 17 Aug 2006
TL;DR: Probability Distributions, linear models for Regression, Linear Models for Classification, Neural Networks, Graphical Models, Mixture Models and EM, Sampling Methods, Continuous Latent Variables, Sequential Data are studied.
On Estimation of a Probability Density Function and Mode
TL;DR: In this paper, the problem of the estimation of a probability density function and of determining the mode of the probability function is discussed. Only estimates which are consistent and asymptotically normal are constructed.
•Book
Pattern Recognition and Machine Learning (Information Science and Statistics)
Christopher M. Bishop
- 01 Aug 2006
TL;DR: Looking for competent reading resources?
10.1K
An efficient context-free parsing algorithm
TL;DR: In this article, a parsing algorithm which seems to be the most efficient general context-free algorithm known is described, which is similar to both Knuth's LR(k) algorithm and the familiar top-down algorithm.