Journal Article10.2352/ei.2024.36.6.iriacv-276
Synthetic Data Generation for AI-based Machine Vision Applications
F. Seiler,Verena Eichinger,Ira Effenberger +2 more
TL;DR: A method for synthesizing sensor data for machine vision tasks is presented. It generates realistic images and annotations for object detection, segmentation, and pose estimation. The method uses physically based rendering techniques and incorporates material properties and lighting conditions. It also introduces synthetic defects for quality control applications.
read more
Abstract: This paper presents a method for synthesizing 2D and 3D sensor data for various machine vision tasks.Depending on the task, different processing steps can be applied to a 3D model of an object.For object detection, segmentation and pose estimation, random object arrangements are generated automatically.In addition, objects can be virtually deformed in order to create realistic images of non-rigid objects.For automatic visual inspection, synthetic defects are introduced into the objects.Thus sensor-realistic datasets with typical object defects for quality control applications can be created, even in the absence of defective parts.The simulation of realistic images uses physically based rendering techniques.Material properties and different lighting situations are taken into account in the 3D models.The resulting tuples of 2D images and their ground truth annotations can be used to train a machine learning model, which is subsequently applied to real data.In order to minimize the reality gap, a random parameter set is selected for each image, resulting in images with high variety.Considering the use cases damage detection and object detection, it has been shown that a machine learning model trained only on synthetic data can also achieve very good results on real data.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Figures

Figure 4. The original object model is bent (left and middle) and twisted (right). 
Figure 5. Synthetic scene of a random arrangement of syringes in a box. 
Figure 6. Simplified generation process for notches using a Boolean operator. 
Figure 2. Comparison of real camera images (top row) and synthetically generated images (bottom row). It is obvious that the differences between the real and synthetic data are minimal. Furthermore, the diversity present in the real data is also represented in the synthetic data. 
Figure 3. Basic structure of the washer shader (left) and different scratch patterns (middle and right). 
Table 1. Overview Results of the visual inspection
References
EfficientDet: Scalable and Efficient Object Detection
Mingxing Tan,Ruoming Pang,Quoc V. Le +2 more
- 14 Jun 2020
TL;DR: EfficientDetD7 as discussed by the authors proposes a weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multi-scale feature fusion, and a compound scaling method that uniformly scales the resolution, depth, and width for all backbone, feature network, and box/class prediction networks at the same time.
Domain randomization for transferring deep neural networks from simulation to the real world
Josh Tobin,Rachel Fong,Alex Ray,Jonas Schneider,Wojciech Zaremba,Pieter Abbeel +5 more
- 20 Mar 2017
TL;DR: This paper explores domain randomization, a simple technique for training models on simulated images that transfer to real images by randomizing rendering in the simulator, and achieves the first successful transfer of a deep neural network trained only on simulated RGB images to the real world for the purpose of robotic control.
3.5K
•Book
Physically Based Rendering: From Theory to Implementation
Matt Pharr,Greg Humphreys +1 more
- 28 Sep 2004
TL;DR: Physically Based Rendering: From Theory to Implementation, Third Edition, describes both the mathematical theory behind a modern photorealistic rendering system and its practical implementation through a method known as 'literate programming', which serves as an essential resource on physically-based rendering.
2K
Domain adaptation for object recognition: An unsupervised approach
Raghuraman Gopalan,Ruonan Li,Rama Chellappa +2 more
- 06 Nov 2011
TL;DR: This paper presents one of the first studies on unsupervised domain adaptation in the context of object recognition, where data has been labeled only from the source domain (and therefore do not have correspondences between object categories across domains).
1.3K