Synthetic Data Generation for AI-based Machine Vision Applications

doi:10.2352/ei.2024.36.6.iriacv-276

Journal Article10.2352/ei.2024.36.6.iriacv-276

Synthetic Data Generation for AI-based Machine Vision Applications

F. Seiler, +2 more

- 21 Jan 2024

- IS&T International Symposium on Electron...

- Vol. 36, Iss: 6, pp 276-5

TL;DR: A method for synthesizing sensor data for machine vision tasks is presented. It generates realistic images and annotations for object detection, segmentation, and pose estimation. The method uses physically based rendering techniques and incorporates material properties and lighting conditions. It also introduces synthetic defects for quality control applications.

Abstract: This paper presents a method for synthesizing 2D and 3D sensor data for various machine vision tasks.Depending on the task, different processing steps can be applied to a 3D model of an object.For object detection, segmentation and pose estimation, random object arrangements are generated automatically.In addition, objects can be virtually deformed in order to create realistic images of non-rigid objects.For automatic visual inspection, synthetic defects are introduced into the objects.Thus sensor-realistic datasets with typical object defects for quality control applications can be created, even in the absence of defective parts.The simulation of realistic images uses physically based rendering techniques.Material properties and different lighting situations are taken into account in the 3D models.The resulting tuples of 2D images and their ground truth annotations can be used to train a machine learning model, which is subsequently applied to real data.In order to minimize the reality gap, a random parameter set is selected for each image, resulting in images with high variety.Considering the use cases damage detection and object detection, it has been shown that a machine learning model trained only on synthetic data can also achieve very good results on real data.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Figures

Figure 4. The original object model is bent (left and middle) and twisted (right).

Figure 5. Synthetic scene of a random arrangement of syringes in a box.

Figure 6. Simplified generation process for notches using a Boolean operator.

Figure 2. Comparison of real camera images (top row) and synthetically generated images (bottom row). It is obvious that the differences between the real and synthetic data are minimal. Furthermore, the diversity present in the real data is also represented in the synthetic data.

Figure 3. Basic structure of the washer shader (left) and different scratch patterns (middle and right).

Table 1. Overview Results of the visual inspection

References

•Proceedings Article•10.1109/CVPR42600.2020.01079

EfficientDet: Scalable and Efficient Object Detection

Mingxing Tan, +2 more

- 14 Jun 2020

TL;DR: EfficientDetD7 as discussed by the authors proposes a weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multi-scale feature fusion, and a compound scaling method that uniformly scales the resolution, depth, and width for all backbone, feature network, and box/class prediction networks at the same time.

...read moreread less

7.2K

•Proceedings Article•10.1109/IROS.2017.8202133

Domain randomization for transferring deep neural networks from simulation to the real world

Josh Tobin, +5 more

- 20 Mar 2017

TL;DR: This paper explores domain randomization, a simple technique for training models on simulated images that transfer to real images by randomizing rendering in the simulator, and achieves the first successful transfer of a deep neural network trained only on simulated RGB images to the real world for the purpose of robotic control.

...read moreread less

3.5K

•Book

Physically Based Rendering: From Theory to Implementation

Matt Pharr, +1 more

- 28 Sep 2004

TL;DR: Physically Based Rendering: From Theory to Implementation, Third Edition, describes both the mathematical theory behind a modern photorealistic rendering system and its practical implementation through a method known as 'literate programming', which serves as an essential resource on physically-based rendering.

...read moreread less

2K

Proceedings Article•10.1109/ICCV.2011.6126344

Domain adaptation for object recognition: An unsupervised approach

Raghuraman Gopalan, +2 more

- 06 Nov 2011

TL;DR: This paper presents one of the first studies on unsupervised domain adaptation in the context of object recognition, where data has been labeled only from the source domain (and therefore do not have correspondences between object categories across domains).

...read moreread less

1.3K

•Journal Article•10.21105/joss.04901

BlenderProc2: A Procedural Pipeline for Photorealistic Rendering

Maximilian Denninger, +7 more

- 20 Feb 2023

- The Journal of Open Source Software

78