1. What strategies are employed to overcome data incompleteness?
In Section 2.3, the researchers describe their strategy to overcome data incompleteness. This step is crucial to ensure that all necessary parameters for procedural modeling are available and accurately documented. Among the strategies proposed, a machine learning approach is used to fill gaps in partially present fields in the dataset. This approach allows the user to choose target fields and their related predictors, enabling greater customization in the prediction process. The techniques employed to infer missing information are discussed, along with the significance and performance of the statistical data completion process. The researchers emphasize the dependency of quantitative results on the available data and compare the performances of algorithms on their dataset, avoiding comparisons with different case studies. This approach ensures that the generated data is accurate and reliable for further analysis and modeling.
read more
2. How do we create a diachronic vector dataset from historical maps?
To create a diachronic vector dataset from historical maps, we employ semantic segmentation to automatically extract building footprints from 10 historical maps published between 1838 and 1947. We vectorize the footprints as high-quality vector data by designing an algorithm specifically adapted for 3D modeling. The 10 maps are combined in a diachronic vector dataset by detecting the first and last appearance of the polygons. Additional details, such as the number of floors, construction materials, and roof types, are incorporated from a secondary source. The maps are first georeferenced manually using GIS software, and then semantically segmented to obtain a diachronic dataset that captures the temporal information regarding the initial and final occurrences of building footprints. The building footprints are obtained from two specific maps: one depicting the Old City in detail (scale 1:2500, from 1947) and one depicting its surroundings (scale 1:5000, from 1938). The semantic segmentation aims to extract the footprints or built-up areas by recognizing the contours on the historical maps. A total of 252 patches are manually annotated from the 10 maps, and 132 patches are selected from the Historical City Maps Semantic Segmentation Dataset. A convolutional neural network is trained for each task, using the dhSegment framework and a simple UNet architecture with a ResNet101 encoder. This step creates two binary masks: the first corresponding to building contours, and the second to built-up areas. The vectorization of 2D geometries is a demanding part of the 3D model creation process, requiring the simplification of geometries to avoid aliasing and the ability to parameterize the level of simplification without affecting local coherence. The vectorization algorithm comprises several steps, including georeferencing, semantic segmentation, and vectorization, which are illustrated in Figure 1 and accompanied by Python code.
read more
3. How is polygon orientation determined?
Polygon orientation is determined using Equation (2), which calculates the orientation of each polygon. The cycles can be reoriented clockwise or counterclockwise based on this calculation. The orientation convention follows the shapefile format, where 'donut holes' are oriented counterclockwise, and the outer cycle is oriented clockwise. This ensures consistency and accuracy in the vector data representation.
read more
4. What parameters are used for procedural modeling of buildings in LOD1?
For LOD1, a set of three parameters is used to define the height of the extrusion: Height (H), Floor Height (H f ), and Number of Floors (N f ). The relationship between these parameters ensures that the extrusion process accurately represents the specified number of floors, maintaining the proportional scaling of the building's vertical dimensions. This parameter interdependence becomes particularly valuable in situations where explicit height information is missing from the dataset, but the number of floors is known. In situations where the height or number of floors is unknown, a predictive method is employed to estimate the number of floors and/or floor height, thereby enabling the calculation of the total height. This approach generates credible height values, even when the dataset lacks specific height information.
read more