A deep natural language processing‐based method for ontology learning of project‐specific properties from building information models
14
TL;DR: The results show that the proposed approach incorporating reading comprehension of definitions outperforms the existing name similarity‐based methods in automatic ontology modeling of property concepts.
read more
Abstract: Element property is a crucial aspect of building information modeling (BIM) for almost all BIM‐based engineering tasks. Since there are limited properties predefined in Industry Foundation Classes (IFC) specifications, a vast number of property concepts were customized and stored in BIM models, which lack labor‐intensive data modeling and alignment for effective information management and reuse. To tackle the challenge, this study presents a natural language understanding (NLU)‐based method for the automatic ontological knowledge modeling of project‐specific property concepts from BIM models. A soft pattern matching model was used to acquire contextual definitions of concepts from a domain corpus before applying deep NLU models to transform the concept names and definitions into dense vector representations. These outputs were then fed into two stacking ensemble learning models to carry out two tasks: (a) classifying whether an unseen concept overlaps with the IFC ontology, and (b) aligning the repetitive concepts with the most relevant concepts in the ontology. Finally, all fresh properties were appended to an IFC ontology, either as new objects or new synonyms. The performance was evaluated based on 327 property concepts from real‐life BIM models. The results show that the proposed approach incorporating reading comprehension of definitions outperforms the existing name similarity‐based methods. Finally, a case study on a renovation project demonstrates the effectiveness of this study in automatic ontology modeling of property concepts.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Text mining and natural language processing in construction
Alireza Shamshiri,Kyeong Rok Ryu,June Young Park +2 more
TL;DR: This systematic review of 205 publications on text mining and natural language processing in construction identifies gaps and future directions, highlighting potential research opportunities in automation, data integration, and leveraging pre-trained models for construction management.
39
Comprehensive digital twin for infrastructure: A novel ontology and graph-based modelling paradigm
Tao Li,Yi Rui,Hehua Zhu,Linyuan Lü,Xiaojun Li +4 more
8
BIM and IFC Data Readiness for AI Integration in the Construction Industry: A Review Approach
TL;DR: This systematic review of 93 articles identifies common data types, frameworks, and conversion methods for integrating BIM and AI in construction, highlighting barriers in IFC data support, geometric information extraction, and toolchain limitations, with data readiness at an intermediate level.
5
Built environment defect mapping, modeling, and management(D3M): A BIM-based integrated framework
Junjie Chen,Weisheng Lu,Donghai Liu +2 more
- 01 Feb 2024
TL;DR: This paper proposes a BIM-based framework for integrated defect mapping, modeling, and management (D3M) in the built environment, leveraging BIM's geometric-semantic information to enhance defect inspection, documentation, and management in a data-driven manner.
2
Graph Database and Matrix-Based Intelligent Generation of the Assembly Sequence of Prefabricated Building Components
Bin Yang,Xinlong Li,Miaosi Dong,Ding Zhu,Yilong Han +4 more
TL;DR: A framework for intelligently generating assembly sequences of prefabricated building components based on graph databases and matrices is proposed. The framework utilizes adjacency and interference matrices to describe connections and constraints, and a genetic algorithm is employed for optimization.
2
References
Glove: Global Vectors for Word Representation
Jeffrey Pennington,Richard Socher,Christopher D. Manning +2 more
- 01 Oct 2014
TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.
FaceNet: A Unified Embedding for Face Recognition and Clustering
TL;DR: FaceNet as discussed by the authors uses a deep convolutional network trained to directly optimize the embedding itself, rather than an intermediate bottleneck layer as in previous deep learning approaches, and achieves state-of-the-art face recognition performance using only 128 bytes per face.
14.2K
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Nils Reimers,Iryna Gurevych +1 more
- 14 Aug 2019
TL;DR: Sentence-BERT (SBERT), a modification of the pretrained BERT network that use siamese and triplet network structures to derive semantically meaningful sentence embeddings that can be compared using cosine-similarity is presented.
Deep contextualized word representations
Matthew E. Peters,Mark Neumann,Mohit Iyyer,Matt Gardner,Christopher Clark,Kenton Lee,Luke Zettlemoyer +6 more
- 15 Feb 2018
TL;DR: This paper introduced a new type of deep contextualized word representation that models both complex characteristics of word use (e.g., syntax and semantics), and how these uses vary across linguistic contexts (i.e., to model polysemy).
•Book
Foundations of Statistical Natural Language Processing
Christopher D. Manning,Hinrich Schütze +1 more
- 28 May 1999
TL;DR: This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear and provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations.
Related Papers (5)
Ana Marisa Salgueiro,Catarina Alves,João Balsa +2 more
- 24 Sep 2018
Henrik Bulskov,Rasmus Knappe,Troels Andreasen +2 more
- 24 Jun 2004