TY - JOUR
T1 - Classifying apartment defect repair tasks in South Korea
T2 - a machine learning approach
AU - Kim, Eunhye
AU - Ji, Honggeun
AU - Kim, Jina
AU - Park, Eunil
N1 - Publisher Copyright:
© 2021 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group on behalf of the Architectural Institute of Japan, Architectural Institute of Korea and Architectural Society of China.
PY - 2022
Y1 - 2022
N2 - Managing building defects in the residential environment is an important social issue in South Korea. Therefore, most South Korean construction companies devote a large amount of human resources and economic costs in managing such defects. This paper proposes a machine learning approach for investigating whether a specific defect can be autonomously categorized into one of the categories of repair tasks. To this end, we employed a dataset of 310,044 defect cases (from 656,266 validated cases of 717,550 total collected cases). Three machine learning classifiers (support vector machine, random forest, and logistic regression) with three word embedding methods (bag-of-words, term frequency-inverse document frequency, and Word2Vec) were employed for the classification tasks. The highest yielded results showed more than 99% accuracy, precision, recall, and F1-scores for the random forest classifier with the Word2Vec embedding. Finally, based on these findings, the implications and limitations of this study are discussed. Representatively, the findings of this research can improve the defect management effectiveness of the apartment construction industry in South Korea. Moreover, to contribute to future research, we have made the dataset publicly available.
AB - Managing building defects in the residential environment is an important social issue in South Korea. Therefore, most South Korean construction companies devote a large amount of human resources and economic costs in managing such defects. This paper proposes a machine learning approach for investigating whether a specific defect can be autonomously categorized into one of the categories of repair tasks. To this end, we employed a dataset of 310,044 defect cases (from 656,266 validated cases of 717,550 total collected cases). Three machine learning classifiers (support vector machine, random forest, and logistic regression) with three word embedding methods (bag-of-words, term frequency-inverse document frequency, and Word2Vec) were employed for the classification tasks. The highest yielded results showed more than 99% accuracy, precision, recall, and F1-scores for the random forest classifier with the Word2Vec embedding. Finally, based on these findings, the implications and limitations of this study are discussed. Representatively, the findings of this research can improve the defect management effectiveness of the apartment construction industry in South Korea. Moreover, to contribute to future research, we have made the dataset publicly available.
KW - Apartment defect
KW - machine learning
KW - repair task
UR - https://www.scopus.com/pages/publications/85117262648
U2 - 10.1080/13467581.2021.1972808
DO - 10.1080/13467581.2021.1972808
M3 - Article
AN - SCOPUS:85117262648
SN - 1346-7581
VL - 21
SP - 2503
EP - 2510
JO - Journal of Asian Architecture and Building Engineering
JF - Journal of Asian Architecture and Building Engineering
IS - 6
ER -