TY - JOUR
T1 - Machine learning-derived model for predicting poor post-treatment quality of life in Korean cancer survivors
AU - Choe, Yu Hyeon
AU - Lee, Sujee
AU - Lim, Yooseok
AU - Kim, Soo Hyun
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
PY - 2024/3
Y1 - 2024/3
N2 - Purpose: A substantial number of cancer survivors have poor quality of life (QOL) even after completing cancer treatment. Thus, in this study, we used machine learning (ML) to develop predictive models for poor QOL in post-treatment cancer survivors in South Korea. Methods: This cross-sectional study used online survey data from 1,005 post-treatment cancer survivors in South Korea. The outcome variable was QOL, which was measured using the global QOL subscale of the European Organization of Cancer and Treatment for Cancer Quality of Life Questionnaire, where a global QOL score < 60.4 was defined as poor QOL. Three ML models (random forest (RF), support vector machine, and extreme gradient boosting) and three deep learning models were used to develop predictive models for poor QOL. Model performance regarding accuracy, area under the receiver operating characteristic curve, F1 score, precision, and recall was evaluated. The SHapely Additive exPlanation (SHAP) method was used to identify important features. Results: Of the 1,005 participants, 65.1% had poor QOL. Among the six models, the RF model had the best performance (accuracy = 0.85, F1 = 0.90). The SHAP method revealed that survivorship concerns (e.g., distress, pain, and fatigue) were the most important factors that affected poor QOL. Conclusions: The ML-based prediction model developed to predict poor QOL in Korean post-treatment cancer survivors showed good accuracy. The ML model proposed in this study can be used to support clinical decision-making in identifying survivors at risk of poor QOL.
AB - Purpose: A substantial number of cancer survivors have poor quality of life (QOL) even after completing cancer treatment. Thus, in this study, we used machine learning (ML) to develop predictive models for poor QOL in post-treatment cancer survivors in South Korea. Methods: This cross-sectional study used online survey data from 1,005 post-treatment cancer survivors in South Korea. The outcome variable was QOL, which was measured using the global QOL subscale of the European Organization of Cancer and Treatment for Cancer Quality of Life Questionnaire, where a global QOL score < 60.4 was defined as poor QOL. Three ML models (random forest (RF), support vector machine, and extreme gradient boosting) and three deep learning models were used to develop predictive models for poor QOL. Model performance regarding accuracy, area under the receiver operating characteristic curve, F1 score, precision, and recall was evaluated. The SHapely Additive exPlanation (SHAP) method was used to identify important features. Results: Of the 1,005 participants, 65.1% had poor QOL. Among the six models, the RF model had the best performance (accuracy = 0.85, F1 = 0.90). The SHAP method revealed that survivorship concerns (e.g., distress, pain, and fatigue) were the most important factors that affected poor QOL. Conclusions: The ML-based prediction model developed to predict poor QOL in Korean post-treatment cancer survivors showed good accuracy. The ML model proposed in this study can be used to support clinical decision-making in identifying survivors at risk of poor QOL.
KW - Cancer survivors
KW - Machine learning
KW - Prediction model
KW - Quality of life
UR - https://www.scopus.com/pages/publications/85184396195
U2 - 10.1007/s00520-024-08347-z
DO - 10.1007/s00520-024-08347-z
M3 - Article
C2 - 38315224
AN - SCOPUS:85184396195
SN - 0941-4355
VL - 32
JO - Supportive Care in Cancer
JF - Supportive Care in Cancer
IS - 3
M1 - 143
ER -