The major effects of health-related quality of life on 5-year survival prediction among lung cancer survivors: applications of machine learning

  • Jin ah Sim
  • , Young Ae Kim
  • , Ju Han Kim
  • , Jong Mog Lee
  • , Moon Soo Kim
  • , Young Mog Shim
  • , Jae Ill Zo
  • , Young Ho Yun

Research output: Contribution to journalArticlepeer-review

56 Scopus citations

Abstract

The primary goal of this study was to evaluate the major roles of health-related quality of life (HRQOL) in a 5-year lung cancer survival prediction model using machine learning techniques (MLTs). The predictive performances of the models were compared with data from 809 survivors who underwent lung cancer surgery. Each of the modeling technique was applied to two feature sets: feature set 1 included clinical and sociodemographic variables, and feature set 2 added HRQOL factors to the variables from feature set 1. One of each developed prediction model was trained with the decision tree (DT), logistic regression (LR), bagging, random forest (RF), and adaptive boosting (AdaBoost) methods, and then, the best algorithm for modeling was determined. The models’ performances were compared using fivefold cross-validation. For feature set 1, there were no significant differences in model accuracies (ranging from 0.647 to 0.713). Among the models in feature set 2, the AdaBoost and RF models outperformed the other prognostic models [area under the curve (AUC) = 0.850, 0.898, 0.981, 0.966, and 0.949 for the DT, LR, bagging, RF and AdaBoost models, respectively] in the test set. Overall, 5-year disease-free lung cancer survival prediction models with MLTs that included HRQOL as well as clinical variables improved predictive performance.

Original languageEnglish
Article number10693
JournalScientific Reports
Volume10
Issue number1
DOIs
StatePublished - 1 Dec 2020

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Fingerprint

Dive into the research topics of 'The major effects of health-related quality of life on 5-year survival prediction among lung cancer survivors: applications of machine learning'. Together they form a unique fingerprint.

Cite this