TY - JOUR
T1 - Analysis of high-dimensional biomedical data using an evolutionary multi-objective emperor penguin optimizer
AU - Baliarsingh, Santos Kumar
AU - Vipsita, Swati
AU - Muhammad, Khan
AU - Bakshi, Sambit
N1 - Publisher Copyright:
© 2019 Elsevier B.V.
PY - 2019/8
Y1 - 2019/8
N2 - Over the last two decades, there has been an expeditious expansion in the generation and exploration of high-dimensional biomedical data. Identification of biomarkers from the genomics data poses a significant challenge in microarray data analysis. Therefore, for the methodical analysis of the genomics dataset, it is paramount to develop some effective algorithms. In this work, a multi-objective version of the emperor penguin optimization (EPO)algorithm with chaos, namely, multi-objective chaotic EPO (MOCEPO)is proposed. The suggested approach extends the original continuous single objective EPO to a competent binary multi-objective model. The objectives are to minimize the number of selected genes (NSG)and to maximize the classification accuracy (CA). In this work, Fisher score and minimum redundancy maximum relevance (mRMR)are independently used as initial filters. Further, the proposed MOCEPO is employed for the simultaneous optimal feature selection and cancer classification. The proposed algorithm is successfully experimented on seven well-known high-dimensional binary-class as well as multi-class datasets. To evaluate the effectiveness, the proposed method is compared with non-dominated sorting genetic algorithm (NSGA-II), multi-objective particle swarm optimization (MOPSO), chaotic version of GA for multi-objective optimization (CGAMO), and chaotic MOPSO methods. The experimental results show that the proposed framework achieves better CA with minimum NSG compared to the existing schemes. The presented approach exhibits its efficacy with regard to NSG, accuracy, sensitivity, specificity, and F-measure.
AB - Over the last two decades, there has been an expeditious expansion in the generation and exploration of high-dimensional biomedical data. Identification of biomarkers from the genomics data poses a significant challenge in microarray data analysis. Therefore, for the methodical analysis of the genomics dataset, it is paramount to develop some effective algorithms. In this work, a multi-objective version of the emperor penguin optimization (EPO)algorithm with chaos, namely, multi-objective chaotic EPO (MOCEPO)is proposed. The suggested approach extends the original continuous single objective EPO to a competent binary multi-objective model. The objectives are to minimize the number of selected genes (NSG)and to maximize the classification accuracy (CA). In this work, Fisher score and minimum redundancy maximum relevance (mRMR)are independently used as initial filters. Further, the proposed MOCEPO is employed for the simultaneous optimal feature selection and cancer classification. The proposed algorithm is successfully experimented on seven well-known high-dimensional binary-class as well as multi-class datasets. To evaluate the effectiveness, the proposed method is compared with non-dominated sorting genetic algorithm (NSGA-II), multi-objective particle swarm optimization (MOPSO), chaotic version of GA for multi-objective optimization (CGAMO), and chaotic MOPSO methods. The experimental results show that the proposed framework achieves better CA with minimum NSG compared to the existing schemes. The presented approach exhibits its efficacy with regard to NSG, accuracy, sensitivity, specificity, and F-measure.
KW - Fisher score
KW - Kernel ridge regression
KW - Microarray
KW - MOCEPO
KW - mRMR
UR - https://www.scopus.com/pages/publications/85065512442
U2 - 10.1016/j.swevo.2019.04.010
DO - 10.1016/j.swevo.2019.04.010
M3 - Article
AN - SCOPUS:85065512442
SN - 2210-6502
VL - 48
SP - 262
EP - 273
JO - Swarm and Evolutionary Computation
JF - Swarm and Evolutionary Computation
ER -