TY - JOUR
T1 - Analysis of high-dimensional genomic data employing a novel bio-inspired algorithm
AU - Baliarsingh, Santos Kumar
AU - Vipsita, Swati
AU - Muhammad, Khan
AU - Dash, Bodhisattva
AU - Bakshi, Sambit
N1 - Publisher Copyright:
© 2019 Elsevier B.V.
PY - 2019/4
Y1 - 2019/4
N2 - Over the last decade, there has been a rapid growth in the generation and analysis of the genomics data. Though the existing data analysis methods are capable of handling a particular problem, they cannot guarantee to solve all problems with different nature. Therefore, there always lie a scope of a new algorithm to solve a problem which cannot be efficiently solved by the existing algorithms. In the present work, a novel hybrid approach is proposed based on the improved version of a recently developed bio-inspired optimization technique, namely, salp swarm algorithm (SSA) for microarray classification. Initially, the Fisher score filter is employed to pre-select a subset of relevant genes from the original high-dimensional microarray dataset. Later, a weighted-chaotic SSA (WCSSA) is proposed for the simultaneous optimal gene selection and parameter optimization of the kernel extreme learning machine (KELM) classifier. The proposed scheme is experimented on both binary-class and multi-class microarray datasets. An extensive comparison is performed against original SSA-KELM, particle swarm optimized-KELM (PSO-KELM), and genetic algorithm-KELM (GA-KELM). Lastly, the proposed method is also compared against the results of sixteen existing techniques to emphasize its capacity and competitiveness to successfully reduce the number of original genes by more than 98%. The experimental results show that the genes selected by the proposed method yield higher classification accuracy compared to the alternative techniques. The performance of the proposed scheme demonstrates its effectiveness in terms of number of selected genes (NSG), accuracy, sensitivity, specificity, Matthews correlation coefficient (MCC), and F-measure. The proposed WCSSA-KELM method is validated using a ten-fold cross-validation technique.
AB - Over the last decade, there has been a rapid growth in the generation and analysis of the genomics data. Though the existing data analysis methods are capable of handling a particular problem, they cannot guarantee to solve all problems with different nature. Therefore, there always lie a scope of a new algorithm to solve a problem which cannot be efficiently solved by the existing algorithms. In the present work, a novel hybrid approach is proposed based on the improved version of a recently developed bio-inspired optimization technique, namely, salp swarm algorithm (SSA) for microarray classification. Initially, the Fisher score filter is employed to pre-select a subset of relevant genes from the original high-dimensional microarray dataset. Later, a weighted-chaotic SSA (WCSSA) is proposed for the simultaneous optimal gene selection and parameter optimization of the kernel extreme learning machine (KELM) classifier. The proposed scheme is experimented on both binary-class and multi-class microarray datasets. An extensive comparison is performed against original SSA-KELM, particle swarm optimized-KELM (PSO-KELM), and genetic algorithm-KELM (GA-KELM). Lastly, the proposed method is also compared against the results of sixteen existing techniques to emphasize its capacity and competitiveness to successfully reduce the number of original genes by more than 98%. The experimental results show that the genes selected by the proposed method yield higher classification accuracy compared to the alternative techniques. The performance of the proposed scheme demonstrates its effectiveness in terms of number of selected genes (NSG), accuracy, sensitivity, specificity, Matthews correlation coefficient (MCC), and F-measure. The proposed WCSSA-KELM method is validated using a ten-fold cross-validation technique.
KW - Classification
KW - Fisher score
KW - Kernel extreme learning machine (KELM)
KW - Microarray
KW - Salp swarm optimization algorithm (SSA)
UR - https://www.scopus.com/pages/publications/85061329819
U2 - 10.1016/j.asoc.2019.01.007
DO - 10.1016/j.asoc.2019.01.007
M3 - Article
AN - SCOPUS:85061329819
SN - 1568-4946
VL - 77
SP - 520
EP - 532
JO - Applied Soft Computing
JF - Applied Soft Computing
ER -