TY - JOUR
T1 - A robust ensemble framework for anticancer peptide classification using multi-model voting approach
AU - Abbas, Zeeshan
AU - Kim, Sunyeup
AU - Lee, Nangkyeong
AU - Kazmi, Syed Aadil Waheed
AU - Lee, Seung Won
N1 - Publisher Copyright:
© 2025
PY - 2025/4
Y1 - 2025/4
N2 - Anticancer peptides (ACPs) hold great potential for cancer therapeutics, yet accurately identifying them remains a challenging task due to the complexity of peptide sequences and their interactions with biological systems. In this study, we propose a novel machine learning-based framework for ACP classification, integrating multiple feature sets, including sequence composition, physicochemical properties, and embedding features derived from pre-trained language models. We evaluate the performance of various classifiers on benchmark datasets and compare our model against state-of-the-art methods. The results demonstrate that our model outperforms existing methods such as UniDL4BioPep, ACPred-Fuse, and iACP with an accuracy of 75.58%, an AUC of 0.8272, and an MCC of 0.5119. Our approach provides a more balanced sensitivity of 0.7384 and specificity of 0.773, ensuring robust identification of both ACPs and non-ACPs. These findings suggest that incorporating diverse feature sets can significantly enhance ACP classification, potentially facilitating the discovery of novel anticancer peptides for therapeutic applications.
AB - Anticancer peptides (ACPs) hold great potential for cancer therapeutics, yet accurately identifying them remains a challenging task due to the complexity of peptide sequences and their interactions with biological systems. In this study, we propose a novel machine learning-based framework for ACP classification, integrating multiple feature sets, including sequence composition, physicochemical properties, and embedding features derived from pre-trained language models. We evaluate the performance of various classifiers on benchmark datasets and compare our model against state-of-the-art methods. The results demonstrate that our model outperforms existing methods such as UniDL4BioPep, ACPred-Fuse, and iACP with an accuracy of 75.58%, an AUC of 0.8272, and an MCC of 0.5119. Our approach provides a more balanced sensitivity of 0.7384 and specificity of 0.773, ensuring robust identification of both ACPs and non-ACPs. These findings suggest that incorporating diverse feature sets can significantly enhance ACP classification, potentially facilitating the discovery of novel anticancer peptides for therapeutic applications.
KW - Anticancer peptide
KW - Bioinformatics
KW - Machine learning
KW - Motifs
KW - Voting classifier
UR - https://www.scopus.com/pages/publications/85218913919
U2 - 10.1016/j.compbiomed.2025.109750
DO - 10.1016/j.compbiomed.2025.109750
M3 - Article
C2 - 40032410
AN - SCOPUS:85218913919
SN - 0010-4825
VL - 188
JO - Computers in Biology and Medicine
JF - Computers in Biology and Medicine
M1 - 109750
ER -