SREMIC: Spatial Relation Extraction-based Malware Image Classification

  • Inzamamul Alam
  • , Md Samiullah
  • , Upama Kabir
  • , Simon Woo
  • , Carson K. Leung
  • , Hoang Hai Nguyen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

Around 800,000 people fall prey to cyberattacks annually, most often by 'malware'. Malware has the potential to become a destructive weapon in Cyber-world. It is a difficult task to manually thwart an assault by malware. It is crucial to properly categorize malware binaries in order to identify their origins. Furthermore, malware structure discovery through basic feature extraction approaches are time-consuming and challenging. Malware classification was previously solved using naive machine learning approaches like support vector machine (SVM) and extreme gradient boosting (XGBoost). Recently, deep learning (DL) has shown to be impactful in finding malicious patterns. Without DL, analysis of the vast amounts of available data tends to impossible. Existing methods (e.g., transfer learning, fusion methodology, ensemble learning) may not be effective on actual malware binary files. Moreover, some single image-based malware classification used rudimentary convolutional neural network (CNN) that does not perform well. Faced with these challenges, we propose in this paper a novel model with of a spatial CNN with sufficient regularization and data augmentation that can identify and classify malware in images effectively and efficiently. Our model is evaluated using datasets like MalImg and Microfsoft-Big. The proposed model achieves validation score of 99.93% for MalImg and 99.72% for Microsoft-Big datasets. Our approach outperforms VGG16, VGG19, ResNet50, EfficientNetB1, and Google's Inception v3, including state-of-the-art (SOTA) techniques.

Original languageEnglish
Title of host publicationProceedings of the 2024 18th International Conference on Ubiquitous Information Management and Communication, IMCOM 2024
EditorsSukhan Lee, Hyunseung Choo, Roslan Ismail
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350331011
DOIs
StatePublished - 2024
Event18th International Conference on Ubiquitous Information Management and Communication, IMCOM 2024 - Kuala Lumpur, Malaysia
Duration: 3 Jan 20245 Jan 2024

Publication series

NameProceedings of the 2024 18th International Conference on Ubiquitous Information Management and Communication, IMCOM 2024

Conference

Conference18th International Conference on Ubiquitous Information Management and Communication, IMCOM 2024
Country/TerritoryMalaysia
CityKuala Lumpur
Period3/01/245/01/24

Keywords

  • Convolutional neural network (CNN)
  • Deep learning
  • Malware classification
  • Spatial relation

Fingerprint

Dive into the research topics of 'SREMIC: Spatial Relation Extraction-based Malware Image Classification'. Together they form a unique fingerprint.

Cite this