An optimized Stacking Ensemble Framework for Malware Detection Using XGBoost and Random Forest

Authors

  • Mohammed M Elsheh Department of Computer Science  Libyan Academy  Misrata Libya
  • Fatima A Aboudabous Department of Computer Science  Libyan Academy Misrata  Libya

Keywords:

XGBoost, Feature Selection, Stacking Ensemble, CICMalDroid 2020, SMOTE

Abstract

. Android devices are increasingly targeted by malware due to their widespread adoption and open ecosystem. Traditional signature-based detection methods are insufficient for evolving threats, highlighting the need for intelligent approaches. This study proposes an enhanced Android malware classification framework using XGBoost, Random Forest for feature selection, and a stacking ensemble. The CICMalDroid 2020 dataset was preprocessed through outlier removal, robust scaling, binary transformation, and SMOTE-based class balancing. Experimental results show that the stacking ensemble achieves the highest detection performance with 98.14% accuracy, 93.62% F1-score, and 99.63% ROC-AUC, outperforming individual models. Additionally, using RF as features selector  improves computational efficiency, reducing training and testing times. These findings demonstrate that integrating advanced machine learning techniques enhances both the effectiveness and efficiency of Android malware detection.

Author Biography

Fatima A Aboudabous, Department of Computer Science  Libyan Academy Misrata  Libya

 

 

 

References

Y. Zhou and X. Jiang, “Dissecting Android malware: Characterization and evolution,” Proc. IEEE Symposium on Security and Privacy, San Francisco, CA, USA, pp. 95–109, 2012.

A. Raval and M. Anwar, “Android malware detection: An empirical investigation into machine learning classifiers,” in Proc. 2024 IEEE Int. Conf. on Information Reuse and Integration for Data Science (IRI), 2024, pp. 180–187, doi: 10.1109/IRI62200.2024.00039

F. Saleem, A. R. Javed, F. Iqbal, A. Castiglione, X. Chang, and A. Almomani, “Android malware detection using feature ranking of permissions,” arXiv preprint, arXiv:2201.08468, 2022.

D. B. Ansori, J. Slamet, M. Z. Ghufron, A. Kurniawan, and D. I. Sensuse, “Android malware classification using gain ratio and ensembled machine learning,” Int. J. Safety and Security Engineering, vol. 14, no. 1, pp. 245–254, 2024, doi: 10.18280/ijsse.140126.

C. Palma, A. Ferreira, and M. Figueiredo, “Explainable machine learning for malware detection on Android applications,” Information, vol. 15, no. 1, Art. no. 25, 2024, doi: 10.3390/info15010025.

Wang, X., Zhang, L., Zhao, K., Ding, X., & Yu, M. (2022). MFDroid: A stacking ensemble learning framework for Android malware detection. Sensors, 22(7), 2597.‏

R. Adriansyah, P. Sukarno, and A. A. Wardana, “Android malware detection using ensemble learning and feature selection with insights from SHAP explainable AI,” in Proc. Int. Seminar on Intelligent Technology and Its Applications (ISITIA), 2024, pp. 1–6, doi: 10.1109/ISCMI63661.2024.10851666.

E. Alsharif and M. Alharby, “An ensemble machine learning approach for detecting and classifying malware attacks on mobile devices,” Arabian Journal for Science and Engineering, 2025, doi: 10.1007/s13369-025-10011-5.

A. Alhogail and R. A. Alharbi, “Effective machine learning-based Android malware detection and categorization,” Electronics, vol. 14, no. 8, Art. no. 1486, 2025, doi: 10.3390/electronics14081486.

Downloads

Published

2026-01-02

How to Cite

Elsheh, M. M., & Aboudabous, F. A. (2026). An optimized Stacking Ensemble Framework for Malware Detection Using XGBoost and Random Forest. Journal of Academic Research, 30(01), 43–51. Retrieved from https://lam-journal.ly/index.php/jar/article/view/1346

Issue

Section

العلوم الهندسية والتطبيقية

Most read articles by the same author(s)