FEDetect: A Federated Learning-Based Malware Detection and Classification Using Deep Neural Network Algorithms - UTU Research Portal

A1 Refereed original research article in a scientific journal

Authors: Çıplak, Zeki; Yıldız, Kazım; Altınkaya, Sahsene

Publisher: Springer Science and Business Media LLC

Publishing place: HEIDELBERG

Publication year: 2025

Journal: Arabian Journal for Science and Engineering

Journal name in source: Arabian Journal for Science and Engineering

Journal acronym: ARAB J SCI ENG

Number of pages: 28

ISSN: 2193-567X

eISSN: 2191-4281

DOI: https://doi.org/10.1007/s13369-025-10043-x

Publication's open availability at the time of reporting: Open Access

Publication channel's open availability: Partially Open Access publication channel

Web address: https://doi.org/10.1007/s13369-025-10043-x

Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/491507825

Self-archived copy's licence: CC BY

Self-archived copy's version: Publisher's PDF

Abstract

The growing importance of data security in modern information systems extends beyond preventing malicious software and includes the critical topic of data privacy. Centralized data processing in traditional machine learning methods presents significant challenges, including a greater risk of data breaches and attacks on centralized systems. This study addresses the critical issue of maintaining data privacy while achieving effective malware detection and classification. The motivation stems from the growing requirement for robust and privacy-preserving machine learning methodologies in response to rising threats to centralized data systems. Federated learning offers a novel solution that eliminates the requirement for centralized data collection while preserving privacy. In this paper, we investigate the performance of federated learning-based models and compare them with classic non-federated approaches. Using the CIC-MalMem-2022 dataset, we built 22 models with feedforward neural networks and long short-term memory methods, including four non-federated models. The results show that federated learning achieved outstanding performance, with an accuracy of 0.999 in binary classification and 0.845 in multiclass classification, despite different numbers of users. This study contributes significantly to understanding the practical implementation and impact of federated learning. By examining the impact of various factors on classification performance, we highlight the potential of federated learning as a privacy-preserving alternative to centralized machine learning methods, filling a major gap in the field of secure data processing.
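
The abstract does not say which aggregation rule the federated models use. As an illustration only, the sketch below assumes federated averaging (FedAvg), a common choice in which each client trains locally and a server combines the resulting parameters, weighted by each client's sample count, so that raw data never leaves the client. The function name `fed_avg`, the two toy clients, and their sample counts are hypothetical and are not taken from the paper.

```python
import numpy as np

def fed_avg(client_weights, client_sizes):
    """Combine per-client parameter lists into one global list.

    Each client's parameters are weighted by its share of the total
    number of training samples (FedAvg-style aggregation, assumed here).
    """
    total = float(sum(client_sizes))
    n_layers = len(client_weights[0])
    return [
        sum(w[i] * (n / total) for w, n in zip(client_weights, client_sizes))
        for i in range(n_layers)
    ]

# Two hypothetical clients, each holding one weight matrix and one bias vector.
client_a = [np.ones((2, 2)), np.zeros(2)]
client_b = [np.full((2, 2), 3.0), np.full(2, 2.0)]

# Client B has three times as many samples, so it contributes 75% of the average.
global_weights = fed_avg([client_a, client_b], client_sizes=[100, 300])
```

In a full training loop the server would send `global_weights` back to the clients, which would continue local training and repeat the aggregation for several rounds.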

Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.

s13369-025-10043-x.pdf

Funding information in the publication:
Open access funding provided by the Scientific and Technological Research Council of Türkiye (TÜBİTAK). This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.