Malmö University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
FedCSD: A Federated Learning Based Approach for Code-Smell Detection
Blekinge Inst Technol, Dept Comp Sci, S-37179 Karlskrona, Sweden; Univ Santiago de Compostela, Comp Graph & Data Engn COGRADE Res Grp, Santiago De Compostela 15705, Spain.
Al Balqa Appl Univ, Prince Abdullah bin Ghazi Fac Informat & Commun Te, Software Engn Dept, As Salt 19117, Jordan.ORCID iD: 0000-0002-3182-418X
Malmö University, Faculty of Technology and Society (TS), Department of Computer Science and Media Technology (DVMT). Malmö University, Internet of Things and People (IOTAP).ORCID iD: 0000-0002-8025-4734
Blekinge Inst Technol, Dept Comp Sci, S-37179 Karlskrona, Sweden.
Show others and affiliations
2024 (English)In: IEEE Access, E-ISSN 2169-3536, Vol. 12, p. 44888-44904Article in journal (Refereed) Published
Abstract [en]

Software quality is critical, as low quality, or "Code smell," increases technical debt and maintenance costs. There is a timely need for a collaborative model that detects and manages code smells by learning from diverse and distributed data sources while respecting privacy and providing a scalable solution for continuously integrating new patterns and practices in code quality management. However, the current literature is still missing such capabilities. This paper addresses the previous challenges by proposing a Federated Learning Code Smell Detection (FedCSD) approach, specifically targeting "God Class," to enable organizations to train distributed ML models while safeguarding data privacy collaboratively. We conduct experiments using manually validated datasets to detect and analyze code smell scenarios to validate our approach. Experiment 1, a centralized training experiment, revealed varying accuracies across datasets, with dataset two achieving the lowest accuracy (92.30%) and datasets one and three achieving the highest (98.90% and 99.5%, respectively). Experiment 2, focusing on cross-evaluation, showed a significant drop in accuracy (lowest: 63.80%) when fewer smells were present in the training dataset, reflecting technical debt. Experiment 3 involved splitting the dataset across 10 companies, resulting in a global model accuracy of 98.34%, comparable to the centralized model's highest accuracy. The application of federated ML techniques demonstrates promising performance improvements in code-smell detection, benefiting both software developers and researchers.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024. Vol. 12, p. 44888-44904
Keywords [en]
Software quality, technical debit, federated learning, privacy-preserving, code smell detection
National Category
Software Engineering
Identifiers
URN: urn:nbn:se:mau:diva-66923DOI: 10.1109/ACCESS.2024.3380167ISI: 001193664800001Scopus ID: 2-s2.0-85189169469OAI: oai:DiVA.org:mau-66923DiVA, id: diva2:1854566
Available from: 2024-04-26 Created: 2024-04-26 Last updated: 2024-09-03Bibliographically approved

Open Access in DiVA

fulltext(2285 kB)131 downloads
File information
File name FULLTEXT01.pdfFile size 2285 kBChecksum SHA-512
74c244f54d5a0719cdbb6f8a6503d31b26e3c9e6895dae0ea671590a1a102310548c525a890cee4dac6deda2363093527cb9da64dbbf02750a5df4357c6c068a
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Alawadi, SadiAlkhabbas, Fahed

Search in DiVA

By author/editor
Alawadi, SadiAlkharabsheh, KhalidAlkhabbas, FahedPalomba, FabioAwad, Mohammed
By organisation
Department of Computer Science and Media Technology (DVMT)Internet of Things and People (IOTAP)
In the same journal
IEEE Access
Software Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 131 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 339 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf