ScaleFusionNet: transformer-guided multi-scale feature fusion for skin lesion segmentationShow others and affiliations
2025 (English)In: Scientific Reports, E-ISSN 2045-2322, Vol. 15, no 1, article id 34393
Article in journal (Refereed) Published
Abstract [en]
Melanoma is a malignant tumor that originates from skin cell lesions. Accurate and efficient segmentation of skin lesions is essential for quantitative analysis but remains a challenge owing to blurred lesion boundaries, gradual color changes, and irregular shapes. To address this, we propose ScaleFusionNet, a hybrid model that integrates a Cross-Attention Transformer Module (CATM) and adaptive fusion block (AFB) to enhance feature extraction and fusion by capturing both local and global features. We introduce CATM, which utilizes Swin transformer blocks and Cross Attention Fusion (CAF) to adaptively refine feature fusion and reduce semantic gaps in the encoder-decoder to improve segmentation accuracy. Additionally, the AFB uses Swin Transformer-based attention and deformable convolution-based adaptive feature extraction to help the model gather local and global contextual information through parallel pathways. This enhancement refines the lesion boundaries and preserves fine-grained details. ScaleFusionNet achieves Dice scores of 92.94%, 91.80%, and 95.37% on the ISIC-2016, ISIC-2018, and HAM10000 datasets, respectively, demonstrating its effectiveness in skin lesion analysis. Simultaneously, independent validation experiments were conducted on the PH<sup>2</sup> dataset using the pretrained model weights. The results show that ScaleFusionNet demonstrates significant performance improvements compared with other state-of-the-art methods. Our code implementation is publicly available at https://github.com/sqbqamar/ScaleFusionNet.
Place, publisher, year, edition, pages
Springer Nature , 2025. Vol. 15, no 1, article id 34393
Keywords [en]
feature enhancement, Image Segmentation, Information fusion, Skin Lesion, Transformer
National Category
Computer graphics and computer vision Medical Imaging
Identifiers
URN: urn:nbn:se:kth:diva-372357DOI: 10.1038/s41598-025-17300-xISI: 001587011800006PubMedID: 41038982Scopus ID: 2-s2.0-105017626854OAI: oai:DiVA.org:kth-372357DiVA, id: diva2:2011880
Note
QC 20251106
2025-11-062025-11-062025-11-06Bibliographically approved