kth.sePublications KTH
Change search
Link to record
Permanent link

Direct link
Publications (10 of 11) Show all publications
Yang, Z., Liu, Y., Smedby, Ö. & Moreno, R. (2026). Combining shallow and deep neural networks on pseudo-color enhanced images for digital breast tomosynthesis lesion classification. Frontiers in Digital Health, 7, Article ID 1705044.
Open this publication in new window or tab >>Combining shallow and deep neural networks on pseudo-color enhanced images for digital breast tomosynthesis lesion classification
2026 (English)In: Frontiers in Digital Health, E-ISSN 2673-253X, Vol. 7, article id 1705044Article in journal (Refereed) Published
Abstract [en]

Introduction: The classification of lesion types in Digital Breast Tomosynthesis (DBT) images is crucial for the early diagnosis of breast cancer. However, the task remains challenging due to the complexity of breast tissue and the subtle nature of lesions. To alleviate radiologists’ workload, computer-aided diagnosis (CAD) systems have been developed. The breast lesion regions vary in size and complexity, which leads to performance degradation. Methods: To tackle this problem, we propose a novel DBT Dual-Net architecture comprising two complementary neural network branches that extract both low-level and high-level features. By fusing different-level feature representations, the model can better capture subtle structure. Furthermore, we introduced a pseudo-color enhancement procedure to improve the visibility of lesions on DBT. Moreover, most existing DBT classification studies rely on two-dimensional (2D) slice-level analysis, neglecting the rich three-dimensional (3D) spatial context within DBT volumes. To address this limitation, we used majority voting for image-level classification from predictions across slices. Results: We evaluated our method on a public DBT dataset and compared its performance with several existing classification approaches. The results showed that our method outperforms baseline models. Discussion: The use of pseudo-color enhancement, extracting high and low-level features and inter-slice majority voting proposed method is effective for lesion classification in DBT. The code is available at https://github.com/xiaoerlaigeid/DBT-Dual-Net.

Place, publisher, year, edition, pages
Frontiers Media SA, 2026
Keywords
computer aided diagnosis, deep learning, digital breast tomosynthesis, dual-branch network, pseudo-color enhancement
National Category
Medical Imaging Radiology and Medical Imaging
Identifiers
urn:nbn:se:kth:diva-376504 (URN)10.3389/fdgth.2025.1705044 (DOI)001667578600001 ()41586202 (PubMedID)2-s2.0-105028595083 (Scopus ID)
Note

QC 20260219

Available from: 2026-02-19 Created: 2026-02-19 Last updated: 2026-02-19Bibliographically approved
Huang, Z., Wang, H., Ye, J., Ji, Y., Hu, X., Liu, L., . . . He, J. (2026). MedSegAgent: A Universal and Scalable Multi-Agent System for Instructive Medical Image Segmentation. IEEE journal of biomedical and health informatics
Open this publication in new window or tab >>MedSegAgent: A Universal and Scalable Multi-Agent System for Instructive Medical Image Segmentation
Show others...
2026 (English)In: IEEE journal of biomedical and health informatics, ISSN 2168-2194, E-ISSN 2168-2208Article in journal (Refereed) Epub ahead of print
Abstract [en]

Medical image segmentation is vital for clinical diagnosis and treatment; however, current solutions face three major limitations: (1) the lack of a universal framework capable of handling diverse modalities and anatomical targets, (2) the limited scalability to adapt to evolving clinical needs and new datasets, and (3) the lack of instructive interfaces that make models usable for non-expert users. To address these challenges, this paper presents MedSegAgent, a universal and scalable multi-agent system for instructive medical image segmentation. Specifically, MedSegAgent comprises five agents: one query parsing agent that processes natural language requests, three coarse-to-fine filtering agents (modality filtering, anatomical filtering, and label selection) for identifying relevant datasets and label values, and one execution agent responsible for model inference and result integration. Based on this framework, MedSegAgent utilizes 23 diverse datasets and pre-trained models to perform 343 types of segmentation across various modalities and anatomical targets. Experimental results demonstrate that MedSegAgent simplifies model selection while maintaining high performance, accurately identifying matching datasets and labels in 94.27% of queries and locating at least one suitable match in 99.03% of queries. MedSegAgent offers a universal and scalable solution for diverse medical image segmentation tasks, bridging the gap between user-friendly queries and the complexities of model selection and deployment.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2026
Keywords
Multi-Agent System, Natural Language Instruction, Universal Medical Image Segmentation
National Category
Computer Sciences Computer graphics and computer vision
Identifiers
urn:nbn:se:kth:diva-380198 (URN)10.1109/JBHI.2026.3677444 (DOI)41880261 (PubMedID)2-s2.0-105034444689 (Scopus ID)
Note

QC 20260424

Available from: 2026-04-24 Created: 2026-04-24 Last updated: 2026-04-24Bibliographically approved
Yue, X., Zhao, Q., Liu, X., Li, J., Bai, J., Song, C., . . . Fu, G. (2026). Morphology-enhanced CAM-guided SAM for weakly supervised breast lesion segmentation. Biomedical Signal Processing and Control, 116, Article ID 109509.
Open this publication in new window or tab >>Morphology-enhanced CAM-guided SAM for weakly supervised breast lesion segmentation
Show others...
2026 (English)In: Biomedical Signal Processing and Control, ISSN 1746-8094, E-ISSN 1746-8108, Vol. 116, article id 109509Article in journal (Refereed) Published
Abstract [en]

Ultrasound imaging is vital for the early detection of breast cancer, where accurate lesion segmentation supports clinical diagnosis and treatment planning. However, existing deep learning-based methods rely on pixel-level annotations, which are costly and labor-intensive to obtain. This study presents a weakly supervised framework for breast lesion segmentation in ultrasound images. The framework combines morphological enhancement with Class Activation Map (CAM)-guided lesion localization and utilizes the Segment Anything Model (SAM) for refined segmentation without pixel-level labels. By adopting a lightweight region synthesis strategy and relying solely on SAM inference, the proposed approach substantially reduces model complexity and computational cost while maintaining high segmentation accuracy. Experimental results on the BUSI dataset show that our method achieves a Dice coefficient of 0.7063 under five-fold cross-validation and outperforms several fully supervised models in Hausdorff distance metrics. These results demonstrate that the proposed framework effectively balances segmentation accuracy, computational efficiency, and annotation cost, offering a practical and low-complexity solution for breast ultrasound analysis. The code for this study is available at: https://github.com/YueXin18/MorSeg-CAM-SAM-Segmentation.

Place, publisher, year, edition, pages
Elsevier BV, 2026
Keywords
SAM, Segmentation, Ultrasound images, Weakly supervised learning
National Category
Medical Imaging Computer graphics and computer vision
Identifiers
urn:nbn:se:kth:diva-375929 (URN)10.1016/j.bspc.2026.109509 (DOI)001664691600001 ()2-s2.0-105027210880 (Scopus ID)
Note

QC 20260127

Available from: 2026-01-27 Created: 2026-01-27 Last updated: 2026-01-27Bibliographically approved
Wang, D., Huang, Z., Zhang, J., Wu, W., Yang, Z. & Gu, L. (2025). Airway segmentation using Uncertainty-based Double Attention Detail Supplement Network. Biomedical Signal Processing and Control, 105, Article ID 107648.
Open this publication in new window or tab >>Airway segmentation using Uncertainty-based Double Attention Detail Supplement Network
Show others...
2025 (English)In: Biomedical Signal Processing and Control, ISSN 1746-8094, E-ISSN 1746-8108, Vol. 105, article id 107648Article in journal (Refereed) Published
Abstract [en]

Automatic pulmonary airway segmentation from thoracic computed tomography (CT) is an essential step for the diagnosis and interventional surgical treatment of pulmonary disease. While deep learning algorithms have shown promising results in segmenting the main and larger bronchi, segmentation of the distal small bronchi remains challenging due to their limited size and divergent spatial distribution. The study aims to address the challenges associated with segmenting the pulmonary airway, particularly focusing on the distal small bronchi. Specifically, we aim to improve the accuracy and completeness of airway segmentation by developing a novel deep-learning model. To achieve this purpose, we propose an Uncertainty-based Double Attention Detail Supplement Network (UDADS-Net) to identify and supply these missing details of the airway. We introduce the Uncertainty-based Double Attention Module (UDA), which utilizes the uncertainty-based attention module to obtain the regions with high uncertainty and utilizes another attention module to identify the missing details. Moreover, we also propose the Adaptive Multi-scale Module (AMS) to optimize the process of extracting details. Evaluation of our method on the ATM’2022 airway segmentation dataset demonstrates its effectiveness, especially for segmenting distal small bronchi. Our method significantly reduces missing and fragmented parts, leading to more accurate and complete airway segmentation, and achieving higher evaluation metrics compared to the state-of-the-art (SOTA) methods.

Place, publisher, year, edition, pages
Elsevier BV, 2025
Keywords
Attention mechanism, Network uncertainty, Pulmonary airway, Segmentation
National Category
Medical Imaging Respiratory Medicine and Allergy Computer graphics and computer vision
Identifiers
urn:nbn:se:kth:diva-360177 (URN)10.1016/j.bspc.2025.107648 (DOI)001427360700001 ()2-s2.0-85217428637 (Scopus ID)
Note

QC 20250220

Available from: 2025-02-19 Created: 2025-02-19 Last updated: 2025-12-08Bibliographically approved
Yang, Z., Astaraki, M., Smedby, Ö. & Moreno, R. (2025). Efficient Generation of Synthetic Breast CT Slices By Combining Generative and Super-Resolution Models. In: Artificial Intelligence and Imaging for Diagnostic and Treatment Challenges in Breast Care - 1st Deep Breast Workshop, Deep-Breath 2024, Held in Conjunction with MICCAI 2024, Proceedings: . Paper presented at 1st Deep Breast Workshop on AI and Imaging for Diagnostic and Treatment Challenges in Breast Care, Deep-Breath 2024, Marrakesh, Morocco, Oct 10 2024 - Oct 10 2024 (pp. 65-74). Springer Nature
Open this publication in new window or tab >>Efficient Generation of Synthetic Breast CT Slices By Combining Generative and Super-Resolution Models
2025 (English)In: Artificial Intelligence and Imaging for Diagnostic and Treatment Challenges in Breast Care - 1st Deep Breast Workshop, Deep-Breath 2024, Held in Conjunction with MICCAI 2024, Proceedings, Springer Nature , 2025, p. 65-74Conference paper, Published paper (Refereed)
Abstract [en]

High-quality synthetic medical images can enlarge training datasets in different deep learning-based applications. Recently, diffusion-based methods for image synthesis have outperformed GAN-based methods, even for medical images. Unfortunately, using diffusion models is costly in terms of training time and computational resources. We propose a two-stage method that combines diffusion models and GANs to tackle this problem. First, we use diffusion models or GANs to generate low-resolution images. Then, we use a GAN-based super-resolution model to interpolate high-resolution images from these low-resolution images. Experimental results on synthetic breast CT slices show that the proposed framework is more efficient and performs better than state-of-the-art methods that generate the images in a single step. The proposed methods will be available at https://github.com/xiaoerlaigeid/Image-Frequency-Score.git.

Place, publisher, year, edition, pages
Springer Nature, 2025
Keywords
Diffusion Model, Frequency Information, Generative Adversarial Network, Medical Image Generation, Super-Resolution
National Category
Medical Imaging Signal Processing Computer graphics and computer vision Probability Theory and Statistics
Identifiers
urn:nbn:se:kth:diva-361151 (URN)10.1007/978-3-031-77789-9_7 (DOI)001544124300007 ()2-s2.0-85219213535 (Scopus ID)
Conference
1st Deep Breast Workshop on AI and Imaging for Diagnostic and Treatment Challenges in Breast Care, Deep-Breath 2024, Marrakesh, Morocco, Oct 10 2024 - Oct 10 2024
Note

Part of ISBN 9783031777882

QC 20250313

Available from: 2025-03-12 Created: 2025-03-12 Last updated: 2025-12-08Bibliographically approved
Zhuo, Y., Liu, M., Liu, J., Yang, Z., Liu, R., Xue, P. & Gu, L. (2025). FIND: A Framework for Iterative to Non-Iterative Distillation for Lightweight Deformable Registration. IEEE journal of biomedical and health informatics, 29(8), 5722-5735
Open this publication in new window or tab >>FIND: A Framework for Iterative to Non-Iterative Distillation for Lightweight Deformable Registration
Show others...
2025 (English)In: IEEE journal of biomedical and health informatics, ISSN 2168-2194, E-ISSN 2168-2208, Vol. 29, no 8, p. 5722-5735Article in journal (Refereed) Published
Abstract [en]

Deformable image registration is crucial for medical image analysis, yet the complexity of deep learning networks often limits their deployment on resource-limited devices. Current distillation methods in registration tasks fail to effectively transfer complex deformation handling capabilities to non-iterative lightweight networks, leading to insignificant performance improvement. To address this, we propose the Framework for Iterative to Non-iterative Distillation (FIND), which efficiently transfers these capabilities to a Non-Iterative Lightweight (NIL) network. FIND employs a dual-step process: first, using recurrent distillation to derive a high-performance non-iterative teacher assistant from an iterative network; second, using advanced feature distillation from the assistant to the lightweight network. This enables NIL to perform rapid, effective registration on resource-limited devices. Experiments across four datasets show that NIL can achieve up to 60 times faster performance on CPU and 89 times on GPU than compared deep learning methods, with superior registration accuracy improvements of up to 3.5 points in Dice scores.

Place, publisher, year, edition, pages
IEEE, 2025
Keywords
Image registration, knowledge distillation, lightweight network, time efficiency, resource limited
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-373489 (URN)10.1109/jbhi.2025.3556676 (DOI)001548121100032 ()40168217 (PubMedID)2-s2.0-105002036584 (Scopus ID)
Note

QC 20251201

Available from: 2025-12-01 Created: 2025-12-01 Last updated: 2025-12-01Bibliographically approved
Gao, J., Chen, M., Wang, Y., Yang, Z., Tang, L., Xu, P. & Chen, W. (2025). Force density method's energy principle and application in membrane-cable-strut-beam hybrid structures. Journal of Building Engineering, 99, Article ID 111523.
Open this publication in new window or tab >>Force density method's energy principle and application in membrane-cable-strut-beam hybrid structures
Show others...
2025 (English)In: Journal of Building Engineering, E-ISSN 2352-7102, Vol. 99, article id 111523Article in journal (Refereed) Published
Abstract [en]

The force density method (FDM) is a classical method for form finding and static analysis of membrane-cable-strut-beam hybrid structures (HS). This study focuses on the energy principle of FDM, and establishes the potential energies for HS in the form-finding and static analysis stages. The HS model is discretized into link, T and beam elements. Equilibrium equations and stiffness matrices are formulated using the principle of stationary potential energy. In the integrated analysis, the compatibility between the link and beam element is resolved by adopting (x, y, z, θx, θy, θz) as degrees of freedom (DOF) and transforming the beam's DOF into coordinate differences. The global matrix is assembled using nodal global DOF numbers, showing a computational advantage over the conventional topological matrix method. FDM has demonstrated consistency in form-finding and static analysis stages by applying constant and elastic force densities. The algorithm has been implemented in a program called TMCAD with all the calculation details illustrated, and FDM's effectiveness is verified through five HS examples.

Place, publisher, year, edition, pages
Elsevier BV, 2025
Keywords
Energy principle, Force density method, Form finding, Hybrid structure, Static analysis
National Category
Applied Mechanics Computational Mathematics
Identifiers
urn:nbn:se:kth:diva-357936 (URN)10.1016/j.jobe.2024.111523 (DOI)001383324900001 ()2-s2.0-85211357978 (Scopus ID)
Note

QC 20241219

Available from: 2024-12-19 Created: 2024-12-19 Last updated: 2025-01-28Bibliographically approved
Huang, Z., Ye, J., Wang, H., Deng, Z., Yang, Z., Su, Y., . . . He, J. (2025). Revisiting model scaling with a U-net benchmark for 3D medical image segmentation. Scientific Reports, 15(1), Article ID 29795.
Open this publication in new window or tab >>Revisiting model scaling with a U-net benchmark for 3D medical image segmentation
Show others...
2025 (English)In: Scientific Reports, E-ISSN 2045-2322, Vol. 15, no 1, article id 29795Article in journal (Refereed) Published
Abstract [en]

Are larger models always better for 3D medical image segmentation? Despite the widespread adoption of 3D U-Net in various medical imaging tasks, this critical question remains underexplored. To challenge the common assumption, we systematically benchmark 18 U-Net variants—adjusting resolution stages, depth, and width—across 42 diverse public datasets. Our findings reveal that the answer is no: optimal architectures are highly task-specific, with smaller models often performing competitively. Specifically, we identify three key insights: (1) increasing resolution stages provides limited benefits for datasets with larger voxel spacing; (2) deeper networks offer limited advantages for anatomically complex shapes; and (3) wider networks provide minimal advantages for tasks with limited segmentation classes. Based on these insights, we provide practical guidelines for optimizing U-Net architectures according to dataset characteristics. Our findings highlight the limitations of the“bigger is better”paradigm while establishing a framework for balancing performance and computational efficiency in 3D medical image segmentation tasks.

Place, publisher, year, edition, pages
Nature Research, 2025
National Category
Medical Imaging
Identifiers
urn:nbn:se:kth:diva-369353 (URN)10.1038/s41598-025-15617-1 (DOI)001552512000012 ()40813440 (PubMedID)2-s2.0-105013313813 (Scopus ID)
Note

QC 20250904

Available from: 2025-09-04 Created: 2025-09-04 Last updated: 2025-09-04Bibliographically approved
Yang, Z., Xiao, Y., Öktem, O., Smedby, Ö. & Moreno, R. (2025). Two-Stage Convolutional Neural Network for Breast CT Reconstruction. In: Medical Imaging 2025: Physics of Medical Imaging: . Paper presented at Medical Imaging 2025: Physics of Medical Imaging, San Diego, United States of America, Feb 17 2025 - Feb 21 2025. SPIE-Intl Soc Optical Eng, Article ID 1340544.
Open this publication in new window or tab >>Two-Stage Convolutional Neural Network for Breast CT Reconstruction
Show others...
2025 (English)In: Medical Imaging 2025: Physics of Medical Imaging, SPIE-Intl Soc Optical Eng , 2025, article id 1340544Conference paper, Published paper (Refereed)
Abstract [en]

In this study, we propose a deep learning based two-stage breast CT reconstruction in the image domain. Unlike most methods, we use two separate models to improve the Breast CT image quality. In the first stage, a deep learning-based denoiser was used to remove the noise. In the second stage, a deep learning based image enhancement model is used to improve the image quality. We evaluated the proposed method on the AAPM 2021 sparse view CT reconstruction challenge dataset.1 The experimental results demonstrate that the proposed method performs better than all comparison methods.

Place, publisher, year, edition, pages
SPIE-Intl Soc Optical Eng, 2025
Keywords
Breast CT, Image Denoise, Image Enhancement, Sparse-view CT reconstruction, Two stage method
National Category
Computer graphics and computer vision Medical Imaging Signal Processing
Identifiers
urn:nbn:se:kth:diva-363749 (URN)10.1117/12.3048825 (DOI)001487074500128 ()2-s2.0-105004584141 (Scopus ID)
Conference
Medical Imaging 2025: Physics of Medical Imaging, San Diego, United States of America, Feb 17 2025 - Feb 21 2025
Note

 Part of ISBN 9781510685888

QC 20250523

Available from: 2025-05-21 Created: 2025-05-21 Last updated: 2025-07-04Bibliographically approved
Yang, Z., Fan, T., Smedby, Ö. & Moreno, R. (2024). 3D Breast Ultrasound Image Classification Using 2.5D Deep learning. In: 17th International Workshop on Breast Imaging, IWBI 2024: . Paper presented at 17th International Workshop on Breast Imaging, IWBI 2024, Chicago, United States of America, Jun 9 2024 - Jun 12 2024. SPIE, 13174, Article ID 131741R.
Open this publication in new window or tab >>3D Breast Ultrasound Image Classification Using 2.5D Deep learning
2024 (English)In: 17th International Workshop on Breast Imaging, IWBI 2024, SPIE , 2024, Vol. 13174, article id 131741RConference paper, Published paper (Refereed)
Abstract [en]

The 3D breast ultrasound is a radiation-free and effective imaging technology for breast tumor diagnosis. However, checking the 3D breast ultrasound is time-consuming compared to mammograms. To reduce the workload of radiologists, we proposed a 2.5D deep learning-based breast ultrasound tumor classification system. First, we used the pre-trained STU-Net to finetune and segment the tumor in 3D. Then, we fine-tuned the DenseNet-121 for classification using the 10 slices with the biggest tumoral area and their adjacent slices. The Tumor Detection, Segmentation, and Classification on Automated 3D Breast Ultrasound (TDSC-ABUS) MICCAI Challenge 2023 dataset was used to train and validate the performance of the proposed method. Compared to a 3D convolutional neural network model and radiomics, our proposed method has better performance.

Place, publisher, year, edition, pages
SPIE, 2024
Series
Proceedings of SPIE - The International Society for Optical Engineering, ISSN 0277-786X ; 13174
Keywords
2.5D, 3D Breast Ultrasound, Deep learning, Tumor Classification
National Category
Radiology, Nuclear Medicine and Medical Imaging
Identifiers
urn:nbn:se:kth:diva-348289 (URN)10.1117/12.3025534 (DOI)001239315300062 ()2-s2.0-85195360791 (Scopus ID)
Conference
17th International Workshop on Breast Imaging, IWBI 2024, Chicago, United States of America, Jun 9 2024 - Jun 12 2024
Note

QC 20240624

Part of ISBN 978-151068020-3

Available from: 2024-06-20 Created: 2024-06-20 Last updated: 2024-07-05Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0009-0005-5560-1684

Search in DiVA

Show all publications